KEGG   Homo sapiens (human): 1282
Entry
1282              CDS       T01001                                 
Symbol
COL4A1, BSVD, BSVD1, COL4A1s, PADMAL, RATOR
Name
(RefSeq) collagen type IV alpha 1 chain
  KO
K06237  collagen type IV alpha
Organism
hsa  Homo sapiens (human)
Pathway
hsa04151  PI3K-Akt signaling pathway
hsa04510  Focal adhesion
hsa04512  ECM-receptor interaction
hsa04820  Cytoskeleton in muscle cells
hsa04926  Relaxin signaling pathway
hsa04933  AGE-RAGE signaling pathway in diabetic complications
hsa04974  Protein digestion and absorption
hsa05146  Amoebiasis
hsa05165  Human papillomavirus infection
hsa05200  Pathways in cancer
hsa05222  Small cell lung cancer
Network
nt06539  Cytoskeleton in muscle cells
  Element
N01814  Extracellular matrix - Basal lamina
Disease
H00579  Hereditary angiopathy with nephropathy, aneurysms, and muscle cramps (HANAC)
H00839  Porencephaly
H00877  Brain small vessel disease
H02718  Autosomal dominant pontine microangiopathy and leukoencephalopathy
Brite
KEGG Orthology (KO) [BR:hsa00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    1282 (COL4A1)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    1282 (COL4A1)
 09140 Cellular Processes
  09142 Cell motility
   04820 Cytoskeleton in muscle cells
    1282 (COL4A1)
 09150 Organismal Systems
  09152 Endocrine system
   04926 Relaxin signaling pathway
    1282 (COL4A1)
  09154 Digestive system
   04974 Protein digestion and absorption
    1282 (COL4A1)
 09160 Human Diseases
  09161 Cancer: overview
   05200 Pathways in cancer
    1282 (COL4A1)
  09162 Cancer: specific types
   05222 Small cell lung cancer
    1282 (COL4A1)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    1282 (COL4A1)
  09174 Infectious disease: parasitic
   05146 Amoebiasis
    1282 (COL4A1)
  09167 Endocrine and metabolic disease
   04933 AGE-RAGE signaling pathway in diabetic complications
    1282 (COL4A1)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   04147 Exosome [BR:hsa04147]
    1282 (COL4A1)
   00536 Glycosaminoglycan binding proteins [BR:hsa00536]
    1282 (COL4A1)
Exosome [BR:hsa04147]
 Exosomal proteins
  Exosomal proteins of other cancer cells
   1282 (COL4A1)
Glycosaminoglycan binding proteins [BR:hsa00536]
 Heparan sulfate / Heparin
  Extracellular matrix molecules
   1282 (COL4A1)
SSDB
Motif
Pfam: Collagen C4
Other DBs
NCBI-GeneID: 1282
NCBI-ProteinID: NP_001836
OMIM: 120130
HGNC: 2202
Ensembl: ENSG00000187498
Pharos: P02462(Tbio)
UniProt: P02462
Structure
LinkDB
Position
13:complement(110148963..110307157)
AA seq 1669 aa
MGPRLSVWLLLLPAALLLHEEHSRAAAKGGCAGSGCGKCDCHGVKGQKGERGLPGLQGVI
GFPGMQGPEGPQGPPGQKGDTGEPGLPGTKGTRGPPGASGYPGNPGLPGIPGQDGPPGPP
GIPGCNGTKGERGPLGPPGLPGFAGNPGPPGLPGMKGDPGEILGHVPGMLLKGERGFPGI
PGTPGPPGLPGLQGPVGPPGFTGPPGPPGPPGPPGEKGQMGLSFQGPKGDKGDQGVSGPP
GVPGQAQVQEKGDFATKGEKGQKGEPGFQGMPGVGEKGEPGKPGPRGKPGKDGDKGEKGS
PGFPGEPGYPGLIGRQGPQGEKGEAGPPGPPGIVIGTGPLGEKGERGYPGTPGPRGEPGP
KGFPGLPGQPGPPGLPVPGQAGAPGFPGERGEKGDRGFPGTSLPGPSGRDGLPGPPGSPG
PPGQPGYTNGIVECQPGPPGDQGPPGIPGQPGFIGEIGEKGQKGESCLICDIDGYRGPPG
PQGPPGEIGFPGQPGAKGDRGLPGRDGVAGVPGPQGTPGLIGQPGAKGEPGEFYFDLRLK
GDKGDPGFPGQPGMPGRAGSPGRDGHPGLPGPKGSPGSVGLKGERGPPGGVGFPGSRGDT
GPPGPPGYGPAGPIGDKGQAGFPGGPGSPGLPGPKGEPGKIVPLPGPPGAEGLPGSPGFP
GPQGDRGFPGTPGRPGLPGEKGAVGQPGIGFPGPPGPKGVDGLPGDMGPPGTPGRPGFNG
LPGNPGVQGQKGEPGVGLPGLKGLPGLPGIPGTPGEKGSIGVPGVPGEHGAIGPPGLQGI
RGEPGPPGLPGSVGSPGVPGIGPPGARGPPGGQGPPGLSGPPGIKGEKGFPGFPGLDMPG
PKGDKGAQGLPGITGQSGLPGLPGQQGAPGIPGFPGSKGEMGVMGTPGQPGSPGPVGAPG
LPGEKGDHGFPGSSGPRGDPGLKGDKGDVGLPGKPGSMDKVDMGSMKGQKGDQGEKGQIG
PIGEKGSRGDPGTPGVPGKDGQAGQPGQPGPKGDPGISGTPGAPGLPGPKGSVGGMGLPG
TPGEKGVPGIPGPQGSPGLPGDKGAKGEKGQAGPPGIGIPGLRGEKGDQGIAGFPGSPGE
KGEKGSIGIPGMPGSPGLKGSPGSVGYPGSPGLPGEKGDKGLPGLDGIPGVKGEAGLPGT
PGPTGPAGQKGEPGSDGIPGSAGEKGEPGLPGRGFPGFPGAKGDKGSKGEVGFPGLAGSP
GIPGSKGEQGFMGPPGPQGQPGLPGSPGHATEGPKGDRGPQGQPGLPGLPGPMGPPGLPG
IDGVKGDKGNPGWPGAPGVPGPKGDPGFQGMPGIGGSPGITGSKGDMGPPGVPGFQGPKG
LPGLQGIKGDQGDQGVPGAKGLPGPPGPPGPYDIIKGEPGLPGPEGPPGLKGLQGLPGPK
GQQGVTGLVGIPGPPGIPGFDGAPGQKGEMGPAGPTGPRGFPGPPGPDGLPGSMGPPGTP
SVDHGFLVTRHSQTIDDPQCPSGTKILYHGYSLLYVQGNERAHGQDLGTAGSCLRKFSTM
PFLFCNINNVCNFASRNDYSYWLSTPEPMPMSMAPITGENIRPFISRCAVCEAPAMVMAV
HSQTIQIPPCPSGWSSLWIGYSFVMHTSAGAEGSGQALASPGSCLEEFRSAPFIECHGRG
TCNYYANAYSFWLATIERSEMFKKPTPSTLKAGELRTHVSRCQVCMRRT
NT seq 5010 nt   +upstreamnt  +downstreamnt
atggggccccggctcagcgtctggctgctgctgctgcccgccgcccttctgctccacgag
gagcacagccgggccgctgcgaagggtggctgtgctggctctggctgtggcaaatgtgac
tgccatggagtgaagggacaaaagggtgaaagaggcctcccggggttacaaggtgtcatt
gggtttcctggaatgcaaggacctgaggggccacagggaccaccaggacaaaagggtgat
actggagaaccaggactacctggaacaaaagggacaagaggacctccgggagcatctggc
taccctggaaacccaggacttcccggaattcctggccaagacggcccgccaggcccccca
ggtattccaggatgcaatggcacaaagggggagagagggccgctcgggcctcctggcttg
cctggtttcgctggaaatcccggaccaccaggcttaccagggatgaagggtgatccaggt
gagatacttggccatgtgcccgggatgctgttgaaaggtgaaagaggatttcccggaatc
ccagggactccaggcccaccaggactgccagggcttcaaggtcctgttgggcctccagga
tttaccggaccaccaggtcccccaggccctcccggccctccaggtgaaaagggacaaatg
ggcttaagttttcaaggaccaaaaggtgacaagggtgaccaaggggtcagtgggcctcca
ggagtaccaggacaagctcaagttcaagaaaaaggagacttcgccaccaagggagaaaag
ggccaaaaaggtgaacctggatttcaggggatgccaggggtcggagagaaaggtgaaccc
ggaaaaccaggacccagaggcaaacccggaaaagatggtgacaaaggggaaaaagggagt
cccggttttcctggtgaacccgggtacccaggactcataggccgccagggcccgcaggga
gaaaagggtgaagcaggtcctcctggcccacctggaattgttataggcacaggacctttg
ggagaaaaaggagagaggggctaccctggaactccggggccaagaggagagccaggccca
aaaggtttcccaggactaccaggccaacccggacctccaggcctccctgtacctgggcag
gctggtgcccctggcttccctggtgaaagaggagaaaaaggtgaccgaggatttcctggt
acatctctgccaggaccaagtggaagagatgggctcccgggtcctcctggttcccctggg
ccccctgggcagcctggctacacaaatggaattgtggaatgtcagcccggacctccaggt
gaccagggtcctcctggaattccagggcagccaggatttataggcgaaattggagagaaa
ggtcaaaaaggagagagttgcctcatctgtgatatagacggatatcgggggcctcccggg
ccacagggacccccgggagaaataggtttcccagggcagccaggggccaagggcgacaga
ggtttgcctggcagagatggtgttgcaggagtgccaggccctcaaggtacaccagggctg
ataggccagccaggagccaagggggagcctggtgagttttatttcgacttgcggctcaaa
ggtgacaaaggagacccaggctttccaggacagcccggcatgccagggagagcgggttct
cctggaagagatggccatccgggtcttcctggccccaagggctcgccgggttctgtagga
ttgaaaggagagcgtggcccccctggaggagttggattcccaggcagtcgtggtgacacc
ggcccccctgggcctccaggatatggtcctgctggtcccattggtgacaaaggacaagca
ggctttcctggaggccctggatccccaggcctgccaggtccaaagggtgaaccaggaaaa
attgttcctttaccaggcccccctggagcagaaggactgccggggtccccaggcttccca
ggtccccaaggagaccgaggctttcccggaaccccaggaaggccaggcctgccaggagag
aagggcgctgtgggccagccaggcattggatttccagggccccccggccccaaaggtgtt
gacggcttacctggagacatggggccaccggggactccaggtcgcccgggatttaatggc
ttacctgggaacccaggtgtgcagggccagaagggagagcctggagttggtctaccggga
ctcaaaggtttgccaggtcttcccggcattcctggcacacccggggagaaggggagcatt
ggggtaccaggcgttcctggagaacatggagcgatcggaccccctgggcttcaggggatc
agaggtgaaccgggacctcctggattgccaggctccgtggggtctccaggagttccagga
ataggcccccctggagctaggggtccccctggaggacagggaccaccggggttgtcaggc
cctcctggaataaaaggagagaagggtttccccggattccctggactggacatgccgggc
cctaaaggagataaaggggctcaaggactccctggcataacgggacagtcggggctccct
ggccttcctggacagcagggggctcctgggattcctgggtttccaggttccaagggagaa
atgggcgtcatggggacccccgggcagccgggctcaccaggaccagtgggtgctcctgga
ttaccgggtgaaaaaggggaccatggctttccgggctcctcaggacccaggggagaccct
ggcttgaaaggtgataagggggatgtcggtctccctggcaagcctggctccatggataag
gtggacatgggcagcatgaagggccagaaaggagaccaaggagagaaaggacaaattgga
ccaattggtgagaagggatcccgaggagaccctgggaccccaggagtgcctggaaaggac
gggcaggcaggacagcctgggcagccaggacctaaaggtgatccaggtataagtggaacc
ccaggtgctccaggacttccgggaccaaaaggatctgttggtggaatgggcttgccagga
acacctggagagaaaggtgtgcctggcatccctggcccacaaggttcacctggcttacct
ggagacaaaggtgcaaaaggagagaaagggcaggcaggcccacctggcataggcatccca
gggctgcgaggtgaaaagggagatcaagggatagcgggtttcccaggaagccctggagag
aagggagaaaaaggaagcattgggatcccaggaatgccagggtccccaggccttaaaggg
tctcccgggagtgttggctatccaggaagtcctgggctacctggagaaaaaggtgacaaa
ggcctcccaggattggatggcatccctggtgtcaaaggagaagcaggtcttcctgggact
cctggccccacaggcccagctggccagaaaggggagccaggcagtgatggaatcccgggg
tcagcaggagagaagggtgaaccaggtctaccaggaagaggattcccagggtttccaggg
gccaaaggagacaaaggttcaaagggtgaggtgggtttcccaggattagccgggagccca
ggaattcctggatccaaaggagagcaaggattcatgggtcctccggggccccagggacag
ccggggttaccgggatccccaggccatgccacggaggggcccaaaggagaccgcggacct
cagggccagcctggcctgccaggacttccgggacccatggggcctccagggcttcctggg
attgatggagttaaaggtgacaaaggaaatccaggctggccaggagcacccggtgtccca
gggcccaagggagaccctggattccagggcatgcctggtattggtggctctccaggaatc
acaggctctaagggtgatatggggcctccaggagttccaggatttcaaggtccaaaaggt
cttcctggcctccagggaattaaaggtgatcaaggcgatcaaggcgtcccgggagctaaa
ggtctcccgggtcctcctggccccccaggtccttacgacatcatcaaaggggagcccggg
ctccctggtcctgagggccccccagggctgaaagggcttcagggactgccaggcccgaaa
ggccagcaaggtgttacaggattggtgggtatacctggacctccaggtattcctgggttt
gacggtgcccctggccagaaaggagagatgggacctgccgggcctactggtccaagagga
tttccaggtccaccaggccccgatgggttgccaggatccatggggcccccaggcacccca
tctgttgatcacggcttccttgtgaccaggcatagtcaaacaatagatgacccacagtgt
ccttctgggaccaaaattctttaccacgggtactctttgctctacgtgcaaggcaatgaa
cgggcccatggccaggacttgggcacggccggcagctgcctgcgcaagttcagcacaatg
cccttcctgttctgcaatattaacaacgtgtgcaactttgcatcacgaaatgactactcg
tactggctgtccacccctgagcccatgcccatgtcaatggcacccatcacgggggaaaac
ataagaccatttattagtaggtgtgctgtgtgtgaggcgcctgccatggtgatggccgtg
cacagccagaccattcagatcccaccgtgccccagcgggtggtcctcgctgtggatcggc
tactcttttgtgatgcacaccagcgctggtgcagaaggctctggccaagccctggcgtcc
cccggctcctgcctggaggagtttagaagtgcgccattcatcgagtgtcacggccgtggg
acctgcaattactacgcaaacgcttacagcttttggctcgccaccatagagaggagcgag
atgttcaagaagcctacgccgtccaccttgaaggcaggggagctgcgcacgcacgtcagc
cgctgccaagtctgtatgagaagaacataa

KEGG   Homo sapiens (human): 1284
Entry
1284              CDS       T01001                                 
Symbol
COL4A2, BSVD2, ICH, POREN2
Name
(RefSeq) collagen type IV alpha 2 chain
  KO
K06237  collagen type IV alpha
Organism
hsa  Homo sapiens (human)
Pathway
hsa04151  PI3K-Akt signaling pathway
hsa04510  Focal adhesion
hsa04512  ECM-receptor interaction
hsa04820  Cytoskeleton in muscle cells
hsa04926  Relaxin signaling pathway
hsa04933  AGE-RAGE signaling pathway in diabetic complications
hsa04974  Protein digestion and absorption
hsa05146  Amoebiasis
hsa05165  Human papillomavirus infection
hsa05200  Pathways in cancer
hsa05222  Small cell lung cancer
Network
nt06539  Cytoskeleton in muscle cells
  Element
N01814  Extracellular matrix - Basal lamina
Disease
H00839  Porencephaly
H00877  Brain small vessel disease
Brite
KEGG Orthology (KO) [BR:hsa00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    1284 (COL4A2)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    1284 (COL4A2)
 09140 Cellular Processes
  09142 Cell motility
   04820 Cytoskeleton in muscle cells
    1284 (COL4A2)
 09150 Organismal Systems
  09152 Endocrine system
   04926 Relaxin signaling pathway
    1284 (COL4A2)
  09154 Digestive system
   04974 Protein digestion and absorption
    1284 (COL4A2)
 09160 Human Diseases
  09161 Cancer: overview
   05200 Pathways in cancer
    1284 (COL4A2)
  09162 Cancer: specific types
   05222 Small cell lung cancer
    1284 (COL4A2)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    1284 (COL4A2)
  09174 Infectious disease: parasitic
   05146 Amoebiasis
    1284 (COL4A2)
  09167 Endocrine and metabolic disease
   04933 AGE-RAGE signaling pathway in diabetic complications
    1284 (COL4A2)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   04147 Exosome [BR:hsa04147]
    1284 (COL4A2)
   00536 Glycosaminoglycan binding proteins [BR:hsa00536]
    1284 (COL4A2)
Exosome [BR:hsa04147]
 Exosomal proteins
  Exosomal proteins of other cancer cells
   1284 (COL4A2)
Glycosaminoglycan binding proteins [BR:hsa00536]
 Heparan sulfate / Heparin
  Extracellular matrix molecules
   1284 (COL4A2)
SSDB
Motif
Pfam: Collagen C4
Other DBs
NCBI-GeneID: 1284
NCBI-ProteinID: NP_001837
OMIM: 120090
HGNC: 2203
Ensembl: ENSG00000134871
Pharos: P08572(Tbio)
UniProt: P08572
Structure
LinkDB
Position
13:110307284..110513209
AA seq 1712 aa
MGRDQRAVAGPALRRWLLLGTVTVGFLAQSVLAGVKKFDVPCGGRDCSGGCQCYPEKGGR
GQPGPVGPQGYNGPPGLQGFPGLQGRKGDKGERGAPGVTGPKGDVGARGVSGFPGADGIP
GHPGQGGPRGRPGYDGCNGTQGDSGPQGPPGSEGFTGPPGPQGPKGQKGEPYALPKEERD
RYRGEPGEPGLVGFQGPPGRPGHVGQMGPVGAPGRPGPPGPPGPKGQQGNRGLGFYGVKG
EKGDVGQPGPNGIPSDTLHPIIAPTGVTFHPDQYKGEKGSEGEPGIRGISLKGEEGIMGF
PGLRGYPGLSGEKGSPGQKGSRGLDGYQGPDGPRGPKGEAGDPGPPGLPAYSPHPSLAKG
ARGDPGFPGAQGEPGSQGEPGDPGLPGPPGLSIGDGDQRRGLPGEMGPKGFIGDPGIPAL
YGGPPGPDGKRGPPGPPGLPGPPGPDGFLFGLKGAKGRAGFPGLPGSPGARGPKGWKGDA
GECRCTEGDEAIKGLPGLPGPKGFAGINGEPGRKGDRGDPGQHGLPGFPGLKGVPGNIGA
PGPKGAKGDSRTITTKGERGQPGVPGVPGMKGDDGSPGRDGLDGFPGLPGPPGDGIKGPP
GDPGYPGIPGTKGTPGEMGPPGLGLPGLKGQRGFPGDAGLPGPPGFLGPPGPAGTPGQID
CDTDVKRAVGGDRQEAIQPGCIGGPKGLPGLPGPPGPTGAKGLRGIPGFAGADGGPGPRG
LPGDAGREGFPGPPGFIGPRGSKGAVGLPGPDGSPGPIGLPGPDGPPGERGLPGEVLGAQ
PGPRGDAGVPGQPGLKGLPGDRGPPGFRGSQGMPGMPGLKGQPGLPGPSGQPGLYGPPGL
HGFPGAPGQEGPLGLPGIPGREGLPGDRGDPGDTGAPGPVGMKGLSGDRGDAGFTGEQGH
PGSPGFKGIDGMPGTPGLKGDRGSPGMDGFQGMPGLKGRPGFPGSKGEAGFFGIPGLKGL
AGEPGFKGSRGDPGPPGPPPVILPGMKDIKGEKGDEGPMGLKGYLGAKGIQGMPGIPGLS
GIPGLPGRPGHIKGVKGDIGVPGIPGLPGFPGVAGPPGITGFPGFIGSRGDKGAPGRAGL
YGEIGATGDFGDIGDTINLPGRPGLKGERGTTGIPGLKGFFGEKGTEGDIGFPGITGVTG
VQGPPGLKGQTGFPGLTGPPGSQGELGRIGLPGGKGDDGWPGAPGLPGFPGLRGIRGLHG
LPGTKGFPGSPGSDIHGDPGFPGPPGERGDPGEANTLPGPVGVPGQKGDQGAPGERGPPG
SPGLQGFPGITPPSNISGAPGDKGAPGIFGLKGYRGPPGPPGSAALPGSKGDTGNPGAPG
TPGTKGWAGDSGPQGRPGVFGLPGEKGPRGEQGFMGNTGPTGAVGDRGPKGPKGDPGFPG
APGTVGAPGIAGIPQKIAVQPGTVGPQGRRGPPGAPGEMGPQGPPGEPGFRGAPGKAGPQ
GRGGVSAVPGFRGDEGPIGHQGPIGQEGAPGRPGSPGLPGMPGRSVSIGYLLVKHSQTDQ
EPMCPVGMNKLWSGYSLLYFEGQEKAHNQDLGLAGSCLARFSTMPFLYCNPGDVCYYASR
NDKSYWLSTTAPLPMMPVAEDEIKPYISRCSVCEAPAIAIAVHSQDVSIPHCPAGWRSLW
IGYSFLMHTAAGDEGGGQSLVSPGSCLEDFRATPFIECNGGRGTCHYYANKYSFWLTTIP
EQSFQGSPSADTLKAGLIRTHISRCQVCMKNL
NT seq 5139 nt   +upstreamnt  +downstreamnt
atggggagagaccagcgcgcggtggccggccctgccctacggcggtggctgctgctgggg
acagtgaccgtggggttcctcgcccagagcgtcttggcgggtgtgaagaagtttgatgtg
ccgtgtggaggaagagattgcagtgggggctgccagtgctaccctgagaaaggtggacgt
ggtcagcctgggccagtgggcccccaggggtacaatgggccaccaggattacaaggattc
ccgggactgcagggacgtaaaggagacaagggtgaaaggggagcccccggagtaacggga
cccaagggcgacgtgggagcaagaggcgtttctggattccctggtgccgatggaattcct
ggacacccggggcaaggtgggcccaggggaaggccgggctacgatggctgcaacggaacc
cagggagactcaggtccacaggggccccccggctctgaggggttcaccgggcctcccggg
ccccaaggaccaaaagggcagaaaggtgagccttatgcactgcctaaagaggagcgcgac
agatatcggggtgaacctggagagcctggattggtcggtttccagggacctcccggccgc
cctgggcatgtgggacagatgggtccagttggagctccagggagaccaggaccacctgga
ccccctggaccaaaaggacagcaaggcaacagaggacttggtttctacggagttaagggt
gaaaagggtgacgtagggcagccgggacccaacgggattccatcagacaccctccacccc
atcatcgcgcccacaggagtcaccttccacccagatcagtacaagggtgaaaaaggcagt
gagggggaaccaggaataagaggcatttccttgaagggagaagaaggaatcatgggcttt
cctggactgaggggttaccctggcttgagtggtgaaaaaggatcaccaggacagaaggga
agccgaggcctggatggctatcaagggcctgatggaccccggggacccaagggagaagcc
ggagacccagggccccctggactacctgcctactcccctcacccttccctagcaaaaggt
gccagaggtgacccgggattcccaggggcccaaggggagccaggaagccagggtgagcca
ggagacccgggcctcccaggtccccctggcctctccatcggagatggagatcagaggaga
ggcctgccgggtgagatgggacccaagggcttcatcggagaccccggcatccctgcgctc
tacgggggcccacctggacctgatggaaagcgagggcctccaggaccccccgggctccct
ggaccacctggacctgatggcttcctgtttgggctgaaaggagcaaaaggaagagcaggc
ttccctgggcttcccggctcccctggagcccgcggaccaaaggggtggaaaggtgacgct
ggggaatgcagatgtacagaaggcgacgaagctatcaaaggtcttccgggactgccagga
cccaagggcttcgcaggcatcaacggggagccggggaggaaaggggacagaggagacccc
ggccaacacggcctccctgggttcccagggctcaagggagtgcctggcaacattggtgct
cccggacccaaaggagcaaaaggagattccagaacaatcacaaccaaaggtgagcgggga
cagcccggcgtcccaggtgtgcccgggatgaaaggtgacgatggcagcccaggccgcgat
gggctcgatggattccccggcctcccaggccctcccggtgatggcatcaagggccctcca
ggggacccaggctatccaggaatacctggaacgaagggtactccaggagaaatgggcccc
ccaggactgggccttcccggcctcaaaggccaacgtggtttccctggagacgccggctta
cctggaccaccaggcttcctgggccctcctggccccgcagggaccccaggacaaatagat
tgtgacacagatgtgaaaagggccgttggaggtgacagacaggaggccatccagccaggt
tgcataggagggcccaagggattgccaggcctgccaggacccccaggccccacaggtgcc
aaaggcctccgaggaatcccaggcttcgcaggagctgatggaggaccagggcccaggggc
ttgccaggagacgcaggtcgtgaagggttcccaggacccccagggttcataggaccccga
ggatccaaaggtgcagtgggcctccctggcccagatggatccccaggtcccatcggcctg
ccagggccagatgggccccctggggaaaggggcctccctggagaagtcctgggagctcag
cccgggccacggggagatgctggtgtgcctggacagcctgggcttaaaggccttcccgga
gacagaggcccccctggattcagaggaagccaagggatgcctgggatgccagggctgaag
ggccagccaggcctcccaggaccttccggccagccaggcctgtatgggcctccaggactg
catggattcccaggagctcctggccaagaggggcccttggggctgccaggaatcccaggc
cgtgaaggtctgcctggtgatagaggggaccctggggacacaggcgctcctggccctgtg
ggcatgaaaggtctctctggtgacagaggagatgctggcttcacaggggagcaaggccat
ccaggaagccctggatttaaaggaattgatggaatgcctgggacccccgggctaaaagga
gatagaggctcacctgggatggatggtttccaaggcatgcctggactcaaagggagaccc
gggtttccagggagcaaaggcgaggctggatttttcggaatacccggtctgaagggtctg
gctggtgagccaggttttaaaggcagccgaggggaccctgggcccccaggaccacctcct
gtcatcctgccaggaatgaaagacattaaaggagagaaaggagatgaagggcctatgggg
ctgaaaggatacctgggcgcaaaaggtatccaaggaatgccaggcatcccagggctgtca
ggaatccctgggctgcctgggaggcccggccacatcaaaggagtcaagggagacatcgga
gtccccggcatccccggtttgccaggattccctggggtggctggcccccctggaattacg
ggattcccaggattcataggaagccggggtgacaaaggtgccccagggagagcaggcctg
tatggcgagattggcgcgactggtgatttcggtgacatcggggacactataaatttacca
ggaagaccaggcctgaagggggagcggggcaccactggaataccaggtctgaagggattc
tttggagagaagggaacagaaggtgacatcggcttccctgggataacaggcgtgactgga
gtccaaggccctcctggacttaaaggacaaacaggctttccagggctgactgggcctcca
gggtcgcagggagagctggggcggattggactgcctggtggcaaaggagatgatggctgg
ccgggagctccgggcttaccaggttttccgggactccgtgggatccgcggcttacacggc
ttgccaggcaccaagggctttccaggatccccaggttctgacatccacggagacccaggc
ttcccaggccctcctggggaaagaggtgacccaggagaggccaacacccttccaggccct
gtgggagtcccaggacagaaaggagaccaaggagctccaggggaacgaggcccacctggg
agcccaggacttcaggggttccctggtatcacacccccttccaacatctctggggcacct
ggtgacaaaggggcgccagggatatttggcctgaaaggttatcggggcccaccagggcca
ccaggttctgctgctcttcctggaagcaaaggtgacacagggaacccaggagctccagga
accccagggaccaaaggatgggccggggactccgggccccagggcaggcctggtgtgttt
ggtctcccaggagaaaaagggcccaggggtgaacaaggcttcatggggaacactggaccc
actggggcggtgggcgacagaggccccaagggacccaagggagacccaggattccctggt
gcccccgggactgtgggagcccccgggattgcaggaatcccccagaagattgccgtccaa
ccagggacagtgggtccccaggggaggcgaggcccccctggggcaccgggggagatgggg
ccccagggcccccccggagaaccaggtttccgtggggctccagggaaagctgggccccaa
ggaagaggtggtgtgtctgctgttcccggcttccggggagatgaaggacccataggccac
caggggccgattggccaagaaggtgcaccaggccgtccagggagcccgggcctgccgggt
atgccaggccgcagcgtcagcatcggctacctcctggtgaagcacagccagacggaccag
gagcccatgtgcccagtgggcatgaacaaactctggagtggatacagcctgctgtacttc
gagggccaggagaaggcgcacaaccaggacctggggctggcgggctcctgcctggcgcgg
ttcagcaccatgcccttcctgtactgcaaccctggtgatgtctgctactatgccagccgg
aacgacaagtcctactggctctctaccactgcgccgctgcccatgatgcccgtggccgag
gacgagatcaagccctacatcagccgctgttctgtgtgtgaggccccggccatcgccatc
gcggtccacagtcaggatgtctccatcccacactgcccagctgggtggcggagtttgtgg
atcggatattccttcctcatgcacacggcggcgggagacgaaggcggtggccaatcactg
gtgtcaccgggcagctgtctagaggacttccgcgccacaccattcatcgaatgcaatgga
ggccgcggcacctgccactactacgccaacaagtacagcttctggctgaccaccattccc
gagcagagcttccagggctcgccctccgccgacacgctcaaggccggcctcatccgcaca
cacatcagccgctgccaggtgtgcatgaagaacctgtga

KEGG   Homo sapiens (human): 1285
Entry
1285              CDS       T01001                                 
Symbol
COL4A3, ATS2, ATS3, ATS3A, ATS3B, BFH2
Name
(RefSeq) collagen type IV alpha 3 chain
  KO
K06237  collagen type IV alpha
Organism
hsa  Homo sapiens (human)
Pathway
hsa04151  PI3K-Akt signaling pathway
hsa04510  Focal adhesion
hsa04512  ECM-receptor interaction
hsa04820  Cytoskeleton in muscle cells
hsa04926  Relaxin signaling pathway
hsa04933  AGE-RAGE signaling pathway in diabetic complications
hsa04974  Protein digestion and absorption
hsa05146  Amoebiasis
hsa05165  Human papillomavirus infection
hsa05200  Pathways in cancer
hsa05222  Small cell lung cancer
Network
nt06539  Cytoskeleton in muscle cells
  Element
N01814  Extracellular matrix - Basal lamina
Disease
H00581  Alport syndrome
H00582  Benign familial hematuria
Brite
KEGG Orthology (KO) [BR:hsa00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    1285 (COL4A3)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    1285 (COL4A3)
 09140 Cellular Processes
  09142 Cell motility
   04820 Cytoskeleton in muscle cells
    1285 (COL4A3)
 09150 Organismal Systems
  09152 Endocrine system
   04926 Relaxin signaling pathway
    1285 (COL4A3)
  09154 Digestive system
   04974 Protein digestion and absorption
    1285 (COL4A3)
 09160 Human Diseases
  09161 Cancer: overview
   05200 Pathways in cancer
    1285 (COL4A3)
  09162 Cancer: specific types
   05222 Small cell lung cancer
    1285 (COL4A3)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    1285 (COL4A3)
  09174 Infectious disease: parasitic
   05146 Amoebiasis
    1285 (COL4A3)
  09167 Endocrine and metabolic disease
   04933 AGE-RAGE signaling pathway in diabetic complications
    1285 (COL4A3)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   04147 Exosome [BR:hsa04147]
    1285 (COL4A3)
   00536 Glycosaminoglycan binding proteins [BR:hsa00536]
    1285 (COL4A3)
Exosome [BR:hsa04147]
 Exosomal proteins
  Exosomal proteins of other cancer cells
   1285 (COL4A3)
Glycosaminoglycan binding proteins [BR:hsa00536]
 Heparan sulfate / Heparin
  Extracellular matrix molecules
   1285 (COL4A3)
SSDB
Motif
Pfam: Collagen C4
Other DBs
NCBI-GeneID: 1285
NCBI-ProteinID: NP_000082
OMIM: 120070
HGNC: 2204
Ensembl: ENSG00000169031
Pharos: Q01955(Tbio)
UniProt: Q01955
Structure
LinkDB
Position
2:227164624..227314792
AA seq 1670 aa
MSARTAPRPQVLLLPLLLVLLAAAPAASKGCVCKDKGQCFCDGAKGEKGEKGFPGPPGSP
GQKGFTGPEGLPGPQGPKGFPGLPGLTGSKGVRGISGLPGFSGSPGLPGTPGNTGPYGLV
GVPGCSGSKGEQGFPGLPGTLGYPGIPGAAGLKGQKGAPAKEEDIELDAKGDPGLPGAPG
PQGLPGPPGFPGPVGPPGPPGFFGFPGAMGPRGPKGHMGERVIGHKGERGVKGLTGPPGP
PGTVIVTLTGPDNRTDLKGEKGDKGAMGEPGPPGPSGLPGESYGSEKGAPGDPGLQGKPG
KDGVPGFPGSEGVKGNRGFPGLMGEDGIKGQKGDIGPPGFRGPTEYYDTYQEKGDEGTPG
PPGPRGARGPQGPSGPPGVPGSPGSSRPGLRGAPGWPGLKGSKGERGRPGKDAMGTPGSP
GCAGSPGLPGSPGPPGPPGDIVFRKGPPGDHGLPGYLGSPGIPGVDGPKGEPGLLCTQCP
YIPGPPGLPGLPGLHGVKGIPGRQGAAGLKGSPGSPGNTGLPGFPGFPGAQGDPGLKGEK
GETLQPEGQVGVPGDPGLRGQPGRKGLDGIPGTPGVKGLPGPKGELALSGEKGDQGPPGD
PGSPGSPGPAGPAGPPGYGPQGEPGLQGTQGVPGAPGPPGEAGPRGELSVSTPVPGPPGP
PGPPGHPGPQGPPGIPGSLGKCGDPGLPGPDGEPGIPGIGFPGPPGPKGDQGFPGTKGSL
GCPGKMGEPGLPGKPGLPGAKGEPAVAMPGGPGTPGFPGERGNSGEHGEIGLPGLPGLPG
TPGNEGLDGPRGDPGQPGPPGEQGPPGRCIEGPRGAQGLPGLNGLKGQQGRRGKTGPKGD
PGIPGLDRSGFPGETGSPGIPGHQGEMGPLGQRGYPGNPGILGPPGEDGVIGMMGFPGAI
GPPGPPGNPGTPGQRGSPGIPGVKGQRGTPGAKGEQGDKGNPGPSEISHVIGDKGEPGLK
GFAGNPGEKGNRGVPGMPGLKGLKGLPGPAGPPGPRGDLGSTGNPGEPGLRGIPGSMGNM
GMPGSKGKRGTLGFPGRAGRPGLPGIHGLQGDKGEPGYSEGTRPGPPGPTGDPGLPGDMG
KKGEMGQPGPPGHLGPAGPEGAPGSPGSPGLPGKPGPHGDLGFKGIKGLLGPPGIRGPPG
LPGFPGSPGPMGIRGDQGRDGIPGPAGEKGETGLLRAPPGPRGNPGAQGAKGDRGAPGFP
GLPGRKGAMGDAGPRGPTGIEGFPGPPGLPGAIIPGQTGNRGPPGSRGSPGAPGPPGPPG
SHVIGIKGDKGSMGHPGPKGPPGTAGDMGPPGRLGAPGTPGLPGPRGDPGFQGFPGVKGE
KGNPGFLGSIGPPGPIGPKGPPGVRGDPGTLKIISLPGSPGPPGTPGEPGMQGEPGPPGP
PGNLGPCGPRGKPGKDGKPGTPGPAGEKGNKGSKGEPGPAGSDGLPGLKGKRGDSGSPAT
WTTRGFVFTRHSQTTAIPSCPEGTVPLYSGFSFLFVQGNQRAHGQDLGTLGSCLQRFTTM
PFLFCNVNDVCNFASRNDYSYWLSTPALMPMNMAPITGRALEPYISRCTVCEGPAIAIAV
HSQTTDIPPCPHGWISLWKGFSFIMFTSAGSEGTGQALASPGSCLEEFRASPFLECHGRG
TCNYYSNSYSFWLASLNPERMFRKPIPSTVKAGELEKIISRCQVCMKKRH
NT seq 5013 nt   +upstreamnt  +downstreamnt
atgagcgcccggaccgcccccaggccgcaggtgctcctgctgccgctcctgctggtgctc
ctggcggcggcgcccgcagccagcaagggttgtgtctgtaaagacaaaggccagtgcttc
tgtgacggggccaaaggggagaagggggagaagggctttcctggaccccccggttctcct
ggccagaaaggattcacaggtcctgaaggcttgcctggaccgcagggacccaagggcttt
ccaggacttccaggactcacgggttccaaaggtgtaaggggaataagtggattgccagga
ttttctggttctcctggacttccaggcaccccaggcaataccgggccttacggacttgtc
ggtgtaccaggatgcagtggttctaagggtgagcaggggtttccaggactcccagggaca
ctgggctacccagggatcccgggtgctgctggtttgaaaggacaaaagggtgctcctgct
aaagaagaagatatagaacttgatgcaaaaggcgaccccgggttgccaggggctccagga
ccccagggtttgccaggccctccaggttttcctgggcctgttggcccacctggtcctccg
ggattctttggctttccaggagccatgggacctagaggacctaagggtcacatgggtgaa
agagtgataggacataaaggagagcggggtgtgaaagggttaacaggacccccgggacca
ccaggaacagttattgtgaccctaactggcccagataacagaacggacctcaagggggaa
aagggagacaagggagcaatgggcgagcctggacctcctggaccctcaggactgcctgga
gaatcatatggatctgaaaagggtgctcctggagaccctggcctgcagggaaaacccgga
aaagatggtgttcctggcttccctggaagtgagggagtcaagggcaacaggggtttccct
gggttaatgggtgaagatggcattaagggacagaaaggggacattggccctccaggattt
cgtggtccaacagaatattatgacacataccaggaaaagggagatgaaggcactccaggc
ccaccagggcccagaggagctcgtggcccacaaggtcccagtggtccccccggagttcct
ggaagtcctggatcatcaaggcctggcctcagaggagcccctggatggccaggcctgaaa
ggaagtaaaggggaacgaggccgcccaggaaaggatgccatggggactcctgggtcccca
ggttgtgctggttcaccaggtcttccaggatcaccgggacctccaggaccgccaggtgac
atcgtttttcgcaagggtccacctggagatcacggactgccaggctatctagggtctcca
ggaatcccaggagttgatgggcccaaaggagaaccaggcctcctgtgtacacagtgccct
tatatcccagggcctcccggtctcccaggattgccagggttacatggtgtaaaaggaatc
ccaggaagacaaggcgcagctggcttgaaaggaagcccagggtccccaggaaatacaggt
cttccaggatttccaggtttcccaggtgcccagggtgacccaggacttaaaggagaaaaa
ggtgaaacacttcagcctgaggggcaagtgggtgtcccaggtgacccggggctcagaggc
caacctgggagaaagggcttggatggaattcctggaactccgggagtgaaaggattacca
ggacctaaaggcgaactggctctgagtggtgagaaaggggaccaaggtcctccaggggat
cctggctcccctgggtccccaggacctgcaggaccagctggaccacctggctacggaccc
caaggagaacctggtctccagggcacgcaaggagttcctggagcccccggaccacccgga
gaagccggccctaggggagagctcagtgtttcaacaccagttccaggcccaccaggacct
ccagggccccctggccatcctggcccccaaggtccacctggtatccctggatccctgggg
aaatgtggagatcctggtcttccagggcctgatggtgaaccaggaattccaggaattgga
tttcctgggcctcctggacctaagggagaccaaggttttccaggtacaaaaggatcactg
ggttgtcctggaaaaatgggagagcctgggttacctggaaagccaggcctcccaggagcc
aagggagaaccagcagtagccatgcctggaggaccaggaacaccaggttttccaggagaa
agaggcaattctggggaacatggagaaattggactccctggacttccaggtctccctgga
actccaggaaatgaagggcttgatggaccacgaggagatccagggcagcctggaccacct
ggagaacaaggacccccaggaaggtgcatagagggtcccaggggagcccaaggacttcca
ggcttaaatggattgaaagggcaacaaggcagaagaggtaaaacggggccaaagggagac
ccaggaattccaggcttggatagatcaggatttcctggagaaactggatcaccaggaatt
ccaggtcatcaaggtgaaatgggaccactgggtcaaagaggatatccaggaaatccggga
attttagggccaccaggtgaagatggagtgattgggatgatgggctttcctggagccatt
ggccctccagggccccctgggaacccaggcacaccagggcagagggggagccctggaatt
ccaggagtaaagggccagagaggaaccccaggagccaagggggaacaaggagataaagga
aatcccgggccttcagagatatcccacgtaataggggacaaaggagaaccaggtctcaaa
ggattcgcaggaaatccaggtgagaaaggaaacagaggcgttccagggatgccaggttta
aagggcctcaaaggactacccggaccagcaggaccaccaggccccagaggagatttgggc
agcactgggaatcctggagaaccaggactgcgtggtataccaggaagcatggggaacatg
ggcatgccaggttctaaaggaaaaaggggaactttgggattcccaggtcgagcaggaaga
ccaggcctcccaggtattcatggtctccagggagataagggagagccaggttattcagaa
ggtacaaggccaggaccaccgggaccaacgggggatccaggactgccgggtgatatggga
aagaaaggagaaatggggcaacctggcccacctggacatttggggcctgctggacctgag
ggagcccctggaagtcctggaagtcctggcctcccaggaaagccaggtcctcatggtgat
ttgggttttaaaggaatcaaaggcctcctgggccctccaggaatcagaggccctccaggt
cttccaggatttccaggatctcctggaccaatgggtataagaggtgaccaaggacgtgat
ggaattcctggtccagccggagaaaagggagaaacgggtttattgagggcccctccaggc
ccaagagggaaccctggtgctcaaggagccaaaggagacaggggagccccaggttttcct
ggcctcccgggcagaaaaggggccatgggagatgctggacctcgaggacccacaggcata
gaaggattcccagggccaccaggtctgcccggtgcaattatccctggccagacaggaaat
cgtggtccaccaggctcaagaggaagcccaggtgcgcctggtccccctggacctccaggg
agtcatgtaataggcataaaaggagacaaagggtctatgggccaccctggcccaaaaggt
ccacctggaactgcaggagacatgggaccaccaggtcgtctgggagcaccaggtactcca
ggtcttccaggacccagaggtgatcctggattccaggggtttccaggcgtgaaaggagaa
aagggtaatcctggatttctaggatccattggacctccaggaccaattgggccaaaagga
ccacctggtgtacgtggagaccctggcacacttaagattatctcccttccaggaagccca
gggccacctggcacacctggagaaccagggatgcagggagaacctgggccaccagggcca
cctggaaacctaggaccctgtgggccaagaggtaagccaggcaaggatggaaaaccagga
actcctggaccagctggagaaaaaggcaacaaaggttctaaaggagagccaggaccagct
ggatcagatggattgccaggtttgaaaggaaaacgtggagacagtggatcacctgcaacc
tggacaacgagaggctttgtcttcacccgacacagtcaaaccacagcaattccttcatgt
ccagaggggacagtgccactctacagtgggttttcttttctttttgtacaaggaaatcaa
cgagcccacggacaagaccttggaactcttggcagctgcctgcagcgatttaccacaatg
ccattcttattctgcaatgtcaatgatgtatgtaattttgcatctcgaaatgattattca
tactggctgtcaacaccagctctgatgccaatgaacatggctcccattactggcagagcc
cttgagccttatataagcagatgcactgtttgtgaaggtcctgcgatcgccatagccgtt
cacagccaaaccactgacattcctccatgtcctcacggctggatttctctctggaaagga
ttttcattcatcatgttcacaagtgcaggttctgagggcaccgggcaagcactggcctcc
cctggctcctgcctggaagaattccgagccagcccatttctagaatgtcatggaagagga
acgtgcaactactattcaaattcctacagtttctggctggcttcattaaacccagaaaga
atgttcagaaagcctattccatcaactgtgaaagctggggaattagaaaaaataataagt
cgctgtcaggtgtgcatgaagaaaagacactga

KEGG   Homo sapiens (human): 1286
Entry
1286              CDS       T01001                                 
Symbol
COL4A4, ATS2, BFH, BFH1, CA44
Name
(RefSeq) collagen type IV alpha 4 chain
  KO
K06237  collagen type IV alpha
Organism
hsa  Homo sapiens (human)
Pathway
hsa04151  PI3K-Akt signaling pathway
hsa04510  Focal adhesion
hsa04512  ECM-receptor interaction
hsa04820  Cytoskeleton in muscle cells
hsa04926  Relaxin signaling pathway
hsa04933  AGE-RAGE signaling pathway in diabetic complications
hsa04974  Protein digestion and absorption
hsa05146  Amoebiasis
hsa05165  Human papillomavirus infection
hsa05200  Pathways in cancer
hsa05222  Small cell lung cancer
Network
nt06539  Cytoskeleton in muscle cells
  Element
N01814  Extracellular matrix - Basal lamina
Disease
H00581  Alport syndrome
H00582  Benign familial hematuria
Brite
KEGG Orthology (KO) [BR:hsa00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    1286 (COL4A4)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    1286 (COL4A4)
 09140 Cellular Processes
  09142 Cell motility
   04820 Cytoskeleton in muscle cells
    1286 (COL4A4)
 09150 Organismal Systems
  09152 Endocrine system
   04926 Relaxin signaling pathway
    1286 (COL4A4)
  09154 Digestive system
   04974 Protein digestion and absorption
    1286 (COL4A4)
 09160 Human Diseases
  09161 Cancer: overview
   05200 Pathways in cancer
    1286 (COL4A4)
  09162 Cancer: specific types
   05222 Small cell lung cancer
    1286 (COL4A4)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    1286 (COL4A4)
  09174 Infectious disease: parasitic
   05146 Amoebiasis
    1286 (COL4A4)
  09167 Endocrine and metabolic disease
   04933 AGE-RAGE signaling pathway in diabetic complications
    1286 (COL4A4)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   04147 Exosome [BR:hsa04147]
    1286 (COL4A4)
   00536 Glycosaminoglycan binding proteins [BR:hsa00536]
    1286 (COL4A4)
Exosome [BR:hsa04147]
 Exosomal proteins
  Exosomal proteins of other cancer cells
   1286 (COL4A4)
Glycosaminoglycan binding proteins [BR:hsa00536]
 Heparan sulfate / Heparin
  Extracellular matrix molecules
   1286 (COL4A4)
SSDB
Motif
Pfam: Collagen C4
Other DBs
NCBI-GeneID: 1286
NCBI-ProteinID: NP_000083
OMIM: 120131
HGNC: 2206
Ensembl: ENSG00000081052
Pharos: P53420(Tbio)
UniProt: P53420
Structure
LinkDB
Position
2:complement(226967360..227164488)
AA seq 1690 aa
MWSLHIVLMRCSFRLTKSLATGPWSLILILFSVQYVYGSGKKYIGPCGGRDCSVCHCVPE
KGSRGPPGPPGPQGPIGPLGAPGPIGLSGEKGMRGDRGPPGAAGDKGDKGPTGVPGFPGL
DGIPGHPGPPGPRGKPGMSGHNGSRGDPGFPGGRGALGPGGPLGHPGEKGEKGNSVFILG
AVKGIQGDRGDPGLPGLPGSWGAGGPAGPTGYPGEPGLVGPPGQPGRPGLKGNPGVGVKG
QMGDPGEVGQQGSPGPTLLVEPPDFCLYKGEKGIKGIPGMVGLPGPPGRKGESGIGAKGE
KGIPGFPGPRGDPGSYGSPGFPGLKGELGLVGDPGLFGLIGPKGDPGNRGHPGPPGVLVT
PPLPLKGPPGDPGFPGRYGETGDVGPPGPPGLLGRPGEACAGMIGPPGPQGFPGLPGLPG
EAGIPGRPDSAPGKPGKPGSPGLPGAPGLQGLPGSSVIYCSVGNPGPQGIKGKVGPPGGR
GPKGEKGNEGLCACEPGPMGPPGPPGLPGRQGSKGDLGLPGWLGTKGDPGPPGAEGPPGL
PGKHGASGPPGNKGAKGDMVVSRVKGHKGERGPDGPPGFPGQPGSHGRDGHAGEKGDPGP
PGDHEDATPGGKGFPGPLGPPGKAGPVGPPGLGFPGPPGERGHPGVPGHPGVRGPDGLKG
QKGDTISCNVTYPGRHGPPGFDGPPGPKGFPGPQGAPGLSGSDGHKGRPGTPGTAEIPGP
PGFRGDMGDPGFGGEKGSSPVGPPGPPGSPGVNGQKGIPGDPAFGHLGPPGKRGLSGVPG
IKGPRGDPGCPGAEGPAGIPGFLGLKGPKGREGHAGFPGVPGPPGHSCERGAPGIPGQPG
LPGYPGSPGAPGGKGQPGDVGPPGPAGMKGLPGLPGRPGAHGPPGLPGIPGPFGDDGLPG
PPGPKGPRGLPGFPGFPGERGKPGAEGCPGAKGEPGEKGMSGLPGDRGLRGAKGAIGPPG
DEGEMAIISQKGTPGEPGPPGDDGFPGERGDKGTPGMQGRRGEPGRYGPPGFHRGEPGEK
GQPGPPGPPGPPGSTGLRGFIGFPGLPGDQGEPGSPGPPGFSGIDGARGPKGNKGDPASH
FGPPGPKGEPGSPGCPGHFGASGEQGLPGIQGPRGSPGRPGPPGSSGPPGCPGDHGMPGL
RGQPGEMGDPGPRGLQGDPGIPGPPGIKGPSGSPGLNGLHGLKGQKGTKGASGLHDVGPP
GPVGIPGLKGERGDPGSPGISPPGPRGKKGPPGPPGSSGPPGPAGATGRAPKDIPDPGPP
GDQGPPGPDGPRGAPGPPGLPGSVDLLRGEPGDCGLPGPPGPPGPPGPPGYKGFPGCDGK
DGQKGPVGFPGPQGPHGFPGPPGEKGLPGPPGRKGPTGLPGPRGEPGPPADVDDCPRIPG
LPGAPGMRGPEGAMGLPGMRGPSGPGCKGEPGLDGRRGVDGVPGSPGPPGRKGDTGEDGY
PGGPGPPGPIGDPGPKGFGPGYLGGFLLVLHSQTDQEPTCPLGMPRLWTGYSLLYLEGQE
KAHNQDLGLAGSCLPVFSTLPFAYCNIHQVCHYAQRNDRSYWLASAAPLPMMPLSEEAIR
PYVSRCAVCEAPAQAVAVHSQDQSIPPCPQTWRSLWIGYSFLMHTGAGDQGGGQALMSPG
SCLEDFRAAPFLECQGRQGTCHFFANKYSFWLTTVKADLQFSSAPAPDTLKESQAQRQKI
SRCQVCVKYS
NT seq 5073 nt   +upstreamnt  +downstreamnt
atgtggtctctgcacatagtactaatgaggtgctccttcagattgaccaagtccttggcc
acaggtccctggtcacttatactcattctcttttctgtacaatatgtatatgggagtgga
aagaaatacattggtccttgtggaggaagagattgctctgtttgccactgtgttcctgaa
aaggggtctcggggtccaccaggaccaccagggccacagggtccaattggacccctggga
gccccaggacccattgggctttcaggagagaaaggaatgagaggggaccgcggccctcct
ggagcagcaggggacaaaggagataagggtccaactggtgttcctggatttccaggttta
gatggcatacctgggcacccagggcctcctggacccagaggcaaacctggtatgagtggc
cacaatggctcaagaggtgacccagggtttccaggaggaagaggagctcttggcccagga
ggccccctaggccatcctggggaaaagggagaaaaaggaaattcagtgttcattttaggt
gccgttaaaggtattcagggagacagaggggacccaggactgcctggcttaccaggatct
tggggtgcaggaggaccggcaggtcccacaggatatcctggagagccagggttagtggga
cctccgggccaaccagggcgtccaggtttgaagggaaatcccggtgtgggagtaaagggg
caaatgggagacccgggtgaggttggtcagcaaggttctcctggacccaccctgttggta
gagccacctgacttttgtctctataaaggagaaaagggtataaaaggaattcctggaatg
gttggactgccaggaccaccaggacgcaagggagaatctggtattggggcaaaaggagaa
aaaggtattcctggatttccagggcctcggggggatcctggttcctatggatctccaggt
tttccaggattaaagggagaactaggactggttggagatcctgggctatttggattaatt
ggcccaaagggggatcctggaaatcgagggcacccaggaccaccaggtgttttggtgact
ccacctcttccactcaaaggcccaccaggggacccagggttccctggccgctatggagaa
acaggggatgttggaccacctggtcccccaggtctcttgggcagaccaggggaagcctgt
gcaggcatgataggaccccctgggccacaaggatttcctggtcttcctgggcttccagga
gaagctggtattcctgggagacctgattctgctccaggaaaaccagggaagccaggatca
cctggcttgcctggagcaccaggcctgcagggcctcccaggatcaagtgtgatatactgt
agtgttgggaaccccggaccacaaggaataaaaggcaaagttggtcccccaggaggaaga
ggcccaaaaggagaaaaaggaaatgaaggactctgtgcctgtgagcctggacccatgggc
ccccctggccctccaggacttcctgggaggcaggggagtaagggagacttggggctccct
ggctggcttggaacaaaaggtgacccaggacctcctggtgctgaaggacctccagggcta
ccaggaaagcatggtgcctctggaccacctggcaacaaaggggcgaagggtgacatggtt
gtatcaagagttaaagggcacaaaggagaaagaggtcctgatgggcccccaggatttcca
gggcagccaggatcacatggtcgggatggacatgctggagaaaaaggggatccaggacct
ccaggggatcatgaagatgcgaccccaggtggtaaaggatttcctggacctctgggcccc
ccaggcaaagcaggacctgtggggcccccaggactgggatttcctggtccaccaggagag
cgaggccacccaggagttccaggccacccaggtgtgaggggccctgatggcttgaagggt
cagaaaggtgacacaatttcttgcaacgtaacctaccctgggaggcatggccctccaggt
tttgatggacctccaggtccgaagggatttccaggtccccaaggtgcccctgggctgagt
ggttcagatgggcataaaggcagacctggcacaccaggaacagcggaaataccaggtcca
cctggttttcgtggtgacatgggagatccgggttttggaggtgaaaaggggtcctcccct
gttgggcccccaggccctcccggctcaccaggagtgaatggtcagaaaggaatcccggga
gaccctgcatttggtcacctgggacccccgggaaagaggggtctttcaggagtgccaggg
ataaaaggacccagaggtgatccgggatgtccaggggctgaagggccagctggcattcct
ggattcctaggtctcaaaggtcccaaaggcagagagggacatgctgggtttccaggtgtc
ccaggtccacctggccattcctgtgaaagaggtgctccagggataccagggcaaccggga
ctccctgggtatccaggtagcccaggtgctccaggtgggaaaggacagccgggagatgtg
gggcctcccgggccagctggaatgaaaggcctccccggactcccaggacggcctggggca
catggtcccccaggcctcccaggaatcccaggtccctttggagatgatgggctacctggt
cctccaggtccaaagggaccccgggggctgcctggtttcccaggttttcccggagaaaga
ggaaagcctggtgcagagggatgtcctggcgcaaagggagaacctggagagaagggcatg
tctggccttcctggagaccggggactgagaggggccaaaggagccataggacctcccgga
gatgaaggagaaatggctatcatttcacaaaagggaacacctggggaacctggacctcct
ggagatgatggattcccaggagaaagaggtgataaaggaactcccgggatgcaagggaga
agaggagagccgggaagatacggaccacctggatttcacagaggggaacctggtgagaaa
ggtcagccagggcctcctggacccccaggccctccaggctcaactggtctaagagggttc
attggttttccaggacttccaggtgaccagggtgagccaggttctccaggtccccctgga
ttttcaggaattgatggagcaagaggacctaaaggaaacaaaggtgaccctgccagtcac
tttggtccacctggtccaaagggtgagccaggtagccctggatgtccagggcattttgga
gcatccggagagcagggcttgcctggtattcaagggcccagaggatcacctggaaggcca
gggccacctggctcctctggaccaccagggtgcccaggtgatcacgggatgcctgggctg
aggggacagccaggagaaatgggagaccctgggccaagaggcctccagggggatccaggg
ataccaggtcctccgggaataaaaggtccctccggatcacctggcctgaacggcttgcat
ggattgaaaggtcagaaaggaactaaaggtgcttcaggtttgcatgatgtggggccacct
ggtccagtgggaatacctgggctaaaaggggagagaggagaccctgggagcccaggaatc
tctcctccaggtcctcgtggaaagaaaggtcccccaggacccccagggagttcaggacca
cctggtcctgcaggtgccacaggaagagctcctaaggacattcctgacccgggtccacct
ggagatcagggacctcctggtcctgatggcccaagaggagcacctgggcctccaggcctc
cctgggagtgttgaccttctgagaggggagccaggtgactgtggtctaccagggccacca
ggtccccctggcccaccaggccctccaggatacaaaggctttccaggatgtgatggaaaa
gatggccagaaaggaccagtgggattcccgggaccgcagggaccacatggatttcctggg
ccacctggagagaagggtttacctggacctccagggagaaaagggcccactggtcttccg
ggtcccagaggtgaaccggggccacctgcagatgtggatgactgtccccgaatcccaggc
cttcctggggcgccaggcatgagaggaccagaaggagccatggggctccctggaatgaga
ggcccctcaggaccagggtgcaaaggagagcctgggctggatggcaggaggggtgtggat
ggcgtccctgggtctcctgggcctcccggacgtaaaggtgacacaggagaagacggctac
cctggaggaccagggcctcctggtcccattggggatcctgggcccaaagggtttggccct
ggatacctcggtggcttcctcctggttctccacagtcagacggaccaggagcccacctgc
cccctgggcatgcccaggctctggactgggtatagtctgttatacctggaagggcaagag
aaagctcacaatcaagaccttggtctggcagggtcttgccttcccgtatttagcacgctg
ccctttgcctactgcaacatccaccaggtgtgccactatgcccagagaaacgacagatcc
tactggctggccagcgctgcgcccctccccatgatgccactctctgaagaggcgatccgc
ccctatgtcagccgctgtgcggtatgcgaggccccggcccaggcggtggcggtgcacagc
caggaccagtccatccccccatgtccgcagacctggaggagcctctggatcgggtattca
ttcctgatgcacacaggagctggggaccaaggaggagggcaggcccttatgtcacctggc
agctgcctggaagatttcagagcagcaccattccttgaatgccagggccggcagggaact
tgccactttttcgcaaataagtatagcttctggctcacaacggtgaaagcagacttgcag
ttttcctctgctccagcaccagacaccttaaaagaaagccaggcccaacgccagaaaatc
agccggtgccaggtctgcgtgaagtatagctag

KEGG   Homo sapiens (human): 1287
Entry
1287              CDS       T01001                                 
Symbol
COL4A5, ASLN, ATS, ATS1, CA54
Name
(RefSeq) collagen type IV alpha 5 chain
  KO
K06237  collagen type IV alpha
Organism
hsa  Homo sapiens (human)
Pathway
hsa04151  PI3K-Akt signaling pathway
hsa04510  Focal adhesion
hsa04512  ECM-receptor interaction
hsa04820  Cytoskeleton in muscle cells
hsa04926  Relaxin signaling pathway
hsa04933  AGE-RAGE signaling pathway in diabetic complications
hsa04974  Protein digestion and absorption
hsa05146  Amoebiasis
hsa05165  Human papillomavirus infection
hsa05200  Pathways in cancer
hsa05222  Small cell lung cancer
Network
nt06539  Cytoskeleton in muscle cells
  Element
N01814  Extracellular matrix - Basal lamina
Disease
H00581  Alport syndrome
H01640  Uterine leiomyoma
Brite
KEGG Orthology (KO) [BR:hsa00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    1287 (COL4A5)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    1287 (COL4A5)
 09140 Cellular Processes
  09142 Cell motility
   04820 Cytoskeleton in muscle cells
    1287 (COL4A5)
 09150 Organismal Systems
  09152 Endocrine system
   04926 Relaxin signaling pathway
    1287 (COL4A5)
  09154 Digestive system
   04974 Protein digestion and absorption
    1287 (COL4A5)
 09160 Human Diseases
  09161 Cancer: overview
   05200 Pathways in cancer
    1287 (COL4A5)
  09162 Cancer: specific types
   05222 Small cell lung cancer
    1287 (COL4A5)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    1287 (COL4A5)
  09174 Infectious disease: parasitic
   05146 Amoebiasis
    1287 (COL4A5)
  09167 Endocrine and metabolic disease
   04933 AGE-RAGE signaling pathway in diabetic complications
    1287 (COL4A5)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   04147 Exosome [BR:hsa04147]
    1287 (COL4A5)
   00536 Glycosaminoglycan binding proteins [BR:hsa00536]
    1287 (COL4A5)
Exosome [BR:hsa04147]
 Exosomal proteins
  Exosomal proteins of other cancer cells
   1287 (COL4A5)
Glycosaminoglycan binding proteins [BR:hsa00536]
 Heparan sulfate / Heparin
  Extracellular matrix molecules
   1287 (COL4A5)
SSDB
Motif
Pfam: Collagen C4
Other DBs
NCBI-GeneID: 1287
NCBI-ProteinID: NP_000486
OMIM: 303630
HGNC: 2207
Ensembl: ENSG00000188153
Pharos: P29400(Tbio)
UniProt: P29400 Q49AM6 A7MBN3
Structure
LinkDB
Position
X:108439838..108697545
AA seq 1685 aa
MKLRGVSLAAGLFLLALSLWGQPAEAAACYGCSPGSKCDCSGIKGEKGERGFPGLEGHPG
LPGFPGPEGPPGPRGQKGDDGIPGPPGPKGIRGPPGLPGFPGTPGLPGMPGHDGAPGPQG
IPGCNGTKGERGFPGSPGFPGLQGPPGPPGIPGMKGEPGSIIMSSLPGPKGNPGYPGPPG
IQGLPGPTGIPGPIGPPGPPGLMGPPGPPGLPGPKGNMGLNFQGPKGEKGEQGLQGPPGP
PGQISEQKRPIDVEFQKGDQGLPGDRGPPGPPGIRGPPGPPGGEKGEKGEQGEPGKRGKP
GKDGENGQPGIPGLPGDPGYPGEPGRDGEKGQKGDTGPPGPPGLVIPRPGTGITIGEKGN
IGLPGLPGEKGERGFPGIQGPPGLPGPPGAAVMGPPGPPGFPGERGQKGDEGPPGISIPG
PPGLDGQPGAPGLPGPPGPAGPHIPPSDEICEPGPPGPPGSPGDKGLQGEQGVKGDKGDT
CFNCIGTGISGPPGQPGLPGLPGPPGSLGFPGQKGEKGQAGATGPKGLPGIPGAPGAPGF
PGSKGEPGDILTFPGMKGDKGELGSPGAPGLPGLPGTPGQDGLPGLPGPKGEPGGITFKG
ERGPPGNPGLPGLPGNIGPMGPPGFGPPGPVGEKGIQGVAGNPGQPGIPGPKGDPGQTIT
QPGKPGLPGNPGRDGDVGLPGDPGLPGQPGLPGIPGSKGEPGIPGIGLPGPPGPKGFPGI
PGPPGAPGTPGRIGLEGPPGPPGFPGPKGEPGFALPGPPGPPGLPGFKGALGPKGDRGFP
GPPGPPGRTGLDGLPGPKGDVGPNGQPGPMGPPGLPGIGVQGPPGPPGIPGPIGQPGLHG
IPGEKGDPGPPGLDVPGPPGERGSPGIPGAPGPIGPPGSPGLPGKAGASGFPGTKGEMGM
MGPPGPPGPLGIPGRSGVPGLKGDDGLQGQPGLPGPTGEKGSKGEPGLPGPPGPMDPNLL
GSKGEKGEPGLPGIPGVSGPKGYQGLPGDPGQPGLSGQPGLPGPPGPKGNPGLPGQPGLI
GPPGLKGTIGDMGFPGPQGVEGPPGPSGVPGQPGSPGLPGQKGDKGDPGISSIGLPGLPG
PKGEPGLPGYPGNPGIKGSVGDPGLPGLPGTPGAKGQPGLPGFPGTPGPPGPKGISGPPG
NPGLPGEPGPVGGGGHPGQPGPPGEKGKPGQDGIPGPAGQKGEPGQPGFGNPGPPGLPGL
SGQKGDGGLPGIPGNPGLPGPKGEPGFHGFPGVQGPPGPPGSPGPALEGPKGNPGPQGPP
GRPGLPGPEGPPGLPGNGGIKGEKGNPGQPGLPGLPGLKGDQGPPGLQGNPGRPGLNGMK
GDPGLPGVPGFPGMKGPSGVPGSAGPEGEPGLIGPPGPPGLPGPSGQSIIIKGDAGPPGI
PGQPGLKGLPGPQGPQGLPGPTGPPGDPGRNGLPGFDGAGGRKGDPGLPGQPGTRGLDGP
PGPDGLQGPPGPPGTSSVAHGFLITRHSQTTDAPQCPQGTLQVYEGFSLLYVQGNKRAHG
QDLGTAGSCLRRFSTMPFMFCNINNVCNFASRNDYSYWLSTPEPMPMSMQPLKGQSIQPF
ISRCAVCEAPAVVIAVHSQTIQIPHCPQGWDSLWIGYSFMMHTSAGAEGSGQALASPGSC
LEEFRSAPFIECHGRGTCNYYANSYSFWLATVDVSDMFSKPQSETLKAGDLRTRISRCQV
CMKRT
NT seq 5058 nt   +upstreamnt  +downstreamnt
atgaaactgcgtggagtcagcctggctgccggcttgttcttactggccctgagtctttgg
gggcagcctgcagaggctgcggcttgctatgggtgttctccaggatcaaagtgtgactgc
agtggcataaaaggggaaaagggagagagagggtttccaggtttggaaggacacccagga
ttgcctggatttccaggtccagaagggcctccggggcctcggggacaaaagggtgatgat
ggaattccagggccaccaggaccaaaaggaatcagaggtcctcctggacttcctggattt
ccagggacaccaggtcttcctggaatgccaggccacgatggggccccaggacctcaaggt
attcccggatgcaatggaaccaagggagaacgtggatttccaggcagtcccggttttcct
ggtttacagggtcctccaggaccccctgggatcccaggtatgaagggtgaaccaggtagt
ataattatgtcatcactgccaggaccaaagggtaatccaggatatccaggtcctcctgga
atacaaggcctacctggtcccactggtataccagggccaattggtcccccaggaccacca
ggtttgatgggccctcctggtccaccaggacttccaggacctaaggggaatatgggctta
aatttccagggacccaaaggtgaaaaaggtgagcaaggtcttcagggcccacctgggcca
cctgggcagatcagtgaacagaaaagaccaattgatgtagagtttcagaaaggagatcag
ggacttcctggtgaccgagggcctcctggacctccagggatacgtggtcctccaggtccc
ccaggtggtgagaaaggtgagaagggtgagcaaggagagccaggcaaaagaggtaaacca
ggcaaagatggagaaaatggccaaccaggaattcctggtttgcctggtgatcctggttac
cctggtgaacccggaagggatggtgaaaagggccaaaaaggtgacactggcccacctgga
cctcctggacttgtaattcctagacctgggactggtataactataggagaaaaaggaaac
attgggttgcctgggttgcctggagaaaaaggagagcgaggatttcctggaatacagggt
ccacctggccttcctggacctccaggggctgcagttatgggtcctcctggccctcctgga
tttcctggagaaaggggtcagaaaggtgatgaaggaccacctggaatttccattcctgga
cctcctggacttgacggacagcctggggctcctgggcttccagggcctcctggccctgct
ggccctcacattcctcctagtgatgagatatgtgaaccaggccctccaggccccccagga
tctccaggtgataaaggactccaaggagaacaaggagtgaaaggtgacaaaggtgacact
tgcttcaactgcattggaactggtatttcagggcctccaggtcaacctggtttgccaggt
ctcccaggtcctccaggatctcttggtttccctggacagaaaggggaaaaaggacaagct
ggtgcaactggtcccaaaggattaccaggcattccaggagctccaggtgctccaggcttt
cctggatctaaaggtgaacctggtgatatcctcacttttccaggaatgaagggtgacaaa
ggagagttgggttcccctggagctccagggcttcctggtttacctggcactcctggacag
gatggattgccagggcttcctggcccgaaaggagagcctggtggaattacttttaagggt
gaaagaggtccccctgggaacccaggtttaccaggcctcccagggaatatagggcctatg
ggtccccctggtttcggccctccaggcccagtaggtgaaaaaggcatacaaggtgtggca
ggaaatccaggccagccaggaataccaggtcctaaaggggatccaggtcagactataacc
cagccggggaagcctggcttgcctggtaacccaggcagagatggtgatgtaggtcttcca
ggtgaccctggacttccagggcaaccaggcttgccagggatacctggtagcaaaggagaa
ccaggtatccctggaattgggcttcctggaccacctggtcccaaaggctttcctggaatt
ccaggacctccaggagcacctgggacacctggaagaattggtctagaaggccctcctggg
ccacccggctttccaggaccaaagggtgaaccaggatttgcattacctgggccacctggg
ccaccaggacttccaggtttcaaaggagcacttggtccaaaaggtgatcgtggtttccca
ggacctccgggtcctccaggacgcactggcttagatgggctccctggaccaaaaggtgat
gttggaccaaatggacaacctggaccaatgggacctcctgggctgccaggaataggtgtt
cagggaccaccaggaccaccagggattcctgggccaataggtcaacctggtttacatgga
ataccaggagagaagggggatccaggacctcctggacttgatgttccaggacccccaggt
gaaagaggcagtccagggatccccggagcacctggtcctataggacctccaggatcacca
gggcttccaggaaaagcaggtgcctctggatttccaggtaccaaaggtgaaatgggtatg
atgggacctccaggcccaccaggacctttgggaattcctggcaggagtggtgtacctggt
cttaaaggtgatgatggcttgcagggtcagccaggacttcctggccctacaggagaaaaa
ggtagtaaaggagagcctggccttccaggccctcctggaccaatggatccaaatcttctg
ggctcaaaaggagagaagggggaacctggcttaccaggtatacctggagtttcagggcca
aaaggttatcagggtttgcctggagacccagggcaacctggactgagtggacaacctgga
ttaccaggaccaccaggtcccaaaggtaaccctggtctccctggacagccaggtcttata
ggacctcctggacttaaaggaaccatcggtgatatgggttttccagggcctcagggtgtg
gaagggcctcctggaccttctggagttcctggacaacctggctccccaggattacctgga
cagaaaggcgacaaaggtgatcctggtatttcaagcattggtcttccaggtcttcctggt
ccaaagggtgagcctggtctgcctggatacccagggaaccctggtatcaaaggttctgtg
ggagatcctggtttgcccggattaccaggaacccctggagcaaaaggacaaccaggcctt
cctggattcccaggaaccccaggccctcctggaccaaaaggtattagtggccctcctggg
aaccccggccttccaggagaacctggtcctgtaggtggtggaggtcatcctgggcaacca
gggcctccaggcgaaaaaggcaaacccggtcaagatggtattcctggaccagctggacag
aagggtgaaccaggtcaaccaggctttggaaacccaggaccccctggacttccaggactt
tctggccaaaagggtgatggaggattacctgggattccaggaaatcctggccttccaggt
ccaaagggcgaaccaggctttcacggtttccctggtgtgcagggtcccccaggccctcct
ggttctccgggtccagctctggaaggacctaaaggcaaccctgggccccaaggtcctcct
gggagaccaggtctaccaggtccagaaggtcctccaggtctccctggaaatggaggtatt
aaaggagagaagggaaatccaggccaacctgggctacctggcttgcctggtttgaaagga
gatcaaggaccaccaggactccagggtaatcctggccggccgggtctcaatggaatgaaa
ggagatcctggtctccctggtgttccaggattcccaggcatgaaaggacccagtggagta
cctggatcagctggccctgagggggaaccgggacttattggtcctccaggtcctcctgga
ttacctggtccttcaggacagagtatcataattaaaggagatgctggtcctccaggaatc
cctggccagcctgggctaaagggtctaccaggaccccaaggacctcaaggcttaccaggt
ccaactggccctccaggagatcctggacgcaatggactccctggctttgatggtgcagga
gggcgcaaaggagacccaggtctgccaggacagccaggtacccgtggtttggatggtccc
cctggtccagatggattgcaaggtcccccaggtccccctggaacctcctctgttgcacat
ggatttcttattacacgccacagccagacaacggatgcaccacaatgcccacagggaaca
cttcaggtctatgaaggcttttctctcctgtatgtacaaggaaataaaagagcccacggt
caagacttggggacggctggcagctgccttcgtcgctttagtaccatgcctttcatgttc
tgcaacatcaataatgtttgcaactttgcttcaagaaatgactattcttactggctctct
accccagagcccatgccaatgagcatgcaacccctaaagggccagagcatccagccattc
attagtcgatgtgcagtatgtgaagctccagctgtggtgatcgcagttcacagtcagacg
atccagattccccattgtcctcagggatgggattctctgtggattggttattccttcatg
atgcatacaagtgcaggggcagaaggctcaggtcaagccctagcctcccctggttcctgc
ttggaagagtttcgttcagctcccttcatcgaatgtcatgggaggggtacctgtaactac
tatgccaactcctacagcttttggctggcaactgtagatgtgtcagacatgttcagtaaa
cctcagtcagaaacgctgaaagcaggagacttgaggacacgaattagccgatgtcaagtg
tgcatgaagaggacataa

KEGG   Homo sapiens (human): 1288
Entry
1288              CDS       T01001                                 
Symbol
COL4A6, CXDELq22.3, DELXq22.3, DFNX6
Name
(RefSeq) collagen type IV alpha 6 chain
  KO
K06237  collagen type IV alpha
Organism
hsa  Homo sapiens (human)
Pathway
hsa04151  PI3K-Akt signaling pathway
hsa04510  Focal adhesion
hsa04512  ECM-receptor interaction
hsa04820  Cytoskeleton in muscle cells
hsa04926  Relaxin signaling pathway
hsa04933  AGE-RAGE signaling pathway in diabetic complications
hsa04974  Protein digestion and absorption
hsa05146  Amoebiasis
hsa05165  Human papillomavirus infection
hsa05200  Pathways in cancer
hsa05222  Small cell lung cancer
Network
nt06539  Cytoskeleton in muscle cells
  Element
N01814  Extracellular matrix - Basal lamina
Disease
H01209  Deafness, X-linked
H01640  Uterine leiomyoma
Brite
KEGG Orthology (KO) [BR:hsa00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    1288 (COL4A6)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    1288 (COL4A6)
 09140 Cellular Processes
  09142 Cell motility
   04820 Cytoskeleton in muscle cells
    1288 (COL4A6)
 09150 Organismal Systems
  09152 Endocrine system
   04926 Relaxin signaling pathway
    1288 (COL4A6)
  09154 Digestive system
   04974 Protein digestion and absorption
    1288 (COL4A6)
 09160 Human Diseases
  09161 Cancer: overview
   05200 Pathways in cancer
    1288 (COL4A6)
  09162 Cancer: specific types
   05222 Small cell lung cancer
    1288 (COL4A6)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    1288 (COL4A6)
  09174 Infectious disease: parasitic
   05146 Amoebiasis
    1288 (COL4A6)
  09167 Endocrine and metabolic disease
   04933 AGE-RAGE signaling pathway in diabetic complications
    1288 (COL4A6)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   04147 Exosome [BR:hsa04147]
    1288 (COL4A6)
   00536 Glycosaminoglycan binding proteins [BR:hsa00536]
    1288 (COL4A6)
Exosome [BR:hsa04147]
 Exosomal proteins
  Exosomal proteins of other cancer cells
   1288 (COL4A6)
Glycosaminoglycan binding proteins [BR:hsa00536]
 Heparan sulfate / Heparin
  Extracellular matrix molecules
   1288 (COL4A6)
SSDB
Motif
Pfam: Collagen C4
Other DBs
NCBI-GeneID: 1288
NCBI-ProteinID: NP_001838
OMIM: 303631
HGNC: 2208
Ensembl: ENSG00000197565
Pharos: Q14031(Tbio)
UniProt: Q14031
LinkDB
Position
X:complement(108155614..108439458)
AA seq 1691 aa
MLINKLWLLLVTLCLTEELAAAGEKSYGKPCGGQDCSGSCQCFPEKGARGRPGPIGIQGP
TGPQGFTGSTGLSGLKGERGFPGLLGPYGPKGDKGPMGVPGFLGINGIPGHPGQPGPRGP
PGLDGCNGTQGAVGFPGPDGYPGLLGPPGLPGQKGSKGDPVLAPGSFKGMKGDPGLPGLD
GITGPQGAPGFPGAVGPAGPPGLQGPPGPPGPLGPDGNMGLGFQGEKGVKGDVGLPGPAG
PPPSTGELEFMGFPKGKKGSKGEPGPKGFPGISGPPGFPGLGTTGEKGEKGEKGIPGLPG
PRGPMGSEGVQGPPGQQGKKGTLGFPGLNGFQGIEGQKGDIGLPGPDVFIDIDGAVISGN
PGDPGVPGLPGLKGDEGIQGLRGPSGVPGLPALSGVPGALGPQGFPGLKGDQGNPGRTTI
GAAGLPGRDGLPGPPGPPGPPSPEFETETLHNKESGFPGLRGEQGPKGNLGLKGIKGDSG
FCACDGGVPNTGPPGEPGPPGPWGLIGLPGLKGARGDRGSGGAQGPAGAPGLVGPLGPSG
PKGKKGEPILSTIQGMPGDRGDSGSQGFRGVIGEPGKDGVPGLPGLPGLPGDGGQGFPGE
KGLPGLPGEKGHPGPPGLPGNGLPGLPGPRGLPGDKGKDGLPGQQGLPGSKGITLPCIIP
GSYGPSGFPGTPGFPGPKGSRGLPGTPGQPGSSGSKGEPGSPGLVHLPELPGFPGPRGEK
GLPGFPGLPGKDGLPGMIGSPGLPGSKGATGDIFGAENGAPGEQGLQGLTGHKGFLGDSG
LPGLKGVHGKPGLLGPKGERGSPGTPGQVGQPGTPGSSGPYGIKGKSGLPGAPGFPGISG
HPGKKGTRGKKGPPGSIVKKGLPGLKGLPGNPGLVGLKGSPGSPGVAGLPALSGPKGEKG
SVGFVGFPGIPGLPGIPGTRGLKGIPGSTGKMGPSGRAGTPGEKGDRGNPGPVGIPSPRR
PMSNLWLKGDKGSQGSAGSNGFPGPRGDKGEAGRPGPPGLPGAPGLPGIIKGVSGKPGPP
GFMGIRGLPGLKGSSGITGFPGMPGESGSQGIRGSPGLPGASGLPGLKGDNGQTVEISGS
PGPKGQPGESGFKGTKGRDGLIGNIGFPGNKGEDGKVGVSGDVGLPGAPGFPGVAGMRGE
PGLPGSSGHQGAIGPLGSPGLIGPKGFPGFPGLHGLNGLPGTKGTHGTPGPSITGVPGPA
GLPGPKGEKGYPGIGIGAPGKPGLRGQKGDRGFPGLQGPAGLPGAPGISLPSLIAGQPGD
PGRPGLDGERGRPGPAGPPGPPGPSSNQGDTGDPGFPGIPGPKGPKGDQGIPGFSGLPGE
LGLKGMRGEPGFMGTPGKVGPPGDPGFPGMKGKAGPRGSSGLQGDPGQTPTAEAVQVPPG
PLGLPGIDGIPGLTGDPGAQGPVGLQGSKGLPGIPGKDGPSGLPGPPGALGDPGLPGLQG
PPGFEGAPGQQGPFGMPGMPGQSMRVGYTLVKHSQSEQVPPCPIGMSQLWVGYSLLFVEG
QEKAHNQDLGFAGSCLPRFSTMPFIYCNINEVCHYARRNDKSYWLSTTAPIPMMPVSQTQ
IPQYISRCSVCEAPSQAIAVHSQDITIPQCPLGWRSLWIGYSFLMHTAAGAEGGGQSLVS
PGSCLEDFRATPFIECSGARGTCHYFANKYSFWLTTVEERQQFGELPVSETLKAGQLHTR
VSRCQVCMKSL
NT seq 5076 nt   +upstreamnt  +downstreamnt
atgcttataaacaagttgtggctgctcctggttacgttgtgcctgaccgaggaactggca
gcagcgggagagaagtcttatggaaagccatgtgggggccaggactgcagtgggagctgt
cagtgttttcctgagaaaggagcgagaggacgacctggaccaattggaattcaaggccca
acaggtcctcaaggattcactggctctactggtttatcgggattgaaaggagaaaggggt
ttcccaggccttctgggaccttatggaccaaaaggagataagggtcccatgggagttcct
ggctttcttggcatcaatgggattccgggccaccctggacaaccaggccccagaggccca
cctggtctggatggctgtaatggaactcaaggagctgttggatttccaggccctgatggc
tatcctgggcttctcggaccacccgggcttcctggtcagaaaggatcaaaaggtgaccct
gtccttgctccaggtagtttcaaaggaatgaagggggatcctgggctgcctggactggat
ggaatcactggcccacaaggagcacccggatttcctggagctgtaggacctgcaggacca
ccaggattacaaggtcctccagggcctcctggtcctcttggtcctgatgggaatatgggg
ctaggttttcaaggagagaaaggagtcaagggggatgttggcctccctggcccagcagga
cctccaccatctactggagagctggaattcatgggattccccaaagggaagaaaggatcc
aagggtgaaccagggcctaagggttttccaggcataagtggccctccaggcttcccgggc
cttggaactactggagaaaagggagaaaagggagaaaagggaatccctggtttgccagga
cctaggggtcccatgggttcagaaggagtccaaggccctccagggcaacagggcaagaaa
gggaccctgggatttcctgggcttaatggattccaaggaattgagggtcaaaagggtgac
attggcctgccaggcccagatgttttcatcgatatagatggtgctgtgatctcaggtaat
cctggagatcctggtgtacctggcctcccaggccttaaaggagatgaaggcatccaaggc
ctacgtggcccttctggtgtccctggattgccagcattatcaggtgtcccaggagcccta
gggcctcagggatttccagggctgaagggggaccaaggaaacccaggccgtaccacaatt
ggagcagctggcctccctggcagagatggtttgccaggcccaccaggtccaccaggccca
cctagtccagaatttgagactgaaactctacacaacaaagagtcagggttccctggtctc
cgaggagaacaaggtccaaaaggaaacctaggcctcaaaggaataaaaggagactcaggt
ttctgtgcttgtgacggtggtgttcccaacactggaccacccggggaaccaggcccacct
ggtccatggggtctcataggccttccaggccttaaaggagccagaggagatcgaggctct
gggggtgcacagggcccagcaggggctccaggcttagttgggcctctgggtccttcagga
cccaaaggaaagaagggggaaccaattctcagtacaatccaaggaatgccaggagatcgg
ggtgattctggctcccagggcttccgtggtgtaataggagaaccaggcaaggacggagta
ccaggtttaccaggtctgccaggccttccgggtgatggtggacagggcttcccaggtgaa
aaggggttacctggacttcctggtgaaaaaggccatcctggtccacctggcctcccagga
aatgggttaccaggacttcctggaccccgtgggcttcctggagataaaggcaaggatgga
ttaccgggacaacaaggccttcccggatctaagggaatcaccctgccctgtattattcct
gggtcatacggtccatcaggatttccaggcactcccggattcccaggccctaaagggtct
cgaggcctccctgggaccccaggccagcctgggtcaagtggaagtaaaggagagccaggg
agtccaggattggttcatcttcctgaattaccaggatttcctggacctcgtggggagaag
ggcttgcctgggtttcctgggctccctggaaaagatggcttgcctgggatgattggcagt
ccaggcttacctggttccaagggagccactggtgacatctttggtgctgaaaatggtgct
ccgggggaacaaggcctacaaggattaacagggcacaaaggatttcttggagactctggc
cttccaggactcaagggtgtgcacgggaagcctggcttactaggccccaaaggtgagcgg
ggcagccctgggacaccaggacaggtgggacagccaggcaccccaggatctagtggtcca
tatggcatcaagggcaaatctgggctcccaggagcaccaggcttcccaggcatctcagga
catcctggaaagaaaggaacaagaggcaagaaaggtcctcctggatcaattgtaaagaaa
gggctgccagggctaaaaggccttcctggaaatccaggcctagtaggactgaaaggaagc
ccaggctctccaggggtcgctgggttgccagccctctctggacccaagggagagaagggg
tctgttggattcgtaggttttccaggaataccaggtctgcctggtattcctggaacaaga
ggattaaaaggaattccaggatcaactggaaaaatgggaccatctggacgtgctggtact
cctggtgaaaagggagacagaggcaatccggggccagtcggaatacctagtccaagacgt
ccaatgtcaaacctttggctcaaaggagacaaaggctctcaaggctcagccggatccaat
ggatttcctgggccaagaggtgacaaaggagaggctggtcgacctggaccaccaggccta
cctggagctcctggcctcccaggcattatcaaaggagttagtggaaagccagggccccct
ggcttcatgggaatccggggcttacctggcctgaaggggtcctctgggatcacaggtttc
ccaggaatgccaggagaaagtggttcacaaggtatcagagggtcgcctggactcccagga
gcatctggtctcccaggcctgaaaggagacaacggccagacagttgaaatttccggtagc
ccaggacccaagggacagcctggcgaatctggttttaaaggcacaaaaggaagagatgga
ctaataggcaatataggcttccctggaaacaaaggtgaagatggaaaagttggtgtttct
ggagatgttggccttcctggagctccaggatttccaggagttgccggcatgagaggagaa
ccaggacttccaggttcttctggtcaccaaggggcaattgggcctctaggatcccccgga
ttaataggacccaaaggcttccctggatttcctggtttacatggactgaatgggcttccg
ggcaccaagggtacccatggcactccaggacctagtatcaccggtgtgcctgggcctgct
ggtctccctggacccaaaggagaaaaaggatatccaggaattggcatcggagctccaggg
aagccgggcctgagagggcaaaaaggtgatcgaggtttcccaggtctccagggccctgct
ggtctccccggtgccccaggcatctccttgccctcactcatagcaggacagcctggtgac
cccgggcgaccaggcctagatggagaacgaggccgcccaggccccgctggacccccaggt
ccccctgggccatcctcgaatcaaggcgacaccggagaccctggcttccctggaattcct
ggacctaaagggcctaagggagaccaaggaattccaggtttttctggcctccctggagag
ctaggactgaaaggcatgagaggtgagcctggcttcatggggactccaggcaaggttggg
ccacctggagacccaggatttcccggaatgaaggggaaggcagggccaagaggctcttct
ggcctccaaggtgatcctggacaaacaccaactgcagaagctgtccaggttcctcctgga
cccttgggtctaccagggatcgatggcatccctggcctcactggggaccctggggctcaa
ggccctgtaggcctacaaggctccaaaggtttacctggcatccccggtaaagatggcccc
agtgggctcccaggcccacctggggctcttggtgatcctggtctgcctggactgcaaggc
cctccaggatttgaaggagctccagggcagcaaggccccttcgggatgcctggaatgcct
ggccagagcatgagagtgggctacacgttggtaaagcacagccagtcggaacaggtgccc
ccgtgtcccatcgggatgagccagctgtgggtggggtacagcttactgtttgtggagggg
caagagaaagcccacaaccaggacctgggctttgctggctcctgtctgccccgcttcagc
accatgcccttcatctactgcaacatcaacgaggtgtgccactatgccaggcgcaatgat
aaatcttactggctctccactaccgcccctatccccatgatgcccgtcagccagacccag
attccccagtacatcagccgctgctctgtgtgtgaggcaccctcgcaagccattgctgtg
cacagccaggacatcaccatcccgcagtgccccctgggctggcgcagcctctggattggg
tactctttcctcatgcacactgccgctggtgccgagggtggaggccagtccctggtctca
cctggctcctgcctagaggactttcgggccactcctttcatcgaatgcagtggtgcccga
ggcacctgccactactttgcaaacaagtacagtttctggttgaccacagtggaggagagg
cagcagtttggggagttgcctgtgtctgaaacgctgaaagctgggcagctccacactcga
gtcagtcgctgccaggtgtgtatgaaaagcctgtag

DBGET integrated database retrieval system