KEGG   Homo sapiens (human): 1277
Entry
1277              CDS       T01001                                 

Gene name
COL1A1, CAFYD, EDSARTH1, EDSC, OI1, OI2, OI3, OI4
Definition
(RefSeq) collagen type I alpha 1 chain
KO
K06236  collagen type I alpha
Organism
hsa  Homo sapiens (human)
Pathway
hsa04151  PI3K-Akt signaling pathway
hsa04510  Focal adhesion
hsa04512  ECM-receptor interaction
hsa04611  Platelet activation
hsa04926  Relaxin signaling pathway
hsa04933  AGE-RAGE signaling pathway in diabetic complications
hsa04974  Protein digestion and absorption
hsa05146  Amoebiasis
hsa05165  Human papillomavirus infection
hsa05205  Proteoglycans in cancer
Disease
H00506  Osteogenesis imperfecta
H00613  Infantile cortical hyperostosis
H01593  Osteoporosis
H02243  Ehlers-Danlos syndrome arthrochalasia type
Brite
KEGG Orthology (KO) [BR:hsa00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    1277 (COL1A1)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    1277 (COL1A1)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    1277 (COL1A1)
 09150 Organismal Systems
  09151 Immune system
   04611 Platelet activation
    1277 (COL1A1)
  09152 Endocrine system
   04926 Relaxin signaling pathway
    1277 (COL1A1)
  09154 Digestive system
   04974 Protein digestion and absorption
    1277 (COL1A1)
 09160 Human Diseases
  09161 Cancer: overview
   05205 Proteoglycans in cancer
    1277 (COL1A1)
  09167 Endocrine and metabolic disease
   04933 AGE-RAGE signaling pathway in diabetic complications
    1277 (COL1A1)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    1277 (COL1A1)
  09174 Infectious disease: parasitic
   05146 Amoebiasis
    1277 (COL1A1)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   00536 Glycosaminoglycan binding proteins [BR:hsa00536]
    1277 (COL1A1)
Glycosaminoglycan binding proteins [BR:hsa00536]
 Heparan sulfate / Heparin
  Extracellular matrix molecules
   1277 (COL1A1)
Motif
Pfam: Collagen COLFI VWC
Other DBs
NCBI-GeneID: 1277
NCBI-ProteinID: NP_000079
OMIM: 120150
HGNC: 2197
Ensembl: ENSG00000108821
Vega: OTTHUMG00000148674
Pharos: P02452(Tbio)
UniProt: P02452
Structure
PDB: 5K31
Position
17q21.33
AA seq 1464 aa
MFSFVDLRLLLLLAATALLTHGQEEGQVEGQDEDIPPITCVQNGLRYHDRDVWKPEPCRI
CVCDNGKVLCDDVICDETKNCPGAEVPEGECCPVCPDGSESPTDQETTGVEGPKGDTGPR
GPRGPAGPPGRDGIPGQPGLPGPPGPPGPPGPPGLGGNFAPQLSYGYDEKSTGGISVPGP
MGPSGPRGLPGPPGAPGPQGFQGPPGEPGEPGASGPMGPRGPPGPPGKNGDDGEAGKPGR
PGERGPPGPQGARGLPGTAGLPGMKGHRGFSGLDGAKGDAGPAGPKGEPGSPGENGAPGQ
MGPRGLPGERGRPGAPGPAGARGNDGATGAAGPPGPTGPAGPPGFPGAVGAKGEAGPQGP
RGSEGPQGVRGEPGPPGPAGAAGPAGNPGADGQPGAKGANGAPGIAGAPGFPGARGPSGP
QGPGGPPGPKGNSGEPGAPGSKGDTGAKGEPGPVGVQGPPGPAGEEGKRGARGEPGPTGL
PGPPGERGGPGSRGFPGADGVAGPKGPAGERGSPGPAGPKGSPGEAGRPGEAGLPGAKGL
TGSPGSPGPDGKTGPPGPAGQDGRPGPPGPPGARGQAGVMGFPGPKGAAGEPGKAGERGV
PGPPGAVGPAGKDGEAGAQGPPGPAGPAGERGEQGPAGSPGFQGLPGPAGPPGEAGKPGE
QGVPGDLGAPGPSGARGERGFPGERGVQGPPGPAGPRGANGAPGNDGAKGDAGAPGAPGS
QGAPGLQGMPGERGAAGLPGPKGDRGDAGPKGADGSPGKDGVRGLTGPIGPPGPAGAPGD
KGESGPSGPAGPTGARGAPGDRGEPGPPGPAGFAGPPGADGQPGAKGEPGDAGAKGDAGP
PGPAGPAGPPGPIGNVGAPGAKGARGSAGPPGATGFPGAAGRVGPPGPSGNAGPPGPPGP
AGKEGGKGPRGETGPAGRPGEVGPPGPPGPAGEKGSPGADGPAGAPGTPGPQGIAGQRGV
VGLPGQRGERGFPGLPGPSGEPGKQGPSGASGERGPPGPMGPPGLAGPPGESGREGAPGA
EGSPGRDGSPGAKGDRGETGPAGPPGAPGAPGAPGPVGPAGKSGDRGETGPAGPAGPVGP
VGARGPAGPQGPRGDKGETGEQGDRGIKGHRGFSGLQGPPGPPGSPGEQGPSGASGPAGP
RGPPGSAGAPGKDGLNGLPGPIGPPGPRGRTGDAGPVGPPGPPGPPGPPGPPSAGFDFSF
LPQPPQEKAHDGGRYYRADDANVVRDRDLEVDTTLKSLSQQIENIRSPEGSRKNPARTCR
DLKMCHSDWKSGEYWIDPNQGCNLDAIKVFCNMETGETCVYPTQPSVAQKNWYISKNPKD
KRHVWFGESMTDGFQFEYGGQGSDPADVAIQLTFLRLMSTEASQNITYHCKNSVAYMDQQ
TGNLKKALLLQGSNEIEIRAEGNSRFTYSVTVDGCTSHTGAWGKTVIEYKTTKTSRLPII
DVAPLDVGAPDQEFGFDVGPVCFL
NT seq 4395 nt
atgttcagctttgtggacctccggctcctgctcctcttagcggccaccgccctcctgacg
cacggccaagaggaaggccaagtcgagggccaagacgaagacatcccaccaatcacctgc
gtacagaacggcctcaggtaccatgaccgagacgtgtggaaacccgagccctgccggatc
tgcgtctgcgacaacggcaaggtgttgtgcgatgacgtgatctgtgacgagaccaagaac
tgccccggcgccgaagtccccgagggcgagtgctgtcccgtctgccccgacggctcagag
tcacccaccgaccaagaaaccaccggcgtcgagggacccaagggagacactggcccccga
ggcccaaggggacccgcaggcccccctggccgagatggcatccctggacagcctggactt
cccggaccccccggaccccccggacctcccggaccccctggcctcggaggaaactttgct
ccccagctgtcttatggctatgatgagaaatcaaccggaggaatttccgtgcctggcccc
atgggtccctctggtcctcgtggtctccctggcccccctggtgcacctggtccccaaggc
ttccaaggtccccctggtgagcctggcgagcctggagcttcaggtcccatgggtccccga
ggtcccccaggtccccctggaaagaatggagatgatggggaagctggaaaacctggtcgt
cctggtgagcgtgggcctcctgggcctcagggtgctcgaggattgcccggaacagctggc
ctccctggaatgaagggacacagaggtttcagtggtttggatggtgccaagggagatgct
ggtcctgctggtcctaagggtgagcctggcagccctggtgaaaatggagctcctggtcag
atgggcccccgtggcctgcctggtgagagaggtcgccctggagcccctggccctgctggt
gctcgtggaaatgatggtgctactggtgctgccgggccccctggtcccaccggccccgct
ggtcctcctggcttccctggtgctgttggtgctaagggtgaagctggtccccaagggccc
cgaggctctgaaggtccccagggtgtgcgtggtgagcctggcccccctggccctgctggt
gctgctggccctgctggaaaccctggtgctgatggacagcctggtgctaaaggtgccaat
ggtgctcctggtattgctggtgctcctggcttccctggtgcccgaggcccctctggaccc
cagggccccggcggccctcctggtcccaagggtaacagcggtgaacctggtgctcctggc
agcaaaggagacactggtgctaagggagagcctggccctgttggtgttcaaggaccccct
ggccctgctggagaggaaggaaagcgaggagctcgaggtgaacccggacccactggcctg
cccggaccccctggcgagcgtggtggacctggtagccgtggtttccctggcgcagatggt
gttgctggtcccaagggtcccgctggtgaacgtggttctcctggccctgctggccccaaa
ggatctcctggtgaagctggtcgtcccggtgaagctggtctgcctggtgccaagggtctg
actggaagccctggcagccctggtcctgatggcaaaactggcccccctggtcccgccggt
caagatggtcgccccggacccccaggcccacctggtgcccgtggtcaggctggtgtgatg
ggattccctggacctaaaggtgctgctggagagcccggcaaggctggagagcgaggtgtt
cccggaccccctggcgctgtcggtcctgctggcaaagatggagaggctggagctcaggga
ccccctggccctgctggtcccgctggcgagagaggtgaacaaggccctgctggctccccc
ggattccagggtctccctggtcctgctggtcctccaggtgaagcaggcaaacctggtgaa
cagggtgttcctggagaccttggcgcccctggcccctctggagcaagaggcgagagaggt
ttccctggcgagcgtggtgtgcaaggtccccctggtcctgctggtccccgaggggccaac
ggtgctcccggcaacgatggtgctaagggtgatgctggtgcccctggagctcccggtagc
cagggcgcccctggccttcagggaatgcctggtgaacgtggtgcagctggtcttccaggg
cctaagggtgacagaggtgatgctggtcccaaaggtgctgatggctctcctggcaaagat
ggcgtccgtggtctgactggccccattggtcctcctggccctgctggtgcccctggtgac
aagggtgaaagtggtcccagcggccctgctggtcccactggagctcgtggtgcccccgga
gaccgtggtgagcctggtccccccggccctgctggctttgctggcccccctggtgctgac
ggccaacctggtgctaaaggcgaacctggtgatgctggtgctaaaggcgatgctggtccc
cctggccctgccggacccgctggaccccctggccccattggtaatgttggtgctcctgga
gccaaaggtgctcgcggcagcgctggtccccctggtgctactggtttccctggtgctgct
ggccgagtcggtcctcctggcccctctggaaatgctggaccccctggccctcctggtcct
gctggcaaagaaggcggcaaaggtccccgtggtgagactggccctgctggacgtcctggt
gaagttggtccccctggtccccctggccctgctggcgagaaaggatcccctggtgctgat
ggtcctgctggtgctcctggtactcccgggcctcaaggtattgctggacagcgtggtgtg
gtcggcctgcctggtcagagaggagagagaggcttccctggtcttcctggcccctctggt
gaacctggcaaacaaggtccctctggagcaagtggtgaacgtggtccccctggtcccatg
ggcccccctggattggctggaccccctggtgaatctggacgtgagggggctcctggtgcc
gaaggttcccctggacgagacggttctcctggcgccaagggtgaccgtggtgagaccggc
cccgctggaccccctggtgctcctggtgctcctggtgcccctggccccgttggccctgct
ggcaagagtggtgatcgtggtgagactggtcctgctggtcccgccggtcctgtcggccct
gttggcgcccgtggccccgccggaccccaaggcccccgtggtgacaagggtgagacaggc
gaacagggcgacagaggcataaagggtcaccgtggcttctctggcctccagggtccccct
ggccctcctggctctcctggtgaacaaggtccctctggagcctctggtcctgctggtccc
cgaggtccccctggctctgctggtgctcctggcaaagatggactcaacggtctccctggc
cccattgggccccctggtcctcgcggtcgcactggtgatgctggtcctgttggtcccccc
ggccctcctggacctcctggtccccctggtcctcccagcgctggtttcgacttcagcttc
ctgccccagccacctcaagagaaggctcacgatggtggccgctactaccgggctgatgat
gccaatgtggttcgtgaccgtgacctcgaggtggacaccaccctcaagagcctgagccag
cagatcgagaacatccggagcccagagggcagccgcaagaaccccgcccgcacctgccgt
gacctcaagatgtgccactctgactggaagagtggagagtactggattgaccccaaccaa
ggctgcaacctggatgccatcaaagtcttctgcaacatggagactggtgagacctgcgtg
taccccactcagcccagtgtggcccagaagaactggtacatcagcaagaaccccaaggac
aagaggcatgtctggttcggcgagagcatgaccgatggattccagttcgagtatggcggc
cagggctccgaccctgccgatgtggccatccagctgaccttcctgcgcctgatgtccacc
gaggcctcccagaacatcacctaccactgcaagaacagcgtggcctacatggaccagcag
actggcaacctcaagaaggccctgctcctccagggctccaacgagatcgagatccgcgcc
gagggcaacagccgcttcacctacagcgtcactgtcgatggctgcacgagtcacaccgga
gcctggggcaagacagtgattgaatacaaaaccaccaagacctcccgcctgcccatcatc
gatgtggcccccttggacgttggtgccccagaccaggaattcggcttcgacgttggccct
gtctgcttcctgtaa

KEGG   Homo sapiens (human): 1278
Entry
1278              CDS       T01001                                 

Gene name
COL1A2, EDSARTH2, EDSCV, OI4
Definition
(RefSeq) collagen type I alpha 2 chain
KO
K06236  collagen type I alpha
Organism
hsa  Homo sapiens (human)
Pathway
hsa04151  PI3K-Akt signaling pathway
hsa04510  Focal adhesion
hsa04512  ECM-receptor interaction
hsa04611  Platelet activation
hsa04926  Relaxin signaling pathway
hsa04933  AGE-RAGE signaling pathway in diabetic complications
hsa04974  Protein digestion and absorption
hsa05146  Amoebiasis
hsa05165  Human papillomavirus infection
hsa05205  Proteoglycans in cancer
Disease
H00506  Osteogenesis imperfecta
H01593  Osteoporosis
H02241  Ehlers-Danlos syndrome cardiac valvular type
H02243  Ehlers-Danlos syndrome arthrochalasia type
Brite
KEGG Orthology (KO) [BR:hsa00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    1278 (COL1A2)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    1278 (COL1A2)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    1278 (COL1A2)
 09150 Organismal Systems
  09151 Immune system
   04611 Platelet activation
    1278 (COL1A2)
  09152 Endocrine system
   04926 Relaxin signaling pathway
    1278 (COL1A2)
  09154 Digestive system
   04974 Protein digestion and absorption
    1278 (COL1A2)
 09160 Human Diseases
  09161 Cancer: overview
   05205 Proteoglycans in cancer
    1278 (COL1A2)
  09167 Endocrine and metabolic disease
   04933 AGE-RAGE signaling pathway in diabetic complications
    1278 (COL1A2)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    1278 (COL1A2)
  09174 Infectious disease: parasitic
   05146 Amoebiasis
    1278 (COL1A2)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   00536 Glycosaminoglycan binding proteins [BR:hsa00536]
    1278 (COL1A2)
Glycosaminoglycan binding proteins [BR:hsa00536]
 Heparan sulfate / Heparin
  Extracellular matrix molecules
   1278 (COL1A2)
Motif
Pfam: Collagen COLFI
Other DBs
NCBI-GeneID: 1278
NCBI-ProteinID: NP_000080
OMIM: 120160
HGNC: 2198
Ensembl: ENSG00000164692
Vega: OTTHUMG00000148675
Pharos: P08123(Tbio)
UniProt: P08123 A0A0S2Z3H5
Position
7q21.3
AA seq 1366 aa
MLSFVDTRTLLLLAVTLCLATCQSLQEETVRKGPAGDRGPRGERGPPGPPGRDGEDGPTG
PPGPPGPPGPPGLGGNFAAQYDGKGVGLGPGPMGLMGPRGPPGAAGAPGPQGFQGPAGEP
GEPGQTGPAGARGPAGPPGKAGEDGHPGKPGRPGERGVVGPQGARGFPGTPGLPGFKGIR
GHNGLDGLKGQPGAPGVKGEPGAPGENGTPGQTGARGLPGERGRVGAPGPAGARGSDGSV
GPVGPAGPIGSAGPPGFPGAPGPKGEIGAVGNAGPAGPAGPRGEVGLPGLSGPVGPPGNP
GANGLTGAKGAAGLPGVAGAPGLPGPRGIPGPVGAAGATGARGLVGEPGPAGSKGESGNK
GEPGSAGPQGPPGPSGEEGKRGPNGEAGSAGPPGPPGLRGSPGSRGLPGADGRAGVMGPP
GSRGASGPAGVRGPNGDAGRPGEPGLMGPRGLPGSPGNIGPAGKEGPVGLPGIDGRPGPI
GPAGARGEPGNIGFPGPKGPTGDPGKNGDKGHAGLAGARGAPGPDGNNGAQGPPGPQGVQ
GGKGEQGPPGPPGFQGLPGPSGPAGEVGKPGERGLHGEFGLPGPAGPRGERGPPGESGAA
GPTGPIGSRGPSGPPGPDGNKGEPGVVGAVGTAGPSGPSGLPGERGAAGIPGGKGEKGEP
GLRGEIGNPGRDGARGAPGAVGAPGPAGATGDRGEAGAAGPAGPAGPRGSPGERGEVGPA
GPNGFAGPAGAAGQPGAKGERGAKGPKGENGVVGPTGPVGAAGPAGPNGPPGPAGSRGDG
GPPGMTGFPGAAGRTGPPGPSGISGPPGPPGPAGKEGLRGPRGDQGPVGRTGEVGAVGPP
GFAGEKGPSGEAGTAGPPGTPGPQGLLGAPGILGLPGSRGERGLPGVAGAVGEPGPLGIA
GPPGARGPPGAVGSPGVNGAPGEAGRDGNPGNDGPPGRDGQPGHKGERGYPGNIGPVGAA
GAPGPHGPVGPAGKHGNRGETGPSGPVGPAGAVGPRGPSGPQGIRGDKGEPGEKGPRGLP
GLKGHNGLQGLPGIAGHHGDQGAPGSVGPAGPRGPAGPSGPAGKDGRTGHPGTVGPAGIR
GPQGHQGPAGPPGPPGPPGPPGVSGGGYDFGYDGDFYRADQPRSAPSLRPKDYEVDATLK
SLNNQIETLLTPEGSRKNPARTCRDLRLSHPEWSSGYYWIDPNQGCTMDAIKVYCDFSTG
ETCIRAQPENIPAKNWYRSSKDKKHVWLGETINAGSQFEYNVEGVTSKEMATQLAFMRLL
ANYASQNITYHCKNSIAYMDEETGNLKKAVILQGSNDVELVAEGNSRFTYTVLVDGCSKK
TNEWGKTIIEYKTNKPSRLPFLDIAPLDIGGADQEFFVDIGPVCFK
NT seq 4101 nt
atgctcagctttgtggatacgcggactttgttgctgcttgcagtaaccttatgcctagca
acatgccaatctttacaagaggaaactgtaagaaagggcccagccggagatagaggacca
cgtggagaaaggggtccaccaggccccccaggcagagatggtgaagatggtcccacaggc
cctcctggtccacctggtcctcctggcccccctggtctcggtgggaactttgctgctcag
tatgatggaaaaggagttggacttggccctggaccaatgggcttaatgggacctagaggc
ccacctggtgcagctggagccccaggccctcaaggtttccaaggacctgctggtgagcct
ggtgaacctggtcaaactggtcctgcaggtgctcgtggtccagctggccctcctggcaag
gctggtgaagatggtcaccctggaaaacccggacgacctggtgagagaggagttgttgga
ccacagggtgctcgtggtttccctggaactcctggacttcctggcttcaaaggcattagg
ggacacaatggtctggatggattgaagggacagcccggtgctcctggtgtgaagggtgaa
cctggtgcccctggtgaaaatggaactccaggtcaaacaggagcccgtgggcttcctggt
gagagaggacgtgttggtgcccctggcccagctggtgcccgtggcagtgatggaagtgtg
ggtcccgtgggtcctgctggtcccattgggtctgctggccctccaggcttcccaggtgcc
cctggccccaagggtgaaattggagctgttggtaacgctggtcctgctggtcccgccggt
ccccgtggtgaagtgggtcttccaggcctctccggccccgttggacctcctggtaatcct
ggagcaaacggccttactggtgccaagggtgctgctggccttcccggcgttgctggggct
cccggcctccctggaccccgcggtattcctggccctgttggtgctgccggtgctactggt
gccagaggacttgttggtgagcctggtccagctggctccaaaggagagagcggtaacaag
ggtgagcccggctctgctgggccccaaggtcctcctggtcccagtggtgaagaaggaaag
agaggccctaatggggaagctggatctgccggccctccaggacctcctgggctgagaggt
agtcctggttctcgtggtcttcctggagctgatggcagagctggcgtcatgggccctcct
ggtagtcgtggtgcaagtggccctgctggagtccgaggacctaatggagatgctggtcgc
cctggggagcctggtctcatgggacccagaggtcttcctggttcccctggaaatatcggc
cccgctggaaaagaaggtcctgtcggcctccctggcatcgacggcaggcctggcccaatt
ggcccagctggagcaagaggagagcctggcaacattggattccctggacccaaaggcccc
actggtgatcctggcaaaaacggtgataaaggtcatgctggtcttgctggtgctcggggt
gctccaggtcctgatggaaacaatggtgctcagggacctcctggaccacagggtgttcaa
ggtggaaaaggtgaacagggtccccctggtcctccaggcttccagggtctgcctggcccc
tcaggtcccgctggtgaagttggcaaaccaggagaaaggggtctccatggtgagtttggt
ctccctggtcctgctggtccaagaggggaacgcggtcccccaggtgagagtggtgctgcc
ggtcctactggtcctattggaagccgaggtccttctggacccccagggcctgatggaaac
aagggtgaacctggtgtggttggtgctgtgggcactgctggtccatctggtcctagtgga
ctcccaggagagaggggtgctgctggcatacctggaggcaagggagaaaagggtgaacct
ggtctcagaggtgaaattggtaaccctggcagagatggtgctcgtggtgctcctggtgct
gtaggtgcccctggtcctgctggagccacaggtgaccggggcgaagctggggctgctggt
cctgctggtcctgctggtcctcggggaagccctggtgaacgtggtgaggtcggtcctgct
ggccccaatggatttgctggtcctgctggtgctgctggtcaacctggtgctaaaggagaa
agaggagccaaagggcctaagggtgaaaacggtgttgttggtcccacaggccccgttgga
gctgctggcccagctggtccaaatggtccccccggtcctgctggaagtcgtggtgatgga
ggcccccctggtatgactggtttccctggtgctgctggacggactggtcccccaggaccc
tctggtatttctggccctcctggtccccctggtcctgctgggaaagaagggcttcgtggt
cctcgtggtgaccaaggtccagttggccgaactggagaagtaggtgcagttggtccccct
ggcttcgctggtgagaagggtccctctggagaggctggtactgctggacctcctggcact
ccaggtcctcagggtcttcttggtgctcctggtattctgggtctccctggctcgagaggt
gaacgtggtctaccaggtgttgctggtgctgtgggtgaacctggtcctcttggcattgcc
ggccctcctggggcccgtggtcctcctggtgctgtgggtagtcctggagtcaacggtgct
cctggtgaagctggtcgtgatggcaaccctgggaacgatggtcccccaggtcgcgatggt
caacccggacacaagggagagcgcggttaccctggcaatattggtcccgttggtgctgca
ggtgcacctggtcctcatggccccgtgggtcctgctggcaaacatggaaaccgtggtgaa
actggtccttctggtcctgttggtcctgctggtgctgttggcccaagaggtcctagtggc
ccacaaggcattcgtggcgataagggagagcccggtgaaaaggggcccagaggtcttcct
ggcttaaagggacacaatggattgcaaggtctgcctggtatcgctggtcaccatggtgat
caaggtgctcctggctccgtgggtcctgctggtcctaggggccctgctggtccttctggc
cctgctggaaaagatggtcgcactggacatcctggtacagttggacctgctggcattcga
ggccctcagggtcaccaaggccctgctggcccccctggtccccctggccctcctggacct
ccaggtgtaagcggtggtggttatgactttggttacgatggagacttctacagggctgac
cagcctcgctcagcaccttctctcagacccaaggactatgaagttgatgctactctgaag
tctctcaacaaccagattgagacccttcttactcctgaaggctctagaaagaacccagct
cgcacatgccgtgacttgagactcagccacccagagtggagcagtggttactactggatt
gaccctaaccaaggatgcactatggatgctatcaaagtatactgtgatttctctactggc
gaaacctgtatccgggcccaacctgaaaacatcccagccaagaactggtataggagctcc
aaggacaagaaacacgtctggctaggagaaactatcaatgctggcagccagtttgaatat
aatgtagaaggagtgacttccaaggaaatggctacccaacttgccttcatgcgcctgctg
gccaactatgcctctcagaacatcacctaccactgcaagaacagcattgcatacatggat
gaggagactggcaacctgaaaaaggctgtcattctacagggctctaatgatgttgaactt
gttgctgagggcaacagcaggttcacttacactgttcttgtagatggctgctctaaaaag
acaaatgaatggggaaagacaatcattgaatacaaaacaaataagccatcacgcctgccc
ttccttgatattgcacctttggacatcggtggtgctgaccaggaattctttgtggacatt
ggcccagtctgtttcaaataa

KEGG   Homo sapiens (human): 1281
Entry
1281              CDS       T01001                                 

Gene name
COL3A1, EDS4A, EDSVASC, PMGEDSV
Definition
(RefSeq) collagen type III alpha 1 chain
KO
K19720  collagen type III alpha
Organism
hsa  Homo sapiens (human)
Pathway
hsa04611  Platelet activation
hsa04926  Relaxin signaling pathway
hsa04933  AGE-RAGE signaling pathway in diabetic complications
hsa04974  Protein digestion and absorption
hsa05146  Amoebiasis
Disease
H02242  Ehlers-Danlos syndrome vascular type
Brite
KEGG Orthology (KO) [BR:hsa00001]
 09150 Organismal Systems
  09151 Immune system
   04611 Platelet activation
    1281 (COL3A1)
  09152 Endocrine system
   04926 Relaxin signaling pathway
    1281 (COL3A1)
  09154 Digestive system
   04974 Protein digestion and absorption
    1281 (COL3A1)
 09160 Human Diseases
  09167 Endocrine and metabolic disease
   04933 AGE-RAGE signaling pathway in diabetic complications
    1281 (COL3A1)
  09174 Infectious disease: parasitic
   05146 Amoebiasis
    1281 (COL3A1)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   00536 Glycosaminoglycan binding proteins [BR:hsa00536]
    1281 (COL3A1)
Glycosaminoglycan binding proteins [BR:hsa00536]
 Heparan sulfate / Heparin
  Extracellular matrix molecules
   1281 (COL3A1)
Motif
Pfam: Collagen COLFI VWC
Other DBs
NCBI-GeneID: 1281
NCBI-ProteinID: NP_000081
OMIM: 120180
HGNC: 2201
Ensembl: ENSG00000168542
Vega: OTTHUMG00000132648
Pharos: P02461(Tbio)
UniProt: P02461
Structure
PDB: 4AE2 4AEJ 6FZV 6FZW 4AK3
Position
2q32.2
AA seq 1466 aa
MMSFVQKGSWLLLALLHPTIILAQQEAVEGGCSHLGQSYADRDVWKPEPCQICVCDSGSV
LCDDIICDDQELDCPNPEIPFGECCAVCPQPPTAPTRPPNGQGPQGPKGDPGPPGIPGRN
GDPGIPGQPGSPGSPGPPGICESCPTGPQNYSPQYDSYDVKSGVAVGGLAGYPGPAGPPG
PPGPPGTSGHPGSPGSPGYQGPPGEPGQAGPSGPPGPPGAIGPSGPAGKDGESGRPGRPG
ERGLPGPPGIKGPAGIPGFPGMKGHRGFDGRNGEKGETGAPGLKGENGLPGENGAPGPMG
PRGAPGERGRPGLPGAAGARGNDGARGSDGQPGPPGPPGTAGFPGSPGAKGEVGPAGSPG
SNGAPGQRGEPGPQGHAGAQGPPGPPGINGSPGGKGEMGPAGIPGAPGLMGARGPPGPAG
ANGAPGLRGGAGEPGKNGAKGEPGPRGERGEAGIPGVPGAKGEDGKDGSPGEPGANGLPG
AAGERGAPGFRGPAGPNGIPGEKGPAGERGAPGPAGPRGAAGEPGRDGVPGGPGMRGMPG
SPGGPGSDGKPGPPGSQGESGRPGPPGPSGPRGQPGVMGFPGPKGNDGAPGKNGERGGPG
GPGPQGPPGKNGETGPQGPPGPTGPGGDKGDTGPPGPQGLQGLPGTGGPPGENGKPGEPG
PKGDAGAPGAPGGKGDAGAPGERGPPGLAGAPGLRGGAGPPGPEGGKGAAGPPGPPGAAG
TPGLQGMPGERGGLGSPGPKGDKGEPGGPGADGVPGKDGPRGPTGPIGPPGPAGQPGDKG
EGGAPGLPGIAGPRGSPGERGETGPPGPAGFPGAPGQNGEPGGKGERGAPGEKGEGGPPG
VAGPPGGSGPAGPPGPQGVKGERGSPGGPGAAGFPGARGLPGPPGSNGNPGPPGPSGSPG
KDGPPGPAGNTGAPGSPGVSGPKGDAGQPGEKGSPGAQGPPGAPGPLGIAGITGARGLAG
PPGMPGPRGSPGPQGVKGESGKPGANGLSGERGPPGPQGLPGLAGTAGEPGRDGNPGSDG
LPGRDGSPGGKGDRGENGSPGAPGAPGHPGPPGPVGPAGKSGDRGESGPAGPAGAPGPAG
SRGAPGPQGPRGDKGETGERGAAGIKGHRGFPGNPGAPGSPGPAGQQGAIGSPGPAGPRG
PVGPSGPPGKDGTSGHPGPIGPPGPRGNRGERGSEGSPGHPGQPGPPGPPGAPGPCCGGV
GAAAIAGIGGEKAGGFAPYYGDEPMDFKINTDEIMTSLKSVNGQIESLISPDGSRKNPAR
NCRDLKFCHPELKSGEYWVDPNQGCKLDAIKVFCNMETGETCISANPLNVPRKHWWTDSS
AEKKHVWFGESMDGGFQFSYGNPELPEDVLDVHLAFLRLLSSRASQNITYHCKNSIAYMD
QASGNVKKALKLMGSNEGEFKAEGNSKFTYTVLEDGCTKHTGEWSKTVFEYRTRKAVRLP
IVDIAPYDIGGPDQEFGVDVGPVCFL
NT seq 4401 nt
atgatgagctttgtgcaaaaggggagctggctacttctcgctctgcttcatcccactatt
attttggcacaacaggaagctgttgaaggaggatgttcccatcttggtcagtcctatgcg
gatagagatgtctggaagccagaaccatgccaaatatgtgtctgtgactcaggatccgtt
ctctgcgatgacataatatgtgacgatcaagaattagactgccccaacccagaaattcca
tttggagaatgttgtgcagtttgcccacagcctccaactgctcctactcgccctcctaat
ggtcaaggacctcaaggccccaagggagatccaggccctcctggtattcctgggagaaat
ggtgaccctggtattccaggacaaccagggtcccctggttctcctggcccccctggaatc
tgtgaatcatgccctactggtcctcagaactattctccccagtatgattcatatgatgtc
aagtctggagtagcagtaggaggactcgcaggctatcctggaccagctggccccccaggc
cctcccggtccccctggtacatctggtcatcctggttcccctggatctccaggataccaa
ggaccccctggtgaacctgggcaagctggtccttcaggccctccaggacctcctggtgct
ataggtccatctggtcctgctggaaaagatggagaatcaggtagacccggacgacctgga
gagcgaggattgcctggacctccaggtatcaaaggtccagctgggatacctggattccct
ggtatgaaaggacacagaggcttcgatggacgaaatggagaaaagggtgaaacaggtgct
cctggattaaagggtgaaaatggtcttccaggcgaaaatggagctcctggacccatgggt
ccaagaggggctcctggtgagcgaggacggccaggacttcctggggctgcaggtgctcgg
ggtaatgacggtgctcgaggcagtgatggtcaaccaggccctcctggtcctcctggaact
gccggattccctggatcccctggtgctaagggtgaagttggacctgcagggtctcctggt
tcaaatggtgcccctggacaaagaggagaacctggacctcagggacacgctggtgctcaa
ggtcctcctggccctcctgggattaatggtagtcctggtggtaaaggcgaaatgggtccc
gctggcattcctggagctcctggactgatgggagcccggggtcctccaggaccagccggt
gctaatggtgctcctggactgcgaggtggtgcaggtgagcctggtaagaatggtgccaaa
ggagagcccggaccacgtggtgaacgcggtgaggctggtattccaggtgttccaggagct
aaaggcgaagatggcaaggatggatcacctggagaacctggtgcaaatgggcttccagga
gctgcaggagaaaggggtgcccctgggttccgaggacctgctggaccaaatggcatccca
ggagaaaagggtcctgctggagagcgtggtgctccaggccctgcagggcccagaggagct
gctggagaacctggcagagatggcgtccctggaggtccaggaatgaggggcatgcccgga
agtccaggaggaccaggaagtgatgggaaaccagggcctcccggaagtcaaggagaaagt
ggtcgaccaggtcctcctgggccatctggtccccgaggtcagcctggtgtcatgggcttc
cccggtcctaaaggaaatgatggtgctcctggtaagaatggagaacgaggtggccctgga
ggacctggccctcagggtcctcctggaaagaatggtgaaactggacctcagggaccccca
gggcctactgggcctggtggtgacaaaggagacacaggaccccctggtccacaaggatta
caaggcttgcctggtacaggtggtcctccaggagaaaatggaaaacctggggaaccaggt
ccaaagggtgatgccggtgcacctggagctccaggaggcaagggtgatgctggtgcccct
ggtgaacgtggacctcctggattggcaggggccccaggacttagaggtggagctggtccc
cctggtcccgaaggaggaaagggtgctgctggtcctcctgggccacctggtgctgctggt
actcctggtctgcaaggaatgcctggagaaagaggaggtcttggaagtcctggtccaaag
ggtgacaagggtgaaccaggcggtccaggtgctgatggtgtcccagggaaagatggccca
aggggtcctactggtcctattggtcctcctggcccagctggccagcctggagataagggt
gaaggtggtgcccccggacttccaggtatagctggacctcgtggtagccctggtgagaga
ggtgaaactggccctccaggacctgctggtttccctggtgctcctggacagaatggtgaa
cctggtggtaaaggagaaagaggggctccgggtgagaaaggtgaaggaggccctcctgga
gttgcaggaccccctggaggttctggacctgctggtcctcctggtccccaaggtgtcaaa
ggtgaacgtggcagtcctggtggacctggtgctgctggcttccctggtgctcgtggtctt
cctggtcctcctggtagtaatggtaacccaggacccccaggtcccagcggttctccaggc
aaggatgggcccccaggtcctgcgggtaacactggtgctcctggcagccctggagtgtct
ggaccaaaaggtgatgctggccaaccaggagagaagggatcgcctggtgcccagggccca
ccaggagctccaggcccacttgggattgctgggatcactggagcacggggtcttgcagga
ccaccaggcatgccaggtcctaggggaagccctggccctcagggtgtcaagggtgaaagt
gggaaaccaggagctaacggtctcagtggagaacgtggtccccctggaccccagggtctt
cctggtctggctggtacagctggtgaacctggaagagatggaaaccctggatcagatggt
cttccaggccgagatggatctcctggtggcaagggtgatcgtggtgaaaatggctctcct
ggtgcccctggcgctcctggtcatccaggcccacctggtcctgtcggtccagctggaaag
agtggtgacagaggagaaagtggccctgctggccctgctggtgctcccggtcctgctggt
tcccgaggtgctcctggtcctcaaggcccacgtggtgacaaaggtgaaacaggtgaacgt
ggagctgctggcatcaaaggacatcgaggattccctggtaatccaggtgccccaggttct
ccaggccctgctggtcagcagggtgcaatcggcagtccaggacctgcaggccccagagga
cctgttggacccagtggacctcctggcaaagatggaaccagtggacatccaggtcccatt
ggaccaccagggcctcgaggtaacagaggtgaaagaggatctgagggctccccaggccac
ccagggcaaccaggccctcctggacctcctggtgcccctggtccttgctgtggtggtgtt
ggagccgctgccattgctgggattggaggtgaaaaagctggcggttttgccccgtattat
ggagatgaaccaatggatttcaaaatcaacaccgatgagattatgacttcactcaagtct
gttaatggacaaatagaaagcctcattagtcctgatggttctcgtaaaaaccccgctaga
aactgcagagacctgaaattctgccatcctgaactcaagagtggagaatactgggttgac
cctaaccaaggatgcaaattggatgctatcaaggtattctgtaatatggaaactggggaa
acatgcataagtgccaatcctttgaatgttccacggaaacactggtggacagattctagt
gctgagaagaaacacgtttggtttggagagtccatggatggtggttttcagtttagctac
ggcaatcctgaacttcctgaagatgtccttgatgtgcatctggcattccttcgacttctc
tccagccgagcttcccagaacatcacatatcactgcaaaaatagcattgcatacatggat
caggccagtggaaatgtaaagaaggccctgaagctgatggggtcaaatgaaggtgaattc
aaggctgaaggaaatagcaaattcacctacacagttctggaggatggttgcacgaaacac
actggggaatggagcaaaacagtctttgaatatcgaacacgcaaggctgtgagactacct
attgtagatattgcaccctatgacattggtggtcctgatcaagaatttggtgtggacgtt
ggccctgtttgctttttataa

KEGG   Homo sapiens (human): 1282
Entry
1282              CDS       T01001                                 

Gene name
COL4A1, BSVD, BSVD1, COL4A1s, PADMAL, RATOR
Definition
(RefSeq) collagen type IV alpha 1 chain
KO
K06237  collagen type IV alpha
Organism
hsa  Homo sapiens (human)
Pathway
hsa04151  PI3K-Akt signaling pathway
hsa04510  Focal adhesion
hsa04512  ECM-receptor interaction
hsa04926  Relaxin signaling pathway
hsa04933  AGE-RAGE signaling pathway in diabetic complications
hsa04974  Protein digestion and absorption
hsa05146  Amoebiasis
hsa05165  Human papillomavirus infection
hsa05200  Pathways in cancer
hsa05222  Small cell lung cancer
Disease
H00579  Hereditary angiopathy with nephropathy, aneurysms, and muscle cramps (HANAC)
H00839  Porencephaly
H00877  Brain small vessel disease with hemorrhage
Brite
KEGG Orthology (KO) [BR:hsa00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    1282 (COL4A1)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    1282 (COL4A1)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    1282 (COL4A1)
 09150 Organismal Systems
  09152 Endocrine system
   04926 Relaxin signaling pathway
    1282 (COL4A1)
  09154 Digestive system
   04974 Protein digestion and absorption
    1282 (COL4A1)
 09160 Human Diseases
  09161 Cancer: overview
   05200 Pathways in cancer
    1282 (COL4A1)
  09162 Cancer: specific types
   05222 Small cell lung cancer
    1282 (COL4A1)
  09167 Endocrine and metabolic disease
   04933 AGE-RAGE signaling pathway in diabetic complications
    1282 (COL4A1)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    1282 (COL4A1)
  09174 Infectious disease: parasitic
   05146 Amoebiasis
    1282 (COL4A1)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   04147 Exosome [BR:hsa04147]
    1282 (COL4A1)
   00536 Glycosaminoglycan binding proteins [BR:hsa00536]
    1282 (COL4A1)
Exosome [BR:hsa04147]
 Exosomal proteins
  Exosomal proteins of other cancer cells
   1282 (COL4A1)
Glycosaminoglycan binding proteins [BR:hsa00536]
 Heparan sulfate / Heparin
  Extracellular matrix molecules
   1282 (COL4A1)
Motif
Pfam: Collagen C4
Other DBs
NCBI-GeneID: 1282
NCBI-ProteinID: NP_001836
OMIM: 120130
HGNC: 2202
Ensembl: ENSG00000187498
Vega: OTTHUMG00000017342
Pharos: P02462(Tbio)
UniProt: P02462
Structure
PDB: 5NAY 6MPX 1LI1 5NAX
Position
13q34
AA seq 1669 aa
MGPRLSVWLLLLPAALLLHEEHSRAAAKGGCAGSGCGKCDCHGVKGQKGERGLPGLQGVI
GFPGMQGPEGPQGPPGQKGDTGEPGLPGTKGTRGPPGASGYPGNPGLPGIPGQDGPPGPP
GIPGCNGTKGERGPLGPPGLPGFAGNPGPPGLPGMKGDPGEILGHVPGMLLKGERGFPGI
PGTPGPPGLPGLQGPVGPPGFTGPPGPPGPPGPPGEKGQMGLSFQGPKGDKGDQGVSGPP
GVPGQAQVQEKGDFATKGEKGQKGEPGFQGMPGVGEKGEPGKPGPRGKPGKDGDKGEKGS
PGFPGEPGYPGLIGRQGPQGEKGEAGPPGPPGIVIGTGPLGEKGERGYPGTPGPRGEPGP
KGFPGLPGQPGPPGLPVPGQAGAPGFPGERGEKGDRGFPGTSLPGPSGRDGLPGPPGSPG
PPGQPGYTNGIVECQPGPPGDQGPPGIPGQPGFIGEIGEKGQKGESCLICDIDGYRGPPG
PQGPPGEIGFPGQPGAKGDRGLPGRDGVAGVPGPQGTPGLIGQPGAKGEPGEFYFDLRLK
GDKGDPGFPGQPGMPGRAGSPGRDGHPGLPGPKGSPGSVGLKGERGPPGGVGFPGSRGDT
GPPGPPGYGPAGPIGDKGQAGFPGGPGSPGLPGPKGEPGKIVPLPGPPGAEGLPGSPGFP
GPQGDRGFPGTPGRPGLPGEKGAVGQPGIGFPGPPGPKGVDGLPGDMGPPGTPGRPGFNG
LPGNPGVQGQKGEPGVGLPGLKGLPGLPGIPGTPGEKGSIGVPGVPGEHGAIGPPGLQGI
RGEPGPPGLPGSVGSPGVPGIGPPGARGPPGGQGPPGLSGPPGIKGEKGFPGFPGLDMPG
PKGDKGAQGLPGITGQSGLPGLPGQQGAPGIPGFPGSKGEMGVMGTPGQPGSPGPVGAPG
LPGEKGDHGFPGSSGPRGDPGLKGDKGDVGLPGKPGSMDKVDMGSMKGQKGDQGEKGQIG
PIGEKGSRGDPGTPGVPGKDGQAGQPGQPGPKGDPGISGTPGAPGLPGPKGSVGGMGLPG
TPGEKGVPGIPGPQGSPGLPGDKGAKGEKGQAGPPGIGIPGLRGEKGDQGIAGFPGSPGE
KGEKGSIGIPGMPGSPGLKGSPGSVGYPGSPGLPGEKGDKGLPGLDGIPGVKGEAGLPGT
PGPTGPAGQKGEPGSDGIPGSAGEKGEPGLPGRGFPGFPGAKGDKGSKGEVGFPGLAGSP
GIPGSKGEQGFMGPPGPQGQPGLPGSPGHATEGPKGDRGPQGQPGLPGLPGPMGPPGLPG
IDGVKGDKGNPGWPGAPGVPGPKGDPGFQGMPGIGGSPGITGSKGDMGPPGVPGFQGPKG
LPGLQGIKGDQGDQGVPGAKGLPGPPGPPGPYDIIKGEPGLPGPEGPPGLKGLQGLPGPK
GQQGVTGLVGIPGPPGIPGFDGAPGQKGEMGPAGPTGPRGFPGPPGPDGLPGSMGPPGTP
SVDHGFLVTRHSQTIDDPQCPSGTKILYHGYSLLYVQGNERAHGQDLGTAGSCLRKFSTM
PFLFCNINNVCNFASRNDYSYWLSTPEPMPMSMAPITGENIRPFISRCAVCEAPAMVMAV
HSQTIQIPPCPSGWSSLWIGYSFVMHTSAGAEGSGQALASPGSCLEEFRSAPFIECHGRG
TCNYYANAYSFWLATIERSEMFKKPTPSTLKAGELRTHVSRCQVCMRRT
NT seq 5010 nt
atggggccccggctcagcgtctggctgctgctgctgcccgccgcccttctgctccacgag
gagcacagccgggccgctgcgaagggtggctgtgctggctctggctgtggcaaatgtgac
tgccatggagtgaagggacaaaagggtgaaagaggcctcccggggttacaaggtgtcatt
gggtttcctggaatgcaaggacctgaggggccacagggaccaccaggacaaaagggtgat
actggagaaccaggactacctggaacaaaagggacaagaggacctccgggagcatctggc
taccctggaaacccaggacttcccggaattcctggccaagacggcccgccaggcccccca
ggtattccaggatgcaatggcacaaagggggagagagggccgctcgggcctcctggcttg
cctggtttcgctggaaatcccggaccaccaggcttaccagggatgaagggtgatccaggt
gagatacttggccatgtgcccgggatgctgttgaaaggtgaaagaggatttcccggaatc
ccagggactccaggcccaccaggactgccagggcttcaaggtcctgttgggcctccagga
tttaccggaccaccaggtcccccaggccctcccggccctccaggtgaaaagggacaaatg
ggcttaagttttcaaggaccaaaaggtgacaagggtgaccaaggggtcagtgggcctcca
ggagtaccaggacaagctcaagttcaagaaaaaggagacttcgccaccaagggagaaaag
ggccaaaaaggtgaacctggatttcaggggatgccaggggtcggagagaaaggtgaaccc
ggaaaaccaggacccagaggcaaacccggaaaagatggtgacaaaggggaaaaagggagt
cccggttttcctggtgaacccgggtacccaggactcataggccgccagggcccgcaggga
gaaaagggtgaagcaggtcctcctggcccacctggaattgttataggcacaggacctttg
ggagaaaaaggagagaggggctaccctggaactccggggccaagaggagagccaggccca
aaaggtttcccaggactaccaggccaacccggacctccaggcctccctgtacctgggcag
gctggtgcccctggcttccctggtgaaagaggagaaaaaggtgaccgaggatttcctggt
acatctctgccaggaccaagtggaagagatgggctcccgggtcctcctggttcccctggg
ccccctgggcagcctggctacacaaatggaattgtggaatgtcagcccggacctccaggt
gaccagggtcctcctggaattccagggcagccaggatttataggcgaaattggagagaaa
ggtcaaaaaggagagagttgcctcatctgtgatatagacggatatcgggggcctcccggg
ccacagggacccccgggagaaataggtttcccagggcagccaggggccaagggcgacaga
ggtttgcctggcagagatggtgttgcaggagtgccaggccctcaaggtacaccagggctg
ataggccagccaggagccaagggggagcctggtgagttttatttcgacttgcggctcaaa
ggtgacaaaggagacccaggctttccaggacagcccggcatgccagggagagcgggttct
cctggaagagatggccatccgggtcttcctggccccaagggctcgccgggttctgtagga
ttgaaaggagagcgtggcccccctggaggagttggattcccaggcagtcgtggtgacacc
ggcccccctgggcctccaggatatggtcctgctggtcccattggtgacaaaggacaagca
ggctttcctggaggccctggatccccaggcctgccaggtccaaagggtgaaccaggaaaa
attgttcctttaccaggcccccctggagcagaaggactgccggggtccccaggcttccca
ggtccccaaggagaccgaggctttcccggaaccccaggaaggccaggcctgccaggagag
aagggcgctgtgggccagccaggcattggatttccagggccccccggccccaaaggtgtt
gacggcttacctggagacatggggccaccggggactccaggtcgcccgggatttaatggc
ttacctgggaacccaggtgtgcagggccagaagggagagcctggagttggtctaccggga
ctcaaaggtttgccaggtcttcccggcattcctggcacacccggggagaaggggagcatt
ggggtaccaggcgttcctggagaacatggagcgatcggaccccctgggcttcaggggatc
agaggtgaaccgggacctcctggattgccaggctccgtggggtctccaggagttccagga
ataggcccccctggagctaggggtccccctggaggacagggaccaccggggttgtcaggc
cctcctggaataaaaggagagaagggtttccccggattccctggactggacatgccgggc
cctaaaggagataaaggggctcaaggactccctggcataacgggacagtcggggctccct
ggccttcctggacagcagggggctcctgggattcctgggtttccaggttccaagggagaa
atgggcgtcatggggacccccgggcagccgggctcaccaggaccagtgggtgctcctgga
ttaccgggtgaaaaaggggaccatggctttccgggctcctcaggacccaggggagaccct
ggcttgaaaggtgataagggggatgtcggtctccctggcaagcctggctccatggataag
gtggacatgggcagcatgaagggccagaaaggagaccaaggagagaaaggacaaattgga
ccaattggtgagaagggatcccgaggagaccctgggaccccaggagtgcctggaaaggac
gggcaggcaggacagcctgggcagccaggacctaaaggtgatccaggtataagtggaacc
ccaggtgctccaggacttccgggaccaaaaggatctgttggtggaatgggcttgccagga
acacctggagagaaaggtgtgcctggcatccctggcccacaaggttcacctggcttacct
ggagacaaaggtgcaaaaggagagaaagggcaggcaggcccacctggcataggcatccca
gggctgcgaggtgaaaagggagatcaagggatagcgggtttcccaggaagccctggagag
aagggagaaaaaggaagcattgggatcccaggaatgccagggtccccaggccttaaaggg
tctcccgggagtgttggctatccaggaagtcctgggctacctggagaaaaaggtgacaaa
ggcctcccaggattggatggcatccctggtgtcaaaggagaagcaggtcttcctgggact
cctggccccacaggcccagctggccagaaaggggagccaggcagtgatggaatcccgggg
tcagcaggagagaagggtgaaccaggtctaccaggaagaggattcccagggtttccaggg
gccaaaggagacaaaggttcaaagggtgaggtgggtttcccaggattagccgggagccca
ggaattcctggatccaaaggagagcaaggattcatgggtcctccggggccccagggacag
ccggggttaccgggatccccaggccatgccacggaggggcccaaaggagaccgcggacct
cagggccagcctggcctgccaggacttccgggacccatggggcctccagggcttcctggg
attgatggagttaaaggtgacaaaggaaatccaggctggccaggagcacccggtgtccca
gggcccaagggagaccctggattccagggcatgcctggtattggtggctctccaggaatc
acaggctctaagggtgatatggggcctccaggagttccaggatttcaaggtccaaaaggt
cttcctggcctccagggaattaaaggtgatcaaggcgatcaaggcgtcccgggagctaaa
ggtctcccgggtcctcctggccccccaggtccttacgacatcatcaaaggggagcccggg
ctccctggtcctgagggccccccagggctgaaagggcttcagggactgccaggcccgaaa
ggccagcaaggtgttacaggattggtgggtatacctggacctccaggtattcctgggttt
gacggtgcccctggccagaaaggagagatgggacctgccgggcctactggtccaagagga
tttccaggtccaccaggccccgatgggttgccaggatccatggggcccccaggcacccca
tctgttgatcacggcttccttgtgaccaggcatagtcaaacaatagatgacccacagtgt
ccttctgggaccaaaattctttaccacgggtactctttgctctacgtgcaaggcaatgaa
cgggcccatggccaggacttgggcacggccggcagctgcctgcgcaagttcagcacaatg
cccttcctgttctgcaatattaacaacgtgtgcaactttgcatcacgaaatgactactcg
tactggctgtccacccctgagcccatgcccatgtcaatggcacccatcacgggggaaaac
ataagaccatttattagtaggtgtgctgtgtgtgaggcgcctgccatggtgatggccgtg
cacagccagaccattcagatcccaccgtgccccagcgggtggtcctcgctgtggatcggc
tactcttttgtgatgcacaccagcgctggtgcagaaggctctggccaagccctggcgtcc
cccggctcctgcctggaggagtttagaagtgcgccattcatcgagtgtcacggccgtggg
acctgcaattactacgcaaacgcttacagcttttggctcgccaccatagagaggagcgag
atgttcaagaagcctacgccgtccaccttgaaggcaggggagctgcgcacgcacgtcagc
cgctgccaagtctgtatgagaagaacataa

KEGG   Homo sapiens (human): 1284
Entry
1284              CDS       T01001                                 

Gene name
COL4A2, BSVD2, ICH, POREN2
Definition
(RefSeq) collagen type IV alpha 2 chain
  KO
K06237  collagen type IV alpha
Organism
hsa  Homo sapiens (human)
Pathway
hsa04151  PI3K-Akt signaling pathway
hsa04510  Focal adhesion
hsa04512  ECM-receptor interaction
hsa04926  Relaxin signaling pathway
hsa04933  AGE-RAGE signaling pathway in diabetic complications
hsa04974  Protein digestion and absorption
hsa05146  Amoebiasis
hsa05165  Human papillomavirus infection
hsa05200  Pathways in cancer
hsa05222  Small cell lung cancer
Disease
H00839  Porencephaly
Brite
KEGG Orthology (KO) [BR:hsa00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    1284 (COL4A2)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    1284 (COL4A2)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    1284 (COL4A2)
 09150 Organismal Systems
  09152 Endocrine system
   04926 Relaxin signaling pathway
    1284 (COL4A2)
  09154 Digestive system
   04974 Protein digestion and absorption
    1284 (COL4A2)
 09160 Human Diseases
  09161 Cancer: overview
   05200 Pathways in cancer
    1284 (COL4A2)
  09162 Cancer: specific types
   05222 Small cell lung cancer
    1284 (COL4A2)
  09167 Endocrine and metabolic disease
   04933 AGE-RAGE signaling pathway in diabetic complications
    1284 (COL4A2)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    1284 (COL4A2)
  09174 Infectious disease: parasitic
   05146 Amoebiasis
    1284 (COL4A2)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   04147 Exosome [BR:hsa04147]
    1284 (COL4A2)
   00536 Glycosaminoglycan binding proteins [BR:hsa00536]
    1284 (COL4A2)
Exosome [BR:hsa04147]
 Exosomal proteins
  Exosomal proteins of other cancer cells
   1284 (COL4A2)
Glycosaminoglycan binding proteins [BR:hsa00536]
 Heparan sulfate / Heparin
  Extracellular matrix molecules
   1284 (COL4A2)
Motif
Pfam: Collagen C4
Other DBs
NCBI-GeneID: 1284
NCBI-ProteinID: NP_001837
OMIM: 120090
HGNC: 2203
Ensembl: ENSG00000134871
Vega: OTTHUMG00000017344
Pharos: P08572(Tbio)
UniProt: P08572 A0A024RDW8
Structure
PDB: 1LI1 5NB2 5NAX

Position
13q34
AA seq 1712 aa
MGRDQRAVAGPALRRWLLLGTVTVGFLAQSVLAGVKKFDVPCGGRDCSGGCQCYPEKGGR
GQPGPVGPQGYNGPPGLQGFPGLQGRKGDKGERGAPGVTGPKGDVGARGVSGFPGADGIP
GHPGQGGPRGRPGYDGCNGTQGDSGPQGPPGSEGFTGPPGPQGPKGQKGEPYALPKEERD
RYRGEPGEPGLVGFQGPPGRPGHVGQMGPVGAPGRPGPPGPPGPKGQQGNRGLGFYGVKG
EKGDVGQPGPNGIPSDTLHPIIAPTGVTFHPDQYKGEKGSEGEPGIRGISLKGEEGIMGF
PGLRGYPGLSGEKGSPGQKGSRGLDGYQGPDGPRGPKGEAGDPGPPGLPAYSPHPSLAKG
ARGDPGFPGAQGEPGSQGEPGDPGLPGPPGLSIGDGDQRRGLPGEMGPKGFIGDPGIPAL
YGGPPGPDGKRGPPGPPGLPGPPGPDGFLFGLKGAKGRAGFPGLPGSPGARGPKGWKGDA
GECRCTEGDEAIKGLPGLPGPKGFAGINGEPGRKGDRGDPGQHGLPGFPGLKGVPGNIGA
PGPKGAKGDSRTITTKGERGQPGVPGVPGMKGDDGSPGRDGLDGFPGLPGPPGDGIKGPP
GDPGYPGIPGTKGTPGEMGPPGLGLPGLKGQRGFPGDAGLPGPPGFLGPPGPAGTPGQID
CDTDVKRAVGGDRQEAIQPGCIGGPKGLPGLPGPPGPTGAKGLRGIPGFAGADGGPGPRG
LPGDAGREGFPGPPGFIGPRGSKGAVGLPGPDGSPGPIGLPGPDGPPGERGLPGEVLGAQ
PGPRGDAGVPGQPGLKGLPGDRGPPGFRGSQGMPGMPGLKGQPGLPGPSGQPGLYGPPGL
HGFPGAPGQEGPLGLPGIPGREGLPGDRGDPGDTGAPGPVGMKGLSGDRGDAGFTGEQGH
PGSPGFKGIDGMPGTPGLKGDRGSPGMDGFQGMPGLKGRPGFPGSKGEAGFFGIPGLKGL
AGEPGFKGSRGDPGPPGPPPVILPGMKDIKGEKGDEGPMGLKGYLGAKGIQGMPGIPGLS
GIPGLPGRPGHIKGVKGDIGVPGIPGLPGFPGVAGPPGITGFPGFIGSRGDKGAPGRAGL
YGEIGATGDFGDIGDTINLPGRPGLKGERGTTGIPGLKGFFGEKGTEGDIGFPGITGVTG
VQGPPGLKGQTGFPGLTGPPGSQGELGRIGLPGGKGDDGWPGAPGLPGFPGLRGIRGLHG
LPGTKGFPGSPGSDIHGDPGFPGPPGERGDPGEANTLPGPVGVPGQKGDQGAPGERGPPG
SPGLQGFPGITPPSNISGAPGDKGAPGIFGLKGYRGPPGPPGSAALPGSKGDTGNPGAPG
TPGTKGWAGDSGPQGRPGVFGLPGEKGPRGEQGFMGNTGPTGAVGDRGPKGPKGDPGFPG
APGTVGAPGIAGIPQKIAVQPGTVGPQGRRGPPGAPGEMGPQGPPGEPGFRGAPGKAGPQ
GRGGVSAVPGFRGDEGPIGHQGPIGQEGAPGRPGSPGLPGMPGRSVSIGYLLVKHSQTDQ
EPMCPVGMNKLWSGYSLLYFEGQEKAHNQDLGLAGSCLARFSTMPFLYCNPGDVCYYASR
NDKSYWLSTTAPLPMMPVAEDEIKPYISRCSVCEAPAIAIAVHSQDVSIPHCPAGWRSLW
IGYSFLMHTAAGDEGGGQSLVSPGSCLEDFRATPFIECNGGRGTCHYYANKYSFWLTTIP
EQSFQGSPSADTLKAGLIRTHISRCQVCMKNL
NT seq 5139 nt
atggggagagaccagcgcgcggtggccggccctgccctacggcggtggctgctgctgggg
acagtgaccgtggggttcctcgcccagagcgtcttggcgggtgtgaagaagtttgatgtg
ccgtgtggaggaagagattgcagtgggggctgccagtgctaccctgagaaaggtggacgt
ggtcagcctgggccagtgggcccccaggggtacaatgggccaccaggattacaaggattc
ccgggactgcagggacgtaaaggagacaagggtgaaaggggagcccccggagtaacggga
cccaagggcgacgtgggagcaagaggcgtttctggattccctggtgccgatggaattcct
ggacacccggggcaaggtgggcccaggggaaggccgggctacgatggctgcaacggaacc
cagggagactcaggtccacaggggccccccggctctgaggggttcaccgggcctcccggg
ccccaaggaccaaaagggcagaaaggtgagccttatgcactgcctaaagaggagcgcgac
agatatcggggtgaacctggagagcctggattggtcggtttccagggacctcccggccgc
cctgggcatgtgggacagatgggtccagttggagctccagggagaccaggaccacctgga
ccccctggaccaaaaggacagcaaggcaacagaggacttggtttctacggagttaagggt
gaaaagggtgacgtagggcagccgggacccaacgggattccatcagacaccctccacccc
atcatcgcgcccacaggagtcaccttccacccagatcagtacaagggtgaaaaaggcagt
gagggggaaccaggaataagaggcatttccttgaagggagaagaaggaatcatgggcttt
cctggactgaggggttaccctggcttgagtggtgaaaaaggatcaccaggacagaaggga
agccgaggcctggatggctatcaagggcctgatggaccccggggacccaagggagaagcc
ggagacccagggccccctggactacctgcctactcccctcacccttccctagcaaaaggt
gccagaggtgacccgggattcccaggggcccaaggggagccaggaagccagggtgagcca
ggagacccgggcctcccaggtccccctggcctctccatcggagatggagatcagaggaga
ggcctgccgggtgagatgggacccaagggcttcatcggagaccccggcatccctgcgctc
tacgggggcccacctggacctgatggaaagcgagggcctccaggaccccccgggctccct
ggaccacctggacctgatggcttcctgtttgggctgaaaggagcaaaaggaagagcaggc
ttccctgggcttcccggctcccctggagcccgcggaccaaaggggtggaaaggtgacgct
ggggaatgcagatgtacagaaggcgacgaagctatcaaaggtcttccgggactgccagga
cccaagggcttcgcaggcatcaacggggagccggggaggaaaggggacagaggagacccc
ggccaacacggcctccctgggttcccagggctcaagggagtgcctggcaacattggtgct
cccggacccaaaggagcaaaaggagattccagaacaatcacaaccaaaggtgagcgggga
cagcccggcgtcccaggtgtgcccgggatgaaaggtgacgatggcagcccaggccgcgat
gggctcgatggattccccggcctcccaggccctcccggtgatggcatcaagggccctcca
ggggacccaggctatccaggaatacctggaacgaagggtactccaggagaaatgggcccc
ccaggactgggccttcccggcctcaaaggccaacgtggtttccctggagacgccggctta
cctggaccaccaggcttcctgggccctcctggccccgcagggaccccaggacaaatagat
tgtgacacagatgtgaaaagggccgttggaggtgacagacaggaggccatccagccaggt
tgcataggagggcccaagggattgccaggcctgccaggacccccaggccccacaggtgcc
aaaggcctccgaggaatcccaggcttcgcaggagctgatggaggaccagggcccaggggc
ttgccaggagacgcaggtcgtgaagggttcccaggacccccagggttcataggaccccga
ggatccaaaggtgcagtgggcctccctggcccagatggatccccaggtcccatcggcctg
ccagggccagatgggccccctggggaaaggggcctccctggagaagtcctgggagctcag
cccgggccacggggagatgctggtgtgcctggacagcctgggcttaaaggccttcccgga
gacagaggcccccctggattcagaggaagccaagggatgcctgggatgccagggctgaag
ggccagccaggcctcccaggaccttccggccagccaggcctgtatgggcctccaggactg
catggattcccaggagctcctggccaagaggggcccttggggctgccaggaatcccaggc
cgtgaaggtctgcctggtgatagaggggaccctggggacacaggcgctcctggccctgtg
ggcatgaaaggtctctctggtgacagaggagatgctggcttcacaggggagcaaggccat
ccaggaagccctggatttaaaggaattgatggaatgcctgggacccccgggctaaaagga
gatagaggctcacctgggatggatggtttccaaggcatgcctggactcaaagggagaccc
gggtttccagggagcaaaggcgaggctggatttttcggaatacccggtctgaagggtctg
gctggtgagccaggttttaaaggcagccgaggggaccctgggcccccaggaccacctcct
gtcatcctgccaggaatgaaagacattaaaggagagaaaggagatgaagggcctatgggg
ctgaaaggatacctgggcgcaaaaggtatccaaggaatgccaggcatcccagggctgtca
ggaatccctgggctgcctgggaggcccggccacatcaaaggagtcaagggagacatcgga
gtccccggcatccccggtttgccaggattccctggggtggctggcccccctggaattacg
ggattcccaggattcataggaagccggggtgacaaaggtgccccagggagagcaggcctg
tatggcgagattggcgcgactggtgatttcggtgacatcggggacactataaatttacca
ggaagaccaggcctgaagggggagcggggcaccactggaataccaggtctgaagggattc
tttggagagaagggaacagaaggtgacatcggcttccctgggataacaggcgtgactgga
gtccaaggccctcctggacttaaaggacaaacaggctttccagggctgactgggcctcca
gggtcgcagggagagctggggcggattggactgcctggtggcaaaggagatgatggctgg
ccgggagctccgggcttaccaggttttccgggactccgtgggatccgcggcttacacggc
ttgccaggcaccaagggctttccaggatccccaggttctgacatccacggagacccaggc
ttcccaggccctcctggggaaagaggtgacccaggagaggccaacacccttccaggccct
gtgggagtcccaggacagaaaggagaccaaggagctccaggggaacgaggcccacctggg
agcccaggacttcaggggttccctggtatcacacccccttccaacatctctggggcacct
ggtgacaaaggggcgccagggatatttggcctgaaaggttatcggggcccaccagggcca
ccaggttctgctgctcttcctggaagcaaaggtgacacagggaacccaggagctccagga
accccagggaccaaaggatgggccggggactccgggccccagggcaggcctggtgtgttt
ggtctcccaggagaaaaagggcccaggggtgaacaaggcttcatggggaacactggaccc
actggggcggtgggcgacagaggccccaagggacccaagggagacccaggattccctggt
gcccccgggactgtgggagcccccgggattgcaggaatcccccagaagattgccgtccaa
ccagggacagtgggtccccaggggaggcgaggcccccctggggcaccgggggagatgggg
ccccagggcccccccggagaaccaggtttccgtggggctccagggaaagctgggccccaa
ggaagaggtggtgtgtctgctgttcccggcttccggggagatgaaggacccataggccac
caggggccgattggccaagaaggtgcaccaggccgtccagggagcccgggcctgccgggt
atgccaggccgcagcgtcagcatcggctacctcctggtgaagcacagccagacggaccag
gagcccatgtgcccagtgggcatgaacaaactctggagtggatacagcctgctgtacttc
gagggccaggagaaggcgcacaaccaggacctggggctggcgggctcctgcctggcgcgg
ttcagcaccatgcccttcctgtactgcaaccctggtgatgtctgctactatgccagccgg
aacgacaagtcctactggctctctaccactgcgccgctgcccatgatgcccgtggccgag
gacgagatcaagccctacatcagccgctgttctgtgtgtgaggccccggccatcgccatc
gcggtccacagtcaggatgtctccatcccacactgcccagctgggtggcggagtttgtgg
atcggatattccttcctcatgcacacggcggcgggagacgaaggcggtggccaatcactg
gtgtcaccgggcagctgtctagaggacttccgcgccacaccattcatcgaatgcaatgga
ggccgcggcacctgccactactacgccaacaagtacagcttctggctgaccaccattccc
gagcagagcttccagggctcgccctccgccgacacgctcaaggccggcctcatccgcaca
cacatcagccgctgccaggtgtgcatgaagaacctgtga

KEGG   Homo sapiens (human): 1285
Entry
1285              CDS       T01001                                 

Gene name
COL4A3, ATS2, ATS3
Definition
(RefSeq) collagen type IV alpha 3 chain
  KO
K06237  collagen type IV alpha
Organism
hsa  Homo sapiens (human)
Pathway
hsa04151  PI3K-Akt signaling pathway
hsa04510  Focal adhesion
hsa04512  ECM-receptor interaction
hsa04926  Relaxin signaling pathway
hsa04933  AGE-RAGE signaling pathway in diabetic complications
hsa04974  Protein digestion and absorption
hsa05146  Amoebiasis
hsa05165  Human papillomavirus infection
hsa05200  Pathways in cancer
hsa05222  Small cell lung cancer
Disease
H00581  Alport syndrome
H00582  Benign familial hematuria
Brite
KEGG Orthology (KO) [BR:hsa00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    1285 (COL4A3)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    1285 (COL4A3)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    1285 (COL4A3)
 09150 Organismal Systems
  09152 Endocrine system
   04926 Relaxin signaling pathway
    1285 (COL4A3)
  09154 Digestive system
   04974 Protein digestion and absorption
    1285 (COL4A3)
 09160 Human Diseases
  09161 Cancer: overview
   05200 Pathways in cancer
    1285 (COL4A3)
  09162 Cancer: specific types
   05222 Small cell lung cancer
    1285 (COL4A3)
  09167 Endocrine and metabolic disease
   04933 AGE-RAGE signaling pathway in diabetic complications
    1285 (COL4A3)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    1285 (COL4A3)
  09174 Infectious disease: parasitic
   05146 Amoebiasis
    1285 (COL4A3)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   04147 Exosome [BR:hsa04147]
    1285 (COL4A3)
   00536 Glycosaminoglycan binding proteins [BR:hsa00536]
    1285 (COL4A3)
Exosome [BR:hsa04147]
 Exosomal proteins
  Exosomal proteins of other cancer cells
   1285 (COL4A3)
Glycosaminoglycan binding proteins [BR:hsa00536]
 Heparan sulfate / Heparin
  Extracellular matrix molecules
   1285 (COL4A3)
Motif
Pfam: Collagen C4
Other DBs
NCBI-GeneID: 1285
NCBI-ProteinID: NP_000082
OMIM: 120070
HGNC: 2204
Ensembl: ENSG00000169031
Vega: OTTHUMG00000149891
Pharos: Q01955(Tbio)
UniProt: Q01955
Structure
PDB: 5NB0

Position
2q36.3
AA seq 1670 aa
MSARTAPRPQVLLLPLLLVLLAAAPAASKGCVCKDKGQCFCDGAKGEKGEKGFPGPPGSP
GQKGFTGPEGLPGPQGPKGFPGLPGLTGSKGVRGISGLPGFSGSPGLPGTPGNTGPYGLV
GVPGCSGSKGEQGFPGLPGTLGYPGIPGAAGLKGQKGAPAKEEDIELDAKGDPGLPGAPG
PQGLPGPPGFPGPVGPPGPPGFFGFPGAMGPRGPKGHMGERVIGHKGERGVKGLTGPPGP
PGTVIVTLTGPDNRTDLKGEKGDKGAMGEPGPPGPSGLPGESYGSEKGAPGDPGLQGKPG
KDGVPGFPGSEGVKGNRGFPGLMGEDGIKGQKGDIGPPGFRGPTEYYDTYQEKGDEGTPG
PPGPRGARGPQGPSGPPGVPGSPGSSRPGLRGAPGWPGLKGSKGERGRPGKDAMGTPGSP
GCAGSPGLPGSPGPPGPPGDIVFRKGPPGDHGLPGYLGSPGIPGVDGPKGEPGLLCTQCP
YIPGPPGLPGLPGLHGVKGIPGRQGAAGLKGSPGSPGNTGLPGFPGFPGAQGDPGLKGEK
GETLQPEGQVGVPGDPGLRGQPGRKGLDGIPGTPGVKGLPGPKGELALSGEKGDQGPPGD
PGSPGSPGPAGPAGPPGYGPQGEPGLQGTQGVPGAPGPPGEAGPRGELSVSTPVPGPPGP
PGPPGHPGPQGPPGIPGSLGKCGDPGLPGPDGEPGIPGIGFPGPPGPKGDQGFPGTKGSL
GCPGKMGEPGLPGKPGLPGAKGEPAVAMPGGPGTPGFPGERGNSGEHGEIGLPGLPGLPG
TPGNEGLDGPRGDPGQPGPPGEQGPPGRCIEGPRGAQGLPGLNGLKGQQGRRGKTGPKGD
PGIPGLDRSGFPGETGSPGIPGHQGEMGPLGQRGYPGNPGILGPPGEDGVIGMMGFPGAI
GPPGPPGNPGTPGQRGSPGIPGVKGQRGTPGAKGEQGDKGNPGPSEISHVIGDKGEPGLK
GFAGNPGEKGNRGVPGMPGLKGLKGLPGPAGPPGPRGDLGSTGNPGEPGLRGIPGSMGNM
GMPGSKGKRGTLGFPGRAGRPGLPGIHGLQGDKGEPGYSEGTRPGPPGPTGDPGLPGDMG
KKGEMGQPGPPGHLGPAGPEGAPGSPGSPGLPGKPGPHGDLGFKGIKGLLGPPGIRGPPG
LPGFPGSPGPMGIRGDQGRDGIPGPAGEKGETGLLRAPPGPRGNPGAQGAKGDRGAPGFP
GLPGRKGAMGDAGPRGPTGIEGFPGPPGLPGAIIPGQTGNRGPPGSRGSPGAPGPPGPPG
SHVIGIKGDKGSMGHPGPKGPPGTAGDMGPPGRLGAPGTPGLPGPRGDPGFQGFPGVKGE
KGNPGFLGSIGPPGPIGPKGPPGVRGDPGTLKIISLPGSPGPPGTPGEPGMQGEPGPPGP
PGNLGPCGPRGKPGKDGKPGTPGPAGEKGNKGSKGEPGPAGSDGLPGLKGKRGDSGSPAT
WTTRGFVFTRHSQTTAIPSCPEGTVPLYSGFSFLFVQGNQRAHGQDLGTLGSCLQRFTTM
PFLFCNVNDVCNFASRNDYSYWLSTPALMPMNMAPITGRALEPYISRCTVCEGPAIAIAV
HSQTTDIPPCPHGWISLWKGFSFIMFTSAGSEGTGQALASPGSCLEEFRASPFLECHGRG
TCNYYSNSYSFWLASLNPERMFRKPIPSTVKAGELEKIISRCQVCMKKRH
NT seq 5013 nt
atgagcgcccggaccgcccccaggccgcaggtgctcctgctgccgctcctgctggtgctc
ctggcggcggcgcccgcagccagcaagggttgtgtctgtaaagacaaaggccagtgcttc
tgtgacggggccaaaggggagaagggggagaagggctttcctggaccccccggttctcct
ggccagaaaggattcacaggtcctgaaggcttgcctggaccgcagggacccaagggcttt
ccaggacttccaggactcacgggttccaaaggtgtaaggggaataagtggattgccagga
ttttctggttctcctggacttccaggcaccccaggcaataccgggccttacggacttgtc
ggtgtaccaggatgcagtggttctaagggtgagcaggggtttccaggactcccagggaca
ctgggctacccagggatcccgggtgctgctggtttgaaaggacaaaagggtgctcctgct
aaagaagaagatatagaacttgatgcaaaaggcgaccccgggttgccaggggctccagga
ccccagggtttgccaggccctccaggttttcctgggcctgttggcccacctggtcctccg
ggattctttggctttccaggagccatgggacctagaggacctaagggtcacatgggtgaa
agagtgataggacataaaggagagcggggtgtgaaagggttaacaggacccccgggacca
ccaggaacagttattgtgaccctaactggcccagataacagaacggacctcaagggggaa
aagggagacaagggagcaatgggcgagcctggacctcctggaccctcaggactgcctgga
gaatcatatggatctgaaaagggtgctcctggagaccctggcctgcagggaaaacccgga
aaagatggtgttcctggcttccctggaagtgagggagtcaagggcaacaggggtttccct
gggttaatgggtgaagatggcattaagggacagaaaggggacattggccctccaggattt
cgtggtccaacagaatattatgacacataccaggaaaagggagatgaaggcactccaggc
ccaccagggcccagaggagctcgtggcccacaaggtcccagtggtccccccggagttcct
ggaagtcctggatcatcaaggcctggcctcagaggagcccctggatggccaggcctgaaa
ggaagtaaaggggaacgaggccgcccaggaaaggatgccatggggactcctgggtcccca
ggttgtgctggttcaccaggtcttccaggatcaccgggacctccaggaccgccaggtgac
atcgtttttcgcaagggtccacctggagatcacggactgccaggctatctagggtctcca
ggaatcccaggagttgatgggcccaaaggagaaccaggcctcctgtgtacacagtgccct
tatatcccagggcctcccggtctcccaggattgccagggttacatggtgtaaaaggaatc
ccaggaagacaaggcgcagctggcttgaaaggaagcccagggtccccaggaaatacaggt
cttccaggatttccaggtttcccaggtgcccagggtgacccaggacttaaaggagaaaaa
ggtgaaacacttcagcctgaggggcaagtgggtgtcccaggtgacccggggctcagaggc
caacctgggagaaagggcttggatggaattcctggaactccgggagtgaaaggattacca
ggacctaaaggcgaactggctctgagtggtgagaaaggggaccaaggtcctccaggggat
cctggctcccctgggtccccaggacctgcaggaccagctggaccacctggctacggaccc
caaggagaacctggtctccagggcacgcaaggagttcctggagcccccggaccacccgga
gaagccggccctaggggagagctcagtgtttcaacaccagttccaggcccaccaggacct
ccagggccccctggccatcctggcccccaaggtccacctggtatccctggatccctgggg
aaatgtggagatcctggtcttccagggcctgatggtgaaccaggaattccaggaattgga
tttcctgggcctcctggacctaagggagaccaaggttttccaggtacaaaaggatcactg
ggttgtcctggaaaaatgggagagcctgggttacctggaaagccaggcctcccaggagcc
aagggagaaccagcagtagccatgcctggaggaccaggaacaccaggttttccaggagaa
agaggcaattctggggaacatggagaaattggactccctggacttccaggtctccctgga
actccaggaaatgaagggcttgatggaccacgaggagatccagggcagcctggaccacct
ggagaacaaggacccccaggaaggtgcatagagggtcccaggggagcccaaggacttcca
ggcttaaatggattgaaagggcaacaaggcagaagaggtaaaacggggccaaagggagac
ccaggaattccaggcttggatagatcaggatttcctggagaaactggatcaccaggaatt
ccaggtcatcaaggtgaaatgggaccactgggtcaaagaggatatccaggaaatccggga
attttagggccaccaggtgaagatggagtgattgggatgatgggctttcctggagccatt
ggccctccagggccccctgggaacccaggcacaccagggcagagggggagccctggaatt
ccaggagtaaagggccagagaggaaccccaggagccaagggggaacaaggagataaagga
aatcccgggccttcagagatatcccacgtaataggggacaaaggagaaccaggtctcaaa
ggattcgcaggaaatccaggtgagaaaggaaacagaggcgttccagggatgccaggttta
aagggcctcaaaggactacccggaccagcaggaccaccaggccccagaggagatttgggc
agcactgggaatcctggagaaccaggactgcgtggtataccaggaagcatggggaacatg
ggcatgccaggttctaaaggaaaaaggggaactttgggattcccaggtcgagcaggaaga
ccaggcctcccaggtattcatggtctccagggagataagggagagccaggttattcagaa
ggtacaaggccaggaccaccgggaccaacgggggatccaggactgccgggtgatatggga
aagaaaggagaaatggggcaacctggcccacctggacatttggggcctgctggacctgag
ggagcccctggaagtcctggaagtcctggcctcccaggaaagccaggtcctcatggtgat
ttgggttttaaaggaatcaaaggcctcctgggccctccaggaatcagaggccctccaggt
cttccaggatttccaggatctcctggaccaatgggtataagaggtgaccaaggacgtgat
ggaattcctggtccagccggagaaaagggagaaacgggtttattgagggcccctccaggc
ccaagagggaaccctggtgctcaaggagccaaaggagacaggggagccccaggttttcct
ggcctcccgggcagaaaaggggccatgggagatgctggacctcgaggacccacaggcata
gaaggattcccagggccaccaggtctgcccggtgcaattatccctggccagacaggaaat
cgtggtccaccaggctcaagaggaagcccaggtgcgcctggtccccctggacctccaggg
agtcatgtaataggcataaaaggagacaaagggtctatgggccaccctggcccaaaaggt
ccacctggaactgcaggagacatgggaccaccaggtcgtctgggagcaccaggtactcca
ggtcttccaggacccagaggtgatcctggattccaggggtttccaggcgtgaaaggagaa
aagggtaatcctggatttctaggatccattggacctccaggaccaattgggccaaaagga
ccacctggtgtacgtggagaccctggcacacttaagattatctcccttccaggaagccca
gggccacctggcacacctggagaaccagggatgcagggagaacctgggccaccagggcca
cctggaaacctaggaccctgtgggccaagaggtaagccaggcaaggatggaaaaccagga
actcctggaccagctggagaaaaaggcaacaaaggttctaaaggagagccaggaccagct
ggatcagatggattgccaggtttgaaaggaaaacgtggagacagtggatcacctgcaacc
tggacaacgagaggctttgtcttcacccgacacagtcaaaccacagcaattccttcatgt
ccagaggggacagtgccactctacagtgggttttcttttctttttgtacaaggaaatcaa
cgagcccacggacaagaccttggaactcttggcagctgcctgcagcgatttaccacaatg
ccattcttattctgcaatgtcaatgatgtatgtaattttgcatctcgaaatgattattca
tactggctgtcaacaccagctctgatgccaatgaacatggctcccattactggcagagcc
cttgagccttatataagcagatgcactgtttgtgaaggtcctgcgatcgccatagccgtt
cacagccaaaccactgacattcctccatgtcctcacggctggatttctctctggaaagga
ttttcattcatcatgttcacaagtgcaggttctgagggcaccgggcaagcactggcctcc
cctggctcctgcctggaagaattccgagccagcccatttctagaatgtcatggaagagga
acgtgcaactactattcaaattcctacagtttctggctggcttcattaaacccagaaaga
atgttcagaaagcctattccatcaactgtgaaagctggggaattagaaaaaataataagt
cgctgtcaggtgtgcatgaagaaaagacactga

KEGG   Homo sapiens (human): 1286
Entry
1286              CDS       T01001                                 

Gene name
COL4A4, ATS2, BFH, CA44
Definition
(RefSeq) collagen type IV alpha 4 chain
  KO
K06237  collagen type IV alpha
Organism
hsa  Homo sapiens (human)
Pathway
hsa04151  PI3K-Akt signaling pathway
hsa04510  Focal adhesion
hsa04512  ECM-receptor interaction
hsa04926  Relaxin signaling pathway
hsa04933  AGE-RAGE signaling pathway in diabetic complications
hsa04974  Protein digestion and absorption
hsa05146  Amoebiasis
hsa05165  Human papillomavirus infection
hsa05200  Pathways in cancer
hsa05222  Small cell lung cancer
Disease
H00581  Alport syndrome
H00582  Benign familial hematuria
Brite
KEGG Orthology (KO) [BR:hsa00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    1286 (COL4A4)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    1286 (COL4A4)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    1286 (COL4A4)
 09150 Organismal Systems
  09152 Endocrine system
   04926 Relaxin signaling pathway
    1286 (COL4A4)
  09154 Digestive system
   04974 Protein digestion and absorption
    1286 (COL4A4)
 09160 Human Diseases
  09161 Cancer: overview
   05200 Pathways in cancer
    1286 (COL4A4)
  09162 Cancer: specific types
   05222 Small cell lung cancer
    1286 (COL4A4)
  09167 Endocrine and metabolic disease
   04933 AGE-RAGE signaling pathway in diabetic complications
    1286 (COL4A4)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    1286 (COL4A4)
  09174 Infectious disease: parasitic
   05146 Amoebiasis
    1286 (COL4A4)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   04147 Exosome [BR:hsa04147]
    1286 (COL4A4)
   00536 Glycosaminoglycan binding proteins [BR:hsa00536]
    1286 (COL4A4)
Exosome [BR:hsa04147]
 Exosomal proteins
  Exosomal proteins of other cancer cells
   1286 (COL4A4)
Glycosaminoglycan binding proteins [BR:hsa00536]
 Heparan sulfate / Heparin
  Extracellular matrix molecules
   1286 (COL4A4)
Motif
Pfam: Collagen C4
Other DBs
NCBI-GeneID: 1286
NCBI-ProteinID: NP_000083
OMIM: 120131
HGNC: 2206
Ensembl: ENSG00000081052
Vega: OTTHUMG00000149892
Pharos: P53420(Tbio)
UniProt: P53420
Structure
PDB: 5NB1

Position
2q36.3
AA seq 1690 aa
MWSLHIVLMRCSFRLTKSLATGPWSLILILFSVQYVYGSGKKYIGPCGGRDCSVCHCVPE
KGSRGPPGPPGPQGPIGPLGAPGPIGLSGEKGMRGDRGPPGAAGDKGDKGPTGVPGFPGL
DGIPGHPGPPGPRGKPGMSGHNGSRGDPGFPGGRGALGPGGPLGHPGEKGEKGNSVFILG
AVKGIQGDRGDPGLPGLPGSWGAGGPAGPTGYPGEPGLVGPPGQPGRPGLKGNPGVGVKG
QMGDPGEVGQQGSPGPTLLVEPPDFCLYKGEKGIKGIPGMVGLPGPPGRKGESGIGAKGE
KGIPGFPGPRGDPGSYGSPGFPGLKGELGLVGDPGLFGLIGPKGDPGNRGHPGPPGVLVT
PPLPLKGPPGDPGFPGRYGETGDVGPPGPPGLLGRPGEACAGMIGPPGPQGFPGLPGLPG
EAGIPGRPDSAPGKPGKPGSPGLPGAPGLQGLPGSSVIYCSVGNPGPQGIKGKVGPPGGR
GPKGEKGNEGLCACEPGPMGPPGPPGLPGRQGSKGDLGLPGWLGTKGDPGPPGAEGPPGL
PGKHGASGPPGNKGAKGDMVVSRVKGHKGERGPDGPPGFPGQPGSHGRDGHAGEKGDPGP
PGDHEDATPGGKGFPGPLGPPGKAGPVGPPGLGFPGPPGERGHPGVPGHPGVRGPDGLKG
QKGDTISCNVTYPGRHGPPGFDGPPGPKGFPGPQGAPGLSGSDGHKGRPGTPGTAEIPGP
PGFRGDMGDPGFGGEKGSSPVGPPGPPGSPGVNGQKGIPGDPAFGHLGPPGKRGLSGVPG
IKGPRGDPGCPGAEGPAGIPGFLGLKGPKGREGHAGFPGVPGPPGHSCERGAPGIPGQPG
LPGYPGSPGAPGGKGQPGDVGPPGPAGMKGLPGLPGRPGAHGPPGLPGIPGPFGDDGLPG
PPGPKGPRGLPGFPGFPGERGKPGAEGCPGAKGEPGEKGMSGLPGDRGLRGAKGAIGPPG
DEGEMAIISQKGTPGEPGPPGDDGFPGERGDKGTPGMQGRRGEPGRYGPPGFHRGEPGEK
GQPGPPGPPGPPGSTGLRGFIGFPGLPGDQGEPGSPGPPGFSGIDGARGPKGNKGDPASH
FGPPGPKGEPGSPGCPGHFGASGEQGLPGIQGPRGSPGRPGPPGSSGPPGCPGDHGMPGL
RGQPGEMGDPGPRGLQGDPGIPGPPGIKGPSGSPGLNGLHGLKGQKGTKGASGLHDVGPP
GPVGIPGLKGERGDPGSPGISPPGPRGKKGPPGPPGSSGPPGPAGATGRAPKDIPDPGPP
GDQGPPGPDGPRGAPGPPGLPGSVDLLRGEPGDCGLPGPPGPPGPPGPPGYKGFPGCDGK
DGQKGPVGFPGPQGPHGFPGPPGEKGLPGPPGRKGPTGLPGPRGEPGPPADVDDCPRIPG
LPGAPGMRGPEGAMGLPGMRGPSGPGCKGEPGLDGRRGVDGVPGSPGPPGRKGDTGEDGY
PGGPGPPGPIGDPGPKGFGPGYLGGFLLVLHSQTDQEPTCPLGMPRLWTGYSLLYLEGQE
KAHNQDLGLAGSCLPVFSTLPFAYCNIHQVCHYAQRNDRSYWLASAAPLPMMPLSEEAIR
PYVSRCAVCEAPAQAVAVHSQDQSIPPCPQTWRSLWIGYSFLMHTGAGDQGGGQALMSPG
SCLEDFRAAPFLECQGRQGTCHFFANKYSFWLTTVKADLQFSSAPAPDTLKESQAQRQKI
SRCQVCVKYS
NT seq 5073 nt
atgtggtctctgcacatagtactaatgaggtgctccttcagattgaccaagtccttggcc
acaggtccctggtcacttatactcattctcttttctgtacaatatgtatatgggagtgga
aagaaatacattggtccttgtggaggaagagattgctctgtttgccactgtgttcctgaa
aaggggtctcggggtccaccaggaccaccagggccacagggtccaattggacccctggga
gccccaggacccattgggctttcaggagagaaaggaatgagaggggaccgcggccctcct
ggagcagcaggggacaaaggagataagggtccaactggtgttcctggatttccaggttta
gatggcatacctgggcacccagggcctcctggacccagaggcaaacctggtatgagtggc
cacaatggctcaagaggtgacccagggtttccaggaggaagaggagctcttggcccagga
ggccccctaggccatcctggggaaaagggagaaaaaggaaattcagtgttcattttaggt
gccgttaaaggtattcagggagacagaggggacccaggactgcctggcttaccaggatct
tggggtgcaggaggaccggcaggtcccacaggatatcctggagagccagggttagtggga
cctccgggccaaccagggcgtccaggtttgaagggaaatcccggtgtgggagtaaagggg
caaatgggagacccgggtgaggttggtcagcaaggttctcctggacccaccctgttggta
gagccacctgacttttgtctctataaaggagaaaagggtataaaaggaattcctggaatg
gttggactgccaggaccaccaggacgcaagggagaatctggtattggggcaaaaggagaa
aaaggtattcctggatttccagggcctcggggggatcctggttcctatggatctccaggt
tttccaggattaaagggagaactaggactggttggagatcctgggctatttggattaatt
ggcccaaagggggatcctggaaatcgagggcacccaggaccaccaggtgttttggtgact
ccacctcttccactcaaaggcccaccaggggacccagggttccctggccgctatggagaa
acaggggatgttggaccacctggtcccccaggtctcttgggcagaccaggggaagcctgt
gcaggcatgataggaccccctgggccacaaggatttcctggtcttcctgggcttccagga
gaagctggtattcctgggagacctgattctgctccaggaaaaccagggaagccaggatca
cctggcttgcctggagcaccaggcctgcagggcctcccaggatcaagtgtgatatactgt
agtgttgggaaccccggaccacaaggaataaaaggcaaagttggtcccccaggaggaaga
ggcccaaaaggagaaaaaggaaatgaaggactctgtgcctgtgagcctggacccatgggc
ccccctggccctccaggacttcctgggaggcaggggagtaagggagacttggggctccct
ggctggcttggaacaaaaggtgacccaggacctcctggtgctgaaggacctccagggcta
ccaggaaagcatggtgcctctggaccacctggcaacaaaggggcgaagggtgacatggtt
gtatcaagagttaaagggcacaaaggagaaagaggtcctgatgggcccccaggatttcca
gggcagccaggatcacatggtcgggatggacatgctggagaaaaaggggatccaggacct
ccaggggatcatgaagatgcgaccccaggtggtaaaggatttcctggacctctgggcccc
ccaggcaaagcaggacctgtggggcccccaggactgggatttcctggtccaccaggagag
cgaggccacccaggagttccaggccacccaggtgtgaggggccctgatggcttgaagggt
cagaaaggtgacacaatttcttgcaacgtaacctaccctgggaggcatggccctccaggt
tttgatggacctccaggtccgaagggatttccaggtccccaaggtgcccctgggctgagt
ggttcagatgggcataaaggcagacctggcacaccaggaacagcggaaataccaggtcca
cctggttttcgtggtgacatgggagatccgggttttggaggtgaaaaggggtcctcccct
gttgggcccccaggccctcccggctcaccaggagtgaatggtcagaaaggaatcccggga
gaccctgcatttggtcacctgggacccccgggaaagaggggtctttcaggagtgccaggg
ataaaaggacccagaggtgatccgggatgtccaggggctgaagggccagctggcattcct
ggattcctaggtctcaaaggtcccaaaggcagagagggacatgctgggtttccaggtgtc
ccaggtccacctggccattcctgtgaaagaggtgctccagggataccagggcaaccggga
ctccctgggtatccaggtagcccaggtgctccaggtgggaaaggacagccgggagatgtg
gggcctcccgggccagctggaatgaaaggcctccccggactcccaggacggcctggggca
catggtcccccaggcctcccaggaatcccaggtccctttggagatgatgggctacctggt
cctccaggtccaaagggaccccgggggctgcctggtttcccaggttttcccggagaaaga
ggaaagcctggtgcagagggatgtcctggcgcaaagggagaacctggagagaagggcatg
tctggccttcctggagaccggggactgagaggggccaaaggagccataggacctcccgga
gatgaaggagaaatggctatcatttcacaaaagggaacacctggggaacctggacctcct
ggagatgatggattcccaggagaaagaggtgataaaggaactcccgggatgcaagggaga
agaggagagccgggaagatacggaccacctggatttcacagaggggaacctggtgagaaa
ggtcagccagggcctcctggacccccaggccctccaggctcaactggtctaagagggttc
attggttttccaggacttccaggtgaccagggtgagccaggttctccaggtccccctgga
ttttcaggaattgatggagcaagaggacctaaaggaaacaaaggtgaccctgccagtcac
tttggtccacctggtccaaagggtgagccaggtagccctggatgtccagggcattttgga
gcatccggagagcagggcttgcctggtattcaagggcccagaggatcacctggaaggcca
gggccacctggctcctctggaccaccagggtgcccaggtgatcacgggatgcctgggctg
aggggacagccaggagaaatgggagaccctgggccaagaggcctccagggggatccaggg
ataccaggtcctccgggaataaaaggtccctccggatcacctggcctgaacggcttgcat
ggattgaaaggtcagaaaggaactaaaggtgcttcaggtttgcatgatgtggggccacct
ggtccagtgggaatacctgggctaaaaggggagagaggagaccctgggagcccaggaatc
tctcctccaggtcctcgtggaaagaaaggtcccccaggacccccagggagttcaggacca
cctggtcctgcaggtgccacaggaagagctcctaaggacattcctgacccgggtccacct
ggagatcagggacctcctggtcctgatggcccaagaggagcacctgggcctccaggcctc
cctgggagtgttgaccttctgagaggggagccaggtgactgtggtctaccagggccacca
ggtccccctggcccaccaggccctccaggatacaaaggctttccaggatgtgatggaaaa
gatggccagaaaggaccagtgggattcccgggaccgcagggaccacatggatttcctggg
ccacctggagagaagggtttacctggacctccagggagaaaagggcccactggtcttccg
ggtcccagaggtgaaccggggccacctgcagatgtggatgactgtccccgaatcccaggc
cttcctggggcgccaggcatgagaggaccagaaggagccatggggctccctggaatgaga
ggcccctcaggaccagggtgcaaaggagagcctgggctggatggcaggaggggtgtggat
ggcgtccctgggtctcctgggcctcccggacgtaaaggtgacacaggagaagacggctac
cctggaggaccagggcctcctggtcccattggggatcctgggcccaaagggtttggccct
ggatacctcggtggcttcctcctggttctccacagtcagacggaccaggagcccacctgc
cccctgggcatgcccaggctctggactgggtatagtctgttatacctggaagggcaagag
aaagctcacaatcaagaccttggtctggcagggtcttgccttcccgtatttagcacgctg
ccctttgcctactgcaacatccaccaggtgtgccactatgcccagagaaacgacagatcc
tactggctggccagcgctgcgcccctccccatgatgccactctctgaagaggcgatccgc
ccctatgtcagccgctgtgcggtatgcgaggccccggcccaggcggtggcggtgcacagc
caggaccagtccatccccccatgtccgcagacctggaggagcctctggatcgggtattca
ttcctgatgcacacaggagctggggaccaaggaggagggcaggcccttatgtcacctggc
agctgcctggaagatttcagagcagcaccattccttgaatgccagggccggcagggaact
tgccactttttcgcaaataagtatagcttctggctcacaacggtgaaagcagacttgcag
ttttcctctgctccagcaccagacaccttaaaagaaagccaggcccaacgccagaaaatc
agccggtgccaggtctgcgtgaagtatagctag

KEGG   Homo sapiens (human): 1287
Entry
1287              CDS       T01001                                 

Gene name
COL4A5, ASLN, ATS, ATS1, CA54
Definition
(RefSeq) collagen type IV alpha 5 chain
  KO
K06237  collagen type IV alpha
Organism
hsa  Homo sapiens (human)
Pathway
hsa04151  PI3K-Akt signaling pathway
hsa04510  Focal adhesion
hsa04512  ECM-receptor interaction
hsa04926  Relaxin signaling pathway
hsa04933  AGE-RAGE signaling pathway in diabetic complications
hsa04974  Protein digestion and absorption
hsa05146  Amoebiasis
hsa05165  Human papillomavirus infection
hsa05200  Pathways in cancer
hsa05222  Small cell lung cancer
Disease
H00581  Alport syndrome
H01640  Uterine leiomyoma
Brite
KEGG Orthology (KO) [BR:hsa00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    1287 (COL4A5)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    1287 (COL4A5)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    1287 (COL4A5)
 09150 Organismal Systems
  09152 Endocrine system
   04926 Relaxin signaling pathway
    1287 (COL4A5)
  09154 Digestive system
   04974 Protein digestion and absorption
    1287 (COL4A5)
 09160 Human Diseases
  09161 Cancer: overview
   05200 Pathways in cancer
    1287 (COL4A5)
  09162 Cancer: specific types
   05222 Small cell lung cancer
    1287 (COL4A5)
  09167 Endocrine and metabolic disease
   04933 AGE-RAGE signaling pathway in diabetic complications
    1287 (COL4A5)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    1287 (COL4A5)
  09174 Infectious disease: parasitic
   05146 Amoebiasis
    1287 (COL4A5)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   04147 Exosome [BR:hsa04147]
    1287 (COL4A5)
   00536 Glycosaminoglycan binding proteins [BR:hsa00536]
    1287 (COL4A5)
Exosome [BR:hsa04147]
 Exosomal proteins
  Exosomal proteins of other cancer cells
   1287 (COL4A5)
Glycosaminoglycan binding proteins [BR:hsa00536]
 Heparan sulfate / Heparin
  Extracellular matrix molecules
   1287 (COL4A5)
Motif
Pfam: Collagen C4
Other DBs
NCBI-GeneID: 1287
NCBI-ProteinID: NP_000486
OMIM: 303630
HGNC: 2207
Ensembl: ENSG00000188153
Vega: OTTHUMG00000022182
Pharos: P29400(Tbio)
UniProt: P29400 Q49AM6 A7MBN3
Structure
PDB: 5NAZ

Position
Xq22.3
AA seq 1685 aa
MKLRGVSLAAGLFLLALSLWGQPAEAAACYGCSPGSKCDCSGIKGEKGERGFPGLEGHPG
LPGFPGPEGPPGPRGQKGDDGIPGPPGPKGIRGPPGLPGFPGTPGLPGMPGHDGAPGPQG
IPGCNGTKGERGFPGSPGFPGLQGPPGPPGIPGMKGEPGSIIMSSLPGPKGNPGYPGPPG
IQGLPGPTGIPGPIGPPGPPGLMGPPGPPGLPGPKGNMGLNFQGPKGEKGEQGLQGPPGP
PGQISEQKRPIDVEFQKGDQGLPGDRGPPGPPGIRGPPGPPGGEKGEKGEQGEPGKRGKP
GKDGENGQPGIPGLPGDPGYPGEPGRDGEKGQKGDTGPPGPPGLVIPRPGTGITIGEKGN
IGLPGLPGEKGERGFPGIQGPPGLPGPPGAAVMGPPGPPGFPGERGQKGDEGPPGISIPG
PPGLDGQPGAPGLPGPPGPAGPHIPPSDEICEPGPPGPPGSPGDKGLQGEQGVKGDKGDT
CFNCIGTGISGPPGQPGLPGLPGPPGSLGFPGQKGEKGQAGATGPKGLPGIPGAPGAPGF
PGSKGEPGDILTFPGMKGDKGELGSPGAPGLPGLPGTPGQDGLPGLPGPKGEPGGITFKG
ERGPPGNPGLPGLPGNIGPMGPPGFGPPGPVGEKGIQGVAGNPGQPGIPGPKGDPGQTIT
QPGKPGLPGNPGRDGDVGLPGDPGLPGQPGLPGIPGSKGEPGIPGIGLPGPPGPKGFPGI
PGPPGAPGTPGRIGLEGPPGPPGFPGPKGEPGFALPGPPGPPGLPGFKGALGPKGDRGFP
GPPGPPGRTGLDGLPGPKGDVGPNGQPGPMGPPGLPGIGVQGPPGPPGIPGPIGQPGLHG
IPGEKGDPGPPGLDVPGPPGERGSPGIPGAPGPIGPPGSPGLPGKAGASGFPGTKGEMGM
MGPPGPPGPLGIPGRSGVPGLKGDDGLQGQPGLPGPTGEKGSKGEPGLPGPPGPMDPNLL
GSKGEKGEPGLPGIPGVSGPKGYQGLPGDPGQPGLSGQPGLPGPPGPKGNPGLPGQPGLI
GPPGLKGTIGDMGFPGPQGVEGPPGPSGVPGQPGSPGLPGQKGDKGDPGISSIGLPGLPG
PKGEPGLPGYPGNPGIKGSVGDPGLPGLPGTPGAKGQPGLPGFPGTPGPPGPKGISGPPG
NPGLPGEPGPVGGGGHPGQPGPPGEKGKPGQDGIPGPAGQKGEPGQPGFGNPGPPGLPGL
SGQKGDGGLPGIPGNPGLPGPKGEPGFHGFPGVQGPPGPPGSPGPALEGPKGNPGPQGPP
GRPGLPGPEGPPGLPGNGGIKGEKGNPGQPGLPGLPGLKGDQGPPGLQGNPGRPGLNGMK
GDPGLPGVPGFPGMKGPSGVPGSAGPEGEPGLIGPPGPPGLPGPSGQSIIIKGDAGPPGI
PGQPGLKGLPGPQGPQGLPGPTGPPGDPGRNGLPGFDGAGGRKGDPGLPGQPGTRGLDGP
PGPDGLQGPPGPPGTSSVAHGFLITRHSQTTDAPQCPQGTLQVYEGFSLLYVQGNKRAHG
QDLGTAGSCLRRFSTMPFMFCNINNVCNFASRNDYSYWLSTPEPMPMSMQPLKGQSIQPF
ISRCAVCEAPAVVIAVHSQTIQIPHCPQGWDSLWIGYSFMMHTSAGAEGSGQALASPGSC
LEEFRSAPFIECHGRGTCNYYANSYSFWLATVDVSDMFSKPQSETLKAGDLRTRISRCQV
CMKRT
NT seq 5058 nt
atgaaactgcgtggagtcagcctggctgccggcttgttcttactggccctgagtctttgg
gggcagcctgcagaggctgcggcttgctatgggtgttctccaggatcaaagtgtgactgc
agtggcataaaaggggaaaagggagagagagggtttccaggtttggaaggacacccagga
ttgcctggatttccaggtccagaagggcctccggggcctcggggacaaaagggtgatgat
ggaattccagggccaccaggaccaaaaggaatcagaggtcctcctggacttcctggattt
ccagggacaccaggtcttcctggaatgccaggccacgatggggccccaggacctcaaggt
attcccggatgcaatggaaccaagggagaacgtggatttccaggcagtcccggttttcct
ggtttacagggtcctccaggaccccctgggatcccaggtatgaagggtgaaccaggtagt
ataattatgtcatcactgccaggaccaaagggtaatccaggatatccaggtcctcctgga
atacaaggcctacctggtcccactggtataccagggccaattggtcccccaggaccacca
ggtttgatgggccctcctggtccaccaggacttccaggacctaaggggaatatgggctta
aatttccagggacccaaaggtgaaaaaggtgagcaaggtcttcagggcccacctgggcca
cctgggcagatcagtgaacagaaaagaccaattgatgtagagtttcagaaaggagatcag
ggacttcctggtgaccgagggcctcctggacctccagggatacgtggtcctccaggtccc
ccaggtggtgagaaaggtgagaagggtgagcaaggagagccaggcaaaagaggtaaacca
ggcaaagatggagaaaatggccaaccaggaattcctggtttgcctggtgatcctggttac
cctggtgaacccggaagggatggtgaaaagggccaaaaaggtgacactggcccacctgga
cctcctggacttgtaattcctagacctgggactggtataactataggagaaaaaggaaac
attgggttgcctgggttgcctggagaaaaaggagagcgaggatttcctggaatacagggt
ccacctggccttcctggacctccaggggctgcagttatgggtcctcctggccctcctgga
tttcctggagaaaggggtcagaaaggtgatgaaggaccacctggaatttccattcctgga
cctcctggacttgacggacagcctggggctcctgggcttccagggcctcctggccctgct
ggccctcacattcctcctagtgatgagatatgtgaaccaggccctccaggccccccagga
tctccaggtgataaaggactccaaggagaacaaggagtgaaaggtgacaaaggtgacact
tgcttcaactgcattggaactggtatttcagggcctccaggtcaacctggtttgccaggt
ctcccaggtcctccaggatctcttggtttccctggacagaaaggggaaaaaggacaagct
ggtgcaactggtcccaaaggattaccaggcattccaggagctccaggtgctccaggcttt
cctggatctaaaggtgaacctggtgatatcctcacttttccaggaatgaagggtgacaaa
ggagagttgggttcccctggagctccagggcttcctggtttacctggcactcctggacag
gatggattgccagggcttcctggcccgaaaggagagcctggtggaattacttttaagggt
gaaagaggtccccctgggaacccaggtttaccaggcctcccagggaatatagggcctatg
ggtccccctggtttcggccctccaggcccagtaggtgaaaaaggcatacaaggtgtggca
ggaaatccaggccagccaggaataccaggtcctaaaggggatccaggtcagactataacc
cagccggggaagcctggcttgcctggtaacccaggcagagatggtgatgtaggtcttcca
ggtgaccctggacttccagggcaaccaggcttgccagggatacctggtagcaaaggagaa
ccaggtatccctggaattgggcttcctggaccacctggtcccaaaggctttcctggaatt
ccaggacctccaggagcacctgggacacctggaagaattggtctagaaggccctcctggg
ccacccggctttccaggaccaaagggtgaaccaggatttgcattacctgggccacctggg
ccaccaggacttccaggtttcaaaggagcacttggtccaaaaggtgatcgtggtttccca
ggacctccgggtcctccaggacgcactggcttagatgggctccctggaccaaaaggtgat
gttggaccaaatggacaacctggaccaatgggacctcctgggctgccaggaataggtgtt
cagggaccaccaggaccaccagggattcctgggccaataggtcaacctggtttacatgga
ataccaggagagaagggggatccaggacctcctggacttgatgttccaggacccccaggt
gaaagaggcagtccagggatccccggagcacctggtcctataggacctccaggatcacca
gggcttccaggaaaagcaggtgcctctggatttccaggtaccaaaggtgaaatgggtatg
atgggacctccaggcccaccaggacctttgggaattcctggcaggagtggtgtacctggt
cttaaaggtgatgatggcttgcagggtcagccaggacttcctggccctacaggagaaaaa
ggtagtaaaggagagcctggccttccaggccctcctggaccaatggatccaaatcttctg
ggctcaaaaggagagaagggggaacctggcttaccaggtatacctggagtttcagggcca
aaaggttatcagggtttgcctggagacccagggcaacctggactgagtggacaacctgga
ttaccaggaccaccaggtcccaaaggtaaccctggtctccctggacagccaggtcttata
ggacctcctggacttaaaggaaccatcggtgatatgggttttccagggcctcagggtgtg
gaagggcctcctggaccttctggagttcctggacaacctggctccccaggattacctgga
cagaaaggcgacaaaggtgatcctggtatttcaagcattggtcttccaggtcttcctggt
ccaaagggtgagcctggtctgcctggatacccagggaaccctggtatcaaaggttctgtg
ggagatcctggtttgcccggattaccaggaacccctggagcaaaaggacaaccaggcctt
cctggattcccaggaaccccaggccctcctggaccaaaaggtattagtggccctcctggg
aaccccggccttccaggagaacctggtcctgtaggtggtggaggtcatcctgggcaacca
gggcctccaggcgaaaaaggcaaacccggtcaagatggtattcctggaccagctggacag
aagggtgaaccaggtcaaccaggctttggaaacccaggaccccctggacttccaggactt
tctggccaaaagggtgatggaggattacctgggattccaggaaatcctggccttccaggt
ccaaagggcgaaccaggctttcacggtttccctggtgtgcagggtcccccaggccctcct
ggttctccgggtccagctctggaaggacctaaaggcaaccctgggccccaaggtcctcct
gggagaccaggtctaccaggtccagaaggtcctccaggtctccctggaaatggaggtatt
aaaggagagaagggaaatccaggccaacctgggctacctggcttgcctggtttgaaagga
gatcaaggaccaccaggactccagggtaatcctggccggccgggtctcaatggaatgaaa
ggagatcctggtctccctggtgttccaggattcccaggcatgaaaggacccagtggagta
cctggatcagctggccctgagggggaaccgggacttattggtcctccaggtcctcctgga
ttacctggtccttcaggacagagtatcataattaaaggagatgctggtcctccaggaatc
cctggccagcctgggctaaagggtctaccaggaccccaaggacctcaaggcttaccaggt
ccaactggccctccaggagatcctggacgcaatggactccctggctttgatggtgcagga
gggcgcaaaggagacccaggtctgccaggacagccaggtacccgtggtttggatggtccc
cctggtccagatggattgcaaggtcccccaggtccccctggaacctcctctgttgcacat
ggatttcttattacacgccacagccagacaacggatgcaccacaatgcccacagggaaca
cttcaggtctatgaaggcttttctctcctgtatgtacaaggaaataaaagagcccacggt
caagacttggggacggctggcagctgccttcgtcgctttagtaccatgcctttcatgttc
tgcaacatcaataatgtttgcaactttgcttcaagaaatgactattcttactggctctct
accccagagcccatgccaatgagcatgcaacccctaaagggccagagcatccagccattc
attagtcgatgtgcagtatgtgaagctccagctgtggtgatcgcagttcacagtcagacg
atccagattccccattgtcctcagggatgggattctctgtggattggttattccttcatg
atgcatacaagtgcaggggcagaaggctcaggtcaagccctagcctcccctggttcctgc
ttggaagagtttcgttcagctcccttcatcgaatgtcatgggaggggtacctgtaactac
tatgccaactcctacagcttttggctggcaactgtagatgtgtcagacatgttcagtaaa
cctcagtcagaaacgctgaaagcaggagacttgaggacacgaattagccgatgtcaagtg
tgcatgaagaggacataa

KEGG   Homo sapiens (human): 1288
Entry
1288              CDS       T01001                                 

Gene name
COL4A6, CXDELq22.3, DELXq22.3, DFNX6
Definition
(RefSeq) collagen type IV alpha 6 chain
  KO
K06237  collagen type IV alpha
Organism
hsa  Homo sapiens (human)
Pathway
hsa04151  PI3K-Akt signaling pathway
hsa04510  Focal adhesion
hsa04512  ECM-receptor interaction
hsa04926  Relaxin signaling pathway
hsa04933  AGE-RAGE signaling pathway in diabetic complications
hsa04974  Protein digestion and absorption
hsa05146  Amoebiasis
hsa05165  Human papillomavirus infection
hsa05200  Pathways in cancer
hsa05222  Small cell lung cancer
Disease
H01209  Deafness, X-linked
H01640  Uterine leiomyoma
Brite
KEGG Orthology (KO) [BR:hsa00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    1288 (COL4A6)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    1288 (COL4A6)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    1288 (COL4A6)
 09150 Organismal Systems
  09152 Endocrine system
   04926 Relaxin signaling pathway
    1288 (COL4A6)
  09154 Digestive system
   04974 Protein digestion and absorption
    1288 (COL4A6)
 09160 Human Diseases
  09161 Cancer: overview
   05200 Pathways in cancer
    1288 (COL4A6)
  09162 Cancer: specific types
   05222 Small cell lung cancer
    1288 (COL4A6)
  09167 Endocrine and metabolic disease
   04933 AGE-RAGE signaling pathway in diabetic complications
    1288 (COL4A6)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    1288 (COL4A6)
  09174 Infectious disease: parasitic
   05146 Amoebiasis
    1288 (COL4A6)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   04147 Exosome [BR:hsa04147]
    1288 (COL4A6)
   00536 Glycosaminoglycan binding proteins [BR:hsa00536]
    1288 (COL4A6)
Exosome [BR:hsa04147]
 Exosomal proteins
  Exosomal proteins of other cancer cells
   1288 (COL4A6)
Glycosaminoglycan binding proteins [BR:hsa00536]
 Heparan sulfate / Heparin
  Extracellular matrix molecules
   1288 (COL4A6)
Motif
Pfam: Collagen C4
Other DBs
NCBI-GeneID: 1288
NCBI-ProteinID: NP_001838
OMIM: 303631
HGNC: 2208
Ensembl: ENSG00000197565
Vega: OTTHUMG00000022179
Pharos: Q14031(Tbio)
UniProt: Q14031
Position
Xq22.3
AA seq 1691 aa
MLINKLWLLLVTLCLTEELAAAGEKSYGKPCGGQDCSGSCQCFPEKGARGRPGPIGIQGP
TGPQGFTGSTGLSGLKGERGFPGLLGPYGPKGDKGPMGVPGFLGINGIPGHPGQPGPRGP
PGLDGCNGTQGAVGFPGPDGYPGLLGPPGLPGQKGSKGDPVLAPGSFKGMKGDPGLPGLD
GITGPQGAPGFPGAVGPAGPPGLQGPPGPPGPLGPDGNMGLGFQGEKGVKGDVGLPGPAG
PPPSTGELEFMGFPKGKKGSKGEPGPKGFPGISGPPGFPGLGTTGEKGEKGEKGIPGLPG
PRGPMGSEGVQGPPGQQGKKGTLGFPGLNGFQGIEGQKGDIGLPGPDVFIDIDGAVISGN
PGDPGVPGLPGLKGDEGIQGLRGPSGVPGLPALSGVPGALGPQGFPGLKGDQGNPGRTTI
GAAGLPGRDGLPGPPGPPGPPSPEFETETLHNKESGFPGLRGEQGPKGNLGLKGIKGDSG
FCACDGGVPNTGPPGEPGPPGPWGLIGLPGLKGARGDRGSGGAQGPAGAPGLVGPLGPSG
PKGKKGEPILSTIQGMPGDRGDSGSQGFRGVIGEPGKDGVPGLPGLPGLPGDGGQGFPGE
KGLPGLPGEKGHPGPPGLPGNGLPGLPGPRGLPGDKGKDGLPGQQGLPGSKGITLPCIIP
GSYGPSGFPGTPGFPGPKGSRGLPGTPGQPGSSGSKGEPGSPGLVHLPELPGFPGPRGEK
GLPGFPGLPGKDGLPGMIGSPGLPGSKGATGDIFGAENGAPGEQGLQGLTGHKGFLGDSG
LPGLKGVHGKPGLLGPKGERGSPGTPGQVGQPGTPGSSGPYGIKGKSGLPGAPGFPGISG
HPGKKGTRGKKGPPGSIVKKGLPGLKGLPGNPGLVGLKGSPGSPGVAGLPALSGPKGEKG
SVGFVGFPGIPGLPGIPGTRGLKGIPGSTGKMGPSGRAGTPGEKGDRGNPGPVGIPSPRR
PMSNLWLKGDKGSQGSAGSNGFPGPRGDKGEAGRPGPPGLPGAPGLPGIIKGVSGKPGPP
GFMGIRGLPGLKGSSGITGFPGMPGESGSQGIRGSPGLPGASGLPGLKGDNGQTVEISGS
PGPKGQPGESGFKGTKGRDGLIGNIGFPGNKGEDGKVGVSGDVGLPGAPGFPGVAGMRGE
PGLPGSSGHQGAIGPLGSPGLIGPKGFPGFPGLHGLNGLPGTKGTHGTPGPSITGVPGPA
GLPGPKGEKGYPGIGIGAPGKPGLRGQKGDRGFPGLQGPAGLPGAPGISLPSLIAGQPGD
PGRPGLDGERGRPGPAGPPGPPGPSSNQGDTGDPGFPGIPGPKGPKGDQGIPGFSGLPGE
LGLKGMRGEPGFMGTPGKVGPPGDPGFPGMKGKAGPRGSSGLQGDPGQTPTAEAVQVPPG
PLGLPGIDGIPGLTGDPGAQGPVGLQGSKGLPGIPGKDGPSGLPGPPGALGDPGLPGLQG
PPGFEGAPGQQGPFGMPGMPGQSMRVGYTLVKHSQSEQVPPCPIGMSQLWVGYSLLFVEG
QEKAHNQDLGFAGSCLPRFSTMPFIYCNINEVCHYARRNDKSYWLSTTAPIPMMPVSQTQ
IPQYISRCSVCEAPSQAIAVHSQDITIPQCPLGWRSLWIGYSFLMHTAAGAEGGGQSLVS
PGSCLEDFRATPFIECSGARGTCHYFANKYSFWLTTVEERQQFGELPVSETLKAGQLHTR
VSRCQVCMKSL
NT seq 5076 nt
atgcttataaacaagttgtggctgctcctggttacgttgtgcctgaccgaggaactggca
gcagcgggagagaagtcttatggaaagccatgtgggggccaggactgcagtgggagctgt
cagtgttttcctgagaaaggagcgagaggacgacctggaccaattggaattcaaggccca
acaggtcctcaaggattcactggctctactggtttatcgggattgaaaggagaaaggggt
ttcccaggccttctgggaccttatggaccaaaaggagataagggtcccatgggagttcct
ggctttcttggcatcaatgggattccgggccaccctggacaaccaggccccagaggccca
cctggtctggatggctgtaatggaactcaaggagctgttggatttccaggccctgatggc
tatcctgggcttctcggaccacccgggcttcctggtcagaaaggatcaaaaggtgaccct
gtccttgctccaggtagtttcaaaggaatgaagggggatcctgggctgcctggactggat
ggaatcactggcccacaaggagcacccggatttcctggagctgtaggacctgcaggacca
ccaggattacaaggtcctccagggcctcctggtcctcttggtcctgatgggaatatgggg
ctaggttttcaaggagagaaaggagtcaagggggatgttggcctccctggcccagcagga
cctccaccatctactggagagctggaattcatgggattccccaaagggaagaaaggatcc
aagggtgaaccagggcctaagggttttccaggcataagtggccctccaggcttcccgggc
cttggaactactggagaaaagggagaaaagggagaaaagggaatccctggtttgccagga
cctaggggtcccatgggttcagaaggagtccaaggccctccagggcaacagggcaagaaa
gggaccctgggatttcctgggcttaatggattccaaggaattgagggtcaaaagggtgac
attggcctgccaggcccagatgttttcatcgatatagatggtgctgtgatctcaggtaat
cctggagatcctggtgtacctggcctcccaggccttaaaggagatgaaggcatccaaggc
ctacgtggcccttctggtgtccctggattgccagcattatcaggtgtcccaggagcccta
gggcctcagggatttccagggctgaagggggaccaaggaaacccaggccgtaccacaatt
ggagcagctggcctccctggcagagatggtttgccaggcccaccaggtccaccaggccca
cctagtccagaatttgagactgaaactctacacaacaaagagtcagggttccctggtctc
cgaggagaacaaggtccaaaaggaaacctaggcctcaaaggaataaaaggagactcaggt
ttctgtgcttgtgacggtggtgttcccaacactggaccacccggggaaccaggcccacct
ggtccatggggtctcataggccttccaggccttaaaggagccagaggagatcgaggctct
gggggtgcacagggcccagcaggggctccaggcttagttgggcctctgggtccttcagga
cccaaaggaaagaagggggaaccaattctcagtacaatccaaggaatgccaggagatcgg
ggtgattctggctcccagggcttccgtggtgtaataggagaaccaggcaaggacggagta
ccaggtttaccaggtctgccaggccttccgggtgatggtggacagggcttcccaggtgaa
aaggggttacctggacttcctggtgaaaaaggccatcctggtccacctggcctcccagga
aatgggttaccaggacttcctggaccccgtgggcttcctggagataaaggcaaggatgga
ttaccgggacaacaaggccttcccggatctaagggaatcaccctgccctgtattattcct
gggtcatacggtccatcaggatttccaggcactcccggattcccaggccctaaagggtct
cgaggcctccctgggaccccaggccagcctgggtcaagtggaagtaaaggagagccaggg
agtccaggattggttcatcttcctgaattaccaggatttcctggacctcgtggggagaag
ggcttgcctgggtttcctgggctccctggaaaagatggcttgcctgggatgattggcagt
ccaggcttacctggttccaagggagccactggtgacatctttggtgctgaaaatggtgct
ccgggggaacaaggcctacaaggattaacagggcacaaaggatttcttggagactctggc
cttccaggactcaagggtgtgcacgggaagcctggcttactaggccccaaaggtgagcgg
ggcagccctgggacaccaggacaggtgggacagccaggcaccccaggatctagtggtcca
tatggcatcaagggcaaatctgggctcccaggagcaccaggcttcccaggcatctcagga
catcctggaaagaaaggaacaagaggcaagaaaggtcctcctggatcaattgtaaagaaa
gggctgccagggctaaaaggccttcctggaaatccaggcctagtaggactgaaaggaagc
ccaggctctccaggggtcgctgggttgccagccctctctggacccaagggagagaagggg
tctgttggattcgtaggttttccaggaataccaggtctgcctggtattcctggaacaaga
ggattaaaaggaattccaggatcaactggaaaaatgggaccatctggacgtgctggtact
cctggtgaaaagggagacagaggcaatccggggccagtcggaatacctagtccaagacgt
ccaatgtcaaacctttggctcaaaggagacaaaggctctcaaggctcagccggatccaat
ggatttcctgggccaagaggtgacaaaggagaggctggtcgacctggaccaccaggccta
cctggagctcctggcctcccaggcattatcaaaggagttagtggaaagccagggccccct
ggcttcatgggaatccggggcttacctggcctgaaggggtcctctgggatcacaggtttc
ccaggaatgccaggagaaagtggttcacaaggtatcagagggtcgcctggactcccagga
gcatctggtctcccaggcctgaaaggagacaacggccagacagttgaaatttccggtagc
ccaggacccaagggacagcctggcgaatctggttttaaaggcacaaaaggaagagatgga
ctaataggcaatataggcttccctggaaacaaaggtgaagatggaaaagttggtgtttct
ggagatgttggccttcctggagctccaggatttccaggagttgccggcatgagaggagaa
ccaggacttccaggttcttctggtcaccaaggggcaattgggcctctaggatcccccgga
ttaataggacccaaaggcttccctggatttcctggtttacatggactgaatgggcttccg
ggcaccaagggtacccatggcactccaggacctagtatcaccggtgtgcctgggcctgct
ggtctccctggacccaaaggagaaaaaggatatccaggaattggcatcggagctccaggg
aagccgggcctgagagggcaaaaaggtgatcgaggtttcccaggtctccagggccctgct
ggtctccccggtgccccaggcatctccttgccctcactcatagcaggacagcctggtgac
cccgggcgaccaggcctagatggagaacgaggccgcccaggccccgctggacccccaggt
ccccctgggccatcctcgaatcaaggcgacaccggagaccctggcttccctggaattcct
ggacctaaagggcctaagggagaccaaggaattccaggtttttctggcctccctggagag
ctaggactgaaaggcatgagaggtgagcctggcttcatggggactccaggcaaggttggg
ccacctggagacccaggatttcccggaatgaaggggaaggcagggccaagaggctcttct
ggcctccaaggtgatcctggacaaacaccaactgcagaagctgtccaggttcctcctgga
cccttgggtctaccagggatcgatggcatccctggcctcactggggaccctggggctcaa
ggccctgtaggcctacaaggctccaaaggtttacctggcatccccggtaaagatggcccc
agtgggctcccaggcccacctggggctcttggtgatcctggtctgcctggactgcaaggc
cctccaggatttgaaggagctccagggcagcaaggccccttcgggatgcctggaatgcct
ggccagagcatgagagtgggctacacgttggtaaagcacagccagtcggaacaggtgccc
ccgtgtcccatcgggatgagccagctgtgggtggggtacagcttactgtttgtggagggg
caagagaaagcccacaaccaggacctgggctttgctggctcctgtctgccccgcttcagc
accatgcccttcatctactgcaacatcaacgaggtgtgccactatgccaggcgcaatgat
aaatcttactggctctccactaccgcccctatccccatgatgcccgtcagccagacccag
attccccagtacatcagccgctgctctgtgtgtgaggcaccctcgcaagccattgctgtg
cacagccaggacatcaccatcccgcagtgccccctgggctggcgcagcctctggattggg
tactctttcctcatgcacactgccgctggtgccgagggtggaggccagtccctggtctca
cctggctcctgcctagaggactttcgggccactcctttcatcgaatgcagtggtgcccga
ggcacctgccactactttgcaaacaagtacagtttctggttgaccacagtggaggagagg
cagcagtttggggagttgcctgtgtctgaaacgctgaaagctgggcagctccacactcga
gtcagtcgctgccaggtgtgtatgaaaagcctgtag
