KEGG   Homo sapiens (human): 1282
Entry
1282              CDS       T01001                                 
Symbol
COL4A1, BSVD, BSVD1, COL4A1s, PADMAL, RATOR
Name
(RefSeq) collagen type IV alpha 1 chain
  KO
K06237  collagen type IV alpha
Organism
hsa  Homo sapiens (human)
Pathway
hsa04151  PI3K-Akt signaling pathway
hsa04510  Focal adhesion
hsa04512  ECM-receptor interaction
hsa04820  Cytoskeleton in muscle cells
hsa04926  Relaxin signaling pathway
hsa04933  AGE-RAGE signaling pathway in diabetic complications
hsa04974  Protein digestion and absorption
hsa05146  Amoebiasis
hsa05165  Human papillomavirus infection
hsa05200  Pathways in cancer
hsa05222  Small cell lung cancer
Network
nt06539  Cytoskeleton in muscle cells
  Element
N01814  Extracellular matrix - Basal lamina
Disease
H00579  Hereditary angiopathy with nephropathy, aneurysms, and muscle cramps (HANAC)
H00839  Porencephaly
H00877  Brain small vessel disease
H02718  Autosomal dominant pontine microangiopathy and leukoencephalopathy
Brite
KEGG Orthology (KO) [BR:hsa00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    1282 (COL4A1)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    1282 (COL4A1)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    1282 (COL4A1)
  09142 Cell motility
   04820 Cytoskeleton in muscle cells
    1282 (COL4A1)
 09150 Organismal Systems
  09152 Endocrine system
   04926 Relaxin signaling pathway
    1282 (COL4A1)
  09154 Digestive system
   04974 Protein digestion and absorption
    1282 (COL4A1)
 09160 Human Diseases
  09161 Cancer: overview
   05200 Pathways in cancer
    1282 (COL4A1)
  09162 Cancer: specific types
   05222 Small cell lung cancer
    1282 (COL4A1)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    1282 (COL4A1)
  09174 Infectious disease: parasitic
   05146 Amoebiasis
    1282 (COL4A1)
  09167 Endocrine and metabolic disease
   04933 AGE-RAGE signaling pathway in diabetic complications
    1282 (COL4A1)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   04147 Exosome [BR:hsa04147]
    1282 (COL4A1)
   00536 Glycosaminoglycan binding proteins [BR:hsa00536]
    1282 (COL4A1)
Exosome [BR:hsa04147]
 Exosomal proteins
  Exosomal proteins of other cancer cells
   1282 (COL4A1)
Glycosaminoglycan binding proteins [BR:hsa00536]
 Heparan sulfate / Heparin
  Extracellular matrix molecules
   1282 (COL4A1)
SSDB
Motif
Pfam: Collagen C4
Other DBs
NCBI-GeneID: 1282
NCBI-ProteinID: NP_001836
OMIM: 120130
HGNC: 2202
Ensembl: ENSG00000187498
UniProt: P02462
Structure
LinkDB
Position
13:complement(110148963..110307157)
AA seq 1669 aa
MGPRLSVWLLLLPAALLLHEEHSRAAAKGGCAGSGCGKCDCHGVKGQKGERGLPGLQGVI
GFPGMQGPEGPQGPPGQKGDTGEPGLPGTKGTRGPPGASGYPGNPGLPGIPGQDGPPGPP
GIPGCNGTKGERGPLGPPGLPGFAGNPGPPGLPGMKGDPGEILGHVPGMLLKGERGFPGI
PGTPGPPGLPGLQGPVGPPGFTGPPGPPGPPGPPGEKGQMGLSFQGPKGDKGDQGVSGPP
GVPGQAQVQEKGDFATKGEKGQKGEPGFQGMPGVGEKGEPGKPGPRGKPGKDGDKGEKGS
PGFPGEPGYPGLIGRQGPQGEKGEAGPPGPPGIVIGTGPLGEKGERGYPGTPGPRGEPGP
KGFPGLPGQPGPPGLPVPGQAGAPGFPGERGEKGDRGFPGTSLPGPSGRDGLPGPPGSPG
PPGQPGYTNGIVECQPGPPGDQGPPGIPGQPGFIGEIGEKGQKGESCLICDIDGYRGPPG
PQGPPGEIGFPGQPGAKGDRGLPGRDGVAGVPGPQGTPGLIGQPGAKGEPGEFYFDLRLK
GDKGDPGFPGQPGMPGRAGSPGRDGHPGLPGPKGSPGSVGLKGERGPPGGVGFPGSRGDT
GPPGPPGYGPAGPIGDKGQAGFPGGPGSPGLPGPKGEPGKIVPLPGPPGAEGLPGSPGFP
GPQGDRGFPGTPGRPGLPGEKGAVGQPGIGFPGPPGPKGVDGLPGDMGPPGTPGRPGFNG
LPGNPGVQGQKGEPGVGLPGLKGLPGLPGIPGTPGEKGSIGVPGVPGEHGAIGPPGLQGI
RGEPGPPGLPGSVGSPGVPGIGPPGARGPPGGQGPPGLSGPPGIKGEKGFPGFPGLDMPG
PKGDKGAQGLPGITGQSGLPGLPGQQGAPGIPGFPGSKGEMGVMGTPGQPGSPGPVGAPG
LPGEKGDHGFPGSSGPRGDPGLKGDKGDVGLPGKPGSMDKVDMGSMKGQKGDQGEKGQIG
PIGEKGSRGDPGTPGVPGKDGQAGQPGQPGPKGDPGISGTPGAPGLPGPKGSVGGMGLPG
TPGEKGVPGIPGPQGSPGLPGDKGAKGEKGQAGPPGIGIPGLRGEKGDQGIAGFPGSPGE
KGEKGSIGIPGMPGSPGLKGSPGSVGYPGSPGLPGEKGDKGLPGLDGIPGVKGEAGLPGT
PGPTGPAGQKGEPGSDGIPGSAGEKGEPGLPGRGFPGFPGAKGDKGSKGEVGFPGLAGSP
GIPGSKGEQGFMGPPGPQGQPGLPGSPGHATEGPKGDRGPQGQPGLPGLPGPMGPPGLPG
IDGVKGDKGNPGWPGAPGVPGPKGDPGFQGMPGIGGSPGITGSKGDMGPPGVPGFQGPKG
LPGLQGIKGDQGDQGVPGAKGLPGPPGPPGPYDIIKGEPGLPGPEGPPGLKGLQGLPGPK
GQQGVTGLVGIPGPPGIPGFDGAPGQKGEMGPAGPTGPRGFPGPPGPDGLPGSMGPPGTP
SVDHGFLVTRHSQTIDDPQCPSGTKILYHGYSLLYVQGNERAHGQDLGTAGSCLRKFSTM
PFLFCNINNVCNFASRNDYSYWLSTPEPMPMSMAPITGENIRPFISRCAVCEAPAMVMAV
HSQTIQIPPCPSGWSSLWIGYSFVMHTSAGAEGSGQALASPGSCLEEFRSAPFIECHGRG
TCNYYANAYSFWLATIERSEMFKKPTPSTLKAGELRTHVSRCQVCMRRT
NT seq 5010 nt   +upstreamnt  +downstreamnt
atggggccccggctcagcgtctggctgctgctgctgcccgccgcccttctgctccacgag
gagcacagccgggccgctgcgaagggtggctgtgctggctctggctgtggcaaatgtgac
tgccatggagtgaagggacaaaagggtgaaagaggcctcccggggttacaaggtgtcatt
gggtttcctggaatgcaaggacctgaggggccacagggaccaccaggacaaaagggtgat
actggagaaccaggactacctggaacaaaagggacaagaggacctccgggagcatctggc
taccctggaaacccaggacttcccggaattcctggccaagacggcccgccaggcccccca
ggtattccaggatgcaatggcacaaagggggagagagggccgctcgggcctcctggcttg
cctggtttcgctggaaatcccggaccaccaggcttaccagggatgaagggtgatccaggt
gagatacttggccatgtgcccgggatgctgttgaaaggtgaaagaggatttcccggaatc
ccagggactccaggcccaccaggactgccagggcttcaaggtcctgttgggcctccagga
tttaccggaccaccaggtcccccaggccctcccggccctccaggtgaaaagggacaaatg
ggcttaagttttcaaggaccaaaaggtgacaagggtgaccaaggggtcagtgggcctcca
ggagtaccaggacaagctcaagttcaagaaaaaggagacttcgccaccaagggagaaaag
ggccaaaaaggtgaacctggatttcaggggatgccaggggtcggagagaaaggtgaaccc
ggaaaaccaggacccagaggcaaacccggaaaagatggtgacaaaggggaaaaagggagt
cccggttttcctggtgaacccgggtacccaggactcataggccgccagggcccgcaggga
gaaaagggtgaagcaggtcctcctggcccacctggaattgttataggcacaggacctttg
ggagaaaaaggagagaggggctaccctggaactccggggccaagaggagagccaggccca
aaaggtttcccaggactaccaggccaacccggacctccaggcctccctgtacctgggcag
gctggtgcccctggcttccctggtgaaagaggagaaaaaggtgaccgaggatttcctggt
acatctctgccaggaccaagtggaagagatgggctcccgggtcctcctggttcccctggg
ccccctgggcagcctggctacacaaatggaattgtggaatgtcagcccggacctccaggt
gaccagggtcctcctggaattccagggcagccaggatttataggcgaaattggagagaaa
ggtcaaaaaggagagagttgcctcatctgtgatatagacggatatcgggggcctcccggg
ccacagggacccccgggagaaataggtttcccagggcagccaggggccaagggcgacaga
ggtttgcctggcagagatggtgttgcaggagtgccaggccctcaaggtacaccagggctg
ataggccagccaggagccaagggggagcctggtgagttttatttcgacttgcggctcaaa
ggtgacaaaggagacccaggctttccaggacagcccggcatgccagggagagcgggttct
cctggaagagatggccatccgggtcttcctggccccaagggctcgccgggttctgtagga
ttgaaaggagagcgtggcccccctggaggagttggattcccaggcagtcgtggtgacacc
ggcccccctgggcctccaggatatggtcctgctggtcccattggtgacaaaggacaagca
ggctttcctggaggccctggatccccaggcctgccaggtccaaagggtgaaccaggaaaa
attgttcctttaccaggcccccctggagcagaaggactgccggggtccccaggcttccca
ggtccccaaggagaccgaggctttcccggaaccccaggaaggccaggcctgccaggagag
aagggcgctgtgggccagccaggcattggatttccagggccccccggccccaaaggtgtt
gacggcttacctggagacatggggccaccggggactccaggtcgcccgggatttaatggc
ttacctgggaacccaggtgtgcagggccagaagggagagcctggagttggtctaccggga
ctcaaaggtttgccaggtcttcccggcattcctggcacacccggggagaaggggagcatt
ggggtaccaggcgttcctggagaacatggagcgatcggaccccctgggcttcaggggatc
agaggtgaaccgggacctcctggattgccaggctccgtggggtctccaggagttccagga
ataggcccccctggagctaggggtccccctggaggacagggaccaccggggttgtcaggc
cctcctggaataaaaggagagaagggtttccccggattccctggactggacatgccgggc
cctaaaggagataaaggggctcaaggactccctggcataacgggacagtcggggctccct
ggccttcctggacagcagggggctcctgggattcctgggtttccaggttccaagggagaa
atgggcgtcatggggacccccgggcagccgggctcaccaggaccagtgggtgctcctgga
ttaccgggtgaaaaaggggaccatggctttccgggctcctcaggacccaggggagaccct
ggcttgaaaggtgataagggggatgtcggtctccctggcaagcctggctccatggataag
gtggacatgggcagcatgaagggccagaaaggagaccaaggagagaaaggacaaattgga
ccaattggtgagaagggatcccgaggagaccctgggaccccaggagtgcctggaaaggac
gggcaggcaggacagcctgggcagccaggacctaaaggtgatccaggtataagtggaacc
ccaggtgctccaggacttccgggaccaaaaggatctgttggtggaatgggcttgccagga
acacctggagagaaaggtgtgcctggcatccctggcccacaaggttcacctggcttacct
ggagacaaaggtgcaaaaggagagaaagggcaggcaggcccacctggcataggcatccca
gggctgcgaggtgaaaagggagatcaagggatagcgggtttcccaggaagccctggagag
aagggagaaaaaggaagcattgggatcccaggaatgccagggtccccaggccttaaaggg
tctcccgggagtgttggctatccaggaagtcctgggctacctggagaaaaaggtgacaaa
ggcctcccaggattggatggcatccctggtgtcaaaggagaagcaggtcttcctgggact
cctggccccacaggcccagctggccagaaaggggagccaggcagtgatggaatcccgggg
tcagcaggagagaagggtgaaccaggtctaccaggaagaggattcccagggtttccaggg
gccaaaggagacaaaggttcaaagggtgaggtgggtttcccaggattagccgggagccca
ggaattcctggatccaaaggagagcaaggattcatgggtcctccggggccccagggacag
ccggggttaccgggatccccaggccatgccacggaggggcccaaaggagaccgcggacct
cagggccagcctggcctgccaggacttccgggacccatggggcctccagggcttcctggg
attgatggagttaaaggtgacaaaggaaatccaggctggccaggagcacccggtgtccca
gggcccaagggagaccctggattccagggcatgcctggtattggtggctctccaggaatc
acaggctctaagggtgatatggggcctccaggagttccaggatttcaaggtccaaaaggt
cttcctggcctccagggaattaaaggtgatcaaggcgatcaaggcgtcccgggagctaaa
ggtctcccgggtcctcctggccccccaggtccttacgacatcatcaaaggggagcccggg
ctccctggtcctgagggccccccagggctgaaagggcttcagggactgccaggcccgaaa
ggccagcaaggtgttacaggattggtgggtatacctggacctccaggtattcctgggttt
gacggtgcccctggccagaaaggagagatgggacctgccgggcctactggtccaagagga
tttccaggtccaccaggccccgatgggttgccaggatccatggggcccccaggcacccca
tctgttgatcacggcttccttgtgaccaggcatagtcaaacaatagatgacccacagtgt
ccttctgggaccaaaattctttaccacgggtactctttgctctacgtgcaaggcaatgaa
cgggcccatggccaggacttgggcacggccggcagctgcctgcgcaagttcagcacaatg
cccttcctgttctgcaatattaacaacgtgtgcaactttgcatcacgaaatgactactcg
tactggctgtccacccctgagcccatgcccatgtcaatggcacccatcacgggggaaaac
ataagaccatttattagtaggtgtgctgtgtgtgaggcgcctgccatggtgatggccgtg
cacagccagaccattcagatcccaccgtgccccagcgggtggtcctcgctgtggatcggc
tactcttttgtgatgcacaccagcgctggtgcagaaggctctggccaagccctggcgtcc
cccggctcctgcctggaggagtttagaagtgcgccattcatcgagtgtcacggccgtggg
acctgcaattactacgcaaacgcttacagcttttggctcgccaccatagagaggagcgag
atgttcaagaagcctacgccgtccaccttgaaggcaggggagctgcgcacgcacgtcagc
cgctgccaagtctgtatgagaagaacataa

KEGG   Homo sapiens (human): 1284
Entry
1284              CDS       T01001                                 
Symbol
COL4A2, BSVD2, ICH, POREN2
Name
(RefSeq) collagen type IV alpha 2 chain
  KO
K06237  collagen type IV alpha
Organism
hsa  Homo sapiens (human)
Pathway
hsa04151  PI3K-Akt signaling pathway
hsa04510  Focal adhesion
hsa04512  ECM-receptor interaction
hsa04820  Cytoskeleton in muscle cells
hsa04926  Relaxin signaling pathway
hsa04933  AGE-RAGE signaling pathway in diabetic complications
hsa04974  Protein digestion and absorption
hsa05146  Amoebiasis
hsa05165  Human papillomavirus infection
hsa05200  Pathways in cancer
hsa05222  Small cell lung cancer
Network
nt06539  Cytoskeleton in muscle cells
  Element
N01814  Extracellular matrix - Basal lamina
Disease
H00839  Porencephaly
H00877  Brain small vessel disease
Brite
KEGG Orthology (KO) [BR:hsa00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    1284 (COL4A2)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    1284 (COL4A2)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    1284 (COL4A2)
  09142 Cell motility
   04820 Cytoskeleton in muscle cells
    1284 (COL4A2)
 09150 Organismal Systems
  09152 Endocrine system
   04926 Relaxin signaling pathway
    1284 (COL4A2)
  09154 Digestive system
   04974 Protein digestion and absorption
    1284 (COL4A2)
 09160 Human Diseases
  09161 Cancer: overview
   05200 Pathways in cancer
    1284 (COL4A2)
  09162 Cancer: specific types
   05222 Small cell lung cancer
    1284 (COL4A2)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    1284 (COL4A2)
  09174 Infectious disease: parasitic
   05146 Amoebiasis
    1284 (COL4A2)
  09167 Endocrine and metabolic disease
   04933 AGE-RAGE signaling pathway in diabetic complications
    1284 (COL4A2)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   04147 Exosome [BR:hsa04147]
    1284 (COL4A2)
   00536 Glycosaminoglycan binding proteins [BR:hsa00536]
    1284 (COL4A2)
Exosome [BR:hsa04147]
 Exosomal proteins
  Exosomal proteins of other cancer cells
   1284 (COL4A2)
Glycosaminoglycan binding proteins [BR:hsa00536]
 Heparan sulfate / Heparin
  Extracellular matrix molecules
   1284 (COL4A2)
SSDB
Motif
Pfam: Collagen C4
Other DBs
NCBI-GeneID: 1284
NCBI-ProteinID: NP_001837
OMIM: 120090
HGNC: 2203
Ensembl: ENSG00000134871
UniProt: P08572
Structure
LinkDB
Position
13:110307284..110513209
AA seq 1712 aa
MGRDQRAVAGPALRRWLLLGTVTVGFLAQSVLAGVKKFDVPCGGRDCSGGCQCYPEKGGR
GQPGPVGPQGYNGPPGLQGFPGLQGRKGDKGERGAPGVTGPKGDVGARGVSGFPGADGIP
GHPGQGGPRGRPGYDGCNGTQGDSGPQGPPGSEGFTGPPGPQGPKGQKGEPYALPKEERD
RYRGEPGEPGLVGFQGPPGRPGHVGQMGPVGAPGRPGPPGPPGPKGQQGNRGLGFYGVKG
EKGDVGQPGPNGIPSDTLHPIIAPTGVTFHPDQYKGEKGSEGEPGIRGISLKGEEGIMGF
PGLRGYPGLSGEKGSPGQKGSRGLDGYQGPDGPRGPKGEAGDPGPPGLPAYSPHPSLAKG
ARGDPGFPGAQGEPGSQGEPGDPGLPGPPGLSIGDGDQRRGLPGEMGPKGFIGDPGIPAL
YGGPPGPDGKRGPPGPPGLPGPPGPDGFLFGLKGAKGRAGFPGLPGSPGARGPKGWKGDA
GECRCTEGDEAIKGLPGLPGPKGFAGINGEPGRKGDRGDPGQHGLPGFPGLKGVPGNIGA
PGPKGAKGDSRTITTKGERGQPGVPGVPGMKGDDGSPGRDGLDGFPGLPGPPGDGIKGPP
GDPGYPGIPGTKGTPGEMGPPGLGLPGLKGQRGFPGDAGLPGPPGFLGPPGPAGTPGQID
CDTDVKRAVGGDRQEAIQPGCIGGPKGLPGLPGPPGPTGAKGLRGIPGFAGADGGPGPRG
LPGDAGREGFPGPPGFIGPRGSKGAVGLPGPDGSPGPIGLPGPDGPPGERGLPGEVLGAQ
PGPRGDAGVPGQPGLKGLPGDRGPPGFRGSQGMPGMPGLKGQPGLPGPSGQPGLYGPPGL
HGFPGAPGQEGPLGLPGIPGREGLPGDRGDPGDTGAPGPVGMKGLSGDRGDAGFTGEQGH
PGSPGFKGIDGMPGTPGLKGDRGSPGMDGFQGMPGLKGRPGFPGSKGEAGFFGIPGLKGL
AGEPGFKGSRGDPGPPGPPPVILPGMKDIKGEKGDEGPMGLKGYLGAKGIQGMPGIPGLS
GIPGLPGRPGHIKGVKGDIGVPGIPGLPGFPGVAGPPGITGFPGFIGSRGDKGAPGRAGL
YGEIGATGDFGDIGDTINLPGRPGLKGERGTTGIPGLKGFFGEKGTEGDIGFPGITGVTG
VQGPPGLKGQTGFPGLTGPPGSQGELGRIGLPGGKGDDGWPGAPGLPGFPGLRGIRGLHG
LPGTKGFPGSPGSDIHGDPGFPGPPGERGDPGEANTLPGPVGVPGQKGDQGAPGERGPPG
SPGLQGFPGITPPSNISGAPGDKGAPGIFGLKGYRGPPGPPGSAALPGSKGDTGNPGAPG
TPGTKGWAGDSGPQGRPGVFGLPGEKGPRGEQGFMGNTGPTGAVGDRGPKGPKGDPGFPG
APGTVGAPGIAGIPQKIAVQPGTVGPQGRRGPPGAPGEMGPQGPPGEPGFRGAPGKAGPQ
GRGGVSAVPGFRGDEGPIGHQGPIGQEGAPGRPGSPGLPGMPGRSVSIGYLLVKHSQTDQ
EPMCPVGMNKLWSGYSLLYFEGQEKAHNQDLGLAGSCLARFSTMPFLYCNPGDVCYYASR
NDKSYWLSTTAPLPMMPVAEDEIKPYISRCSVCEAPAIAIAVHSQDVSIPHCPAGWRSLW
IGYSFLMHTAAGDEGGGQSLVSPGSCLEDFRATPFIECNGGRGTCHYYANKYSFWLTTIP
EQSFQGSPSADTLKAGLIRTHISRCQVCMKNL
NT seq 5139 nt   +upstreamnt  +downstreamnt
atggggagagaccagcgcgcggtggccggccctgccctacggcggtggctgctgctgggg
acagtgaccgtggggttcctcgcccagagcgtcttggcgggtgtgaagaagtttgatgtg
ccgtgtggaggaagagattgcagtgggggctgccagtgctaccctgagaaaggtggacgt
ggtcagcctgggccagtgggcccccaggggtacaatgggccaccaggattacaaggattc
ccgggactgcagggacgtaaaggagacaagggtgaaaggggagcccccggagtaacggga
cccaagggcgacgtgggagcaagaggcgtttctggattccctggtgccgatggaattcct
ggacacccggggcaaggtgggcccaggggaaggccgggctacgatggctgcaacggaacc
cagggagactcaggtccacaggggccccccggctctgaggggttcaccgggcctcccggg
ccccaaggaccaaaagggcagaaaggtgagccttatgcactgcctaaagaggagcgcgac
agatatcggggtgaacctggagagcctggattggtcggtttccagggacctcccggccgc
cctgggcatgtgggacagatgggtccagttggagctccagggagaccaggaccacctgga
ccccctggaccaaaaggacagcaaggcaacagaggacttggtttctacggagttaagggt
gaaaagggtgacgtagggcagccgggacccaacgggattccatcagacaccctccacccc
atcatcgcgcccacaggagtcaccttccacccagatcagtacaagggtgaaaaaggcagt
gagggggaaccaggaataagaggcatttccttgaagggagaagaaggaatcatgggcttt
cctggactgaggggttaccctggcttgagtggtgaaaaaggatcaccaggacagaaggga
agccgaggcctggatggctatcaagggcctgatggaccccggggacccaagggagaagcc
ggagacccagggccccctggactacctgcctactcccctcacccttccctagcaaaaggt
gccagaggtgacccgggattcccaggggcccaaggggagccaggaagccagggtgagcca
ggagacccgggcctcccaggtccccctggcctctccatcggagatggagatcagaggaga
ggcctgccgggtgagatgggacccaagggcttcatcggagaccccggcatccctgcgctc
tacgggggcccacctggacctgatggaaagcgagggcctccaggaccccccgggctccct
ggaccacctggacctgatggcttcctgtttgggctgaaaggagcaaaaggaagagcaggc
ttccctgggcttcccggctcccctggagcccgcggaccaaaggggtggaaaggtgacgct
ggggaatgcagatgtacagaaggcgacgaagctatcaaaggtcttccgggactgccagga
cccaagggcttcgcaggcatcaacggggagccggggaggaaaggggacagaggagacccc
ggccaacacggcctccctgggttcccagggctcaagggagtgcctggcaacattggtgct
cccggacccaaaggagcaaaaggagattccagaacaatcacaaccaaaggtgagcgggga
cagcccggcgtcccaggtgtgcccgggatgaaaggtgacgatggcagcccaggccgcgat
gggctcgatggattccccggcctcccaggccctcccggtgatggcatcaagggccctcca
ggggacccaggctatccaggaatacctggaacgaagggtactccaggagaaatgggcccc
ccaggactgggccttcccggcctcaaaggccaacgtggtttccctggagacgccggctta
cctggaccaccaggcttcctgggccctcctggccccgcagggaccccaggacaaatagat
tgtgacacagatgtgaaaagggccgttggaggtgacagacaggaggccatccagccaggt
tgcataggagggcccaagggattgccaggcctgccaggacccccaggccccacaggtgcc
aaaggcctccgaggaatcccaggcttcgcaggagctgatggaggaccagggcccaggggc
ttgccaggagacgcaggtcgtgaagggttcccaggacccccagggttcataggaccccga
ggatccaaaggtgcagtgggcctccctggcccagatggatccccaggtcccatcggcctg
ccagggccagatgggccccctggggaaaggggcctccctggagaagtcctgggagctcag
cccgggccacggggagatgctggtgtgcctggacagcctgggcttaaaggccttcccgga
gacagaggcccccctggattcagaggaagccaagggatgcctgggatgccagggctgaag
ggccagccaggcctcccaggaccttccggccagccaggcctgtatgggcctccaggactg
catggattcccaggagctcctggccaagaggggcccttggggctgccaggaatcccaggc
cgtgaaggtctgcctggtgatagaggggaccctggggacacaggcgctcctggccctgtg
ggcatgaaaggtctctctggtgacagaggagatgctggcttcacaggggagcaaggccat
ccaggaagccctggatttaaaggaattgatggaatgcctgggacccccgggctaaaagga
gatagaggctcacctgggatggatggtttccaaggcatgcctggactcaaagggagaccc
gggtttccagggagcaaaggcgaggctggatttttcggaatacccggtctgaagggtctg
gctggtgagccaggttttaaaggcagccgaggggaccctgggcccccaggaccacctcct
gtcatcctgccaggaatgaaagacattaaaggagagaaaggagatgaagggcctatgggg
ctgaaaggatacctgggcgcaaaaggtatccaaggaatgccaggcatcccagggctgtca
ggaatccctgggctgcctgggaggcccggccacatcaaaggagtcaagggagacatcgga
gtccccggcatccccggtttgccaggattccctggggtggctggcccccctggaattacg
ggattcccaggattcataggaagccggggtgacaaaggtgccccagggagagcaggcctg
tatggcgagattggcgcgactggtgatttcggtgacatcggggacactataaatttacca
ggaagaccaggcctgaagggggagcggggcaccactggaataccaggtctgaagggattc
tttggagagaagggaacagaaggtgacatcggcttccctgggataacaggcgtgactgga
gtccaaggccctcctggacttaaaggacaaacaggctttccagggctgactgggcctcca
gggtcgcagggagagctggggcggattggactgcctggtggcaaaggagatgatggctgg
ccgggagctccgggcttaccaggttttccgggactccgtgggatccgcggcttacacggc
ttgccaggcaccaagggctttccaggatccccaggttctgacatccacggagacccaggc
ttcccaggccctcctggggaaagaggtgacccaggagaggccaacacccttccaggccct
gtgggagtcccaggacagaaaggagaccaaggagctccaggggaacgaggcccacctggg
agcccaggacttcaggggttccctggtatcacacccccttccaacatctctggggcacct
ggtgacaaaggggcgccagggatatttggcctgaaaggttatcggggcccaccagggcca
ccaggttctgctgctcttcctggaagcaaaggtgacacagggaacccaggagctccagga
accccagggaccaaaggatgggccggggactccgggccccagggcaggcctggtgtgttt
ggtctcccaggagaaaaagggcccaggggtgaacaaggcttcatggggaacactggaccc
actggggcggtgggcgacagaggccccaagggacccaagggagacccaggattccctggt
gcccccgggactgtgggagcccccgggattgcaggaatcccccagaagattgccgtccaa
ccagggacagtgggtccccaggggaggcgaggcccccctggggcaccgggggagatgggg
ccccagggcccccccggagaaccaggtttccgtggggctccagggaaagctgggccccaa
ggaagaggtggtgtgtctgctgttcccggcttccggggagatgaaggacccataggccac
caggggccgattggccaagaaggtgcaccaggccgtccagggagcccgggcctgccgggt
atgccaggccgcagcgtcagcatcggctacctcctggtgaagcacagccagacggaccag
gagcccatgtgcccagtgggcatgaacaaactctggagtggatacagcctgctgtacttc
gagggccaggagaaggcgcacaaccaggacctggggctggcgggctcctgcctggcgcgg
ttcagcaccatgcccttcctgtactgcaaccctggtgatgtctgctactatgccagccgg
aacgacaagtcctactggctctctaccactgcgccgctgcccatgatgcccgtggccgag
gacgagatcaagccctacatcagccgctgttctgtgtgtgaggccccggccatcgccatc
gcggtccacagtcaggatgtctccatcccacactgcccagctgggtggcggagtttgtgg
atcggatattccttcctcatgcacacggcggcgggagacgaaggcggtggccaatcactg
gtgtcaccgggcagctgtctagaggacttccgcgccacaccattcatcgaatgcaatgga
ggccgcggcacctgccactactacgccaacaagtacagcttctggctgaccaccattccc
gagcagagcttccagggctcgccctccgccgacacgctcaaggccggcctcatccgcaca
cacatcagccgctgccaggtgtgcatgaagaacctgtga

KEGG   Homo sapiens (human): 1285
Entry
1285              CDS       T01001                                 
Symbol
COL4A3, ATS2, ATS3, ATS3A, ATS3B, BFH2
Name
(RefSeq) collagen type IV alpha 3 chain
  KO
K06237  collagen type IV alpha
Organism
hsa  Homo sapiens (human)
Pathway
hsa04151  PI3K-Akt signaling pathway
hsa04510  Focal adhesion
hsa04512  ECM-receptor interaction
hsa04820  Cytoskeleton in muscle cells
hsa04926  Relaxin signaling pathway
hsa04933  AGE-RAGE signaling pathway in diabetic complications
hsa04974  Protein digestion and absorption
hsa05146  Amoebiasis
hsa05165  Human papillomavirus infection
hsa05200  Pathways in cancer
hsa05222  Small cell lung cancer
Network
nt06539  Cytoskeleton in muscle cells
  Element
N01814  Extracellular matrix - Basal lamina
Disease
H00581  Alport syndrome
H00582  Benign familial hematuria
Brite
KEGG Orthology (KO) [BR:hsa00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    1285 (COL4A3)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    1285 (COL4A3)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    1285 (COL4A3)
  09142 Cell motility
   04820 Cytoskeleton in muscle cells
    1285 (COL4A3)
 09150 Organismal Systems
  09152 Endocrine system
   04926 Relaxin signaling pathway
    1285 (COL4A3)
  09154 Digestive system
   04974 Protein digestion and absorption
    1285 (COL4A3)
 09160 Human Diseases
  09161 Cancer: overview
   05200 Pathways in cancer
    1285 (COL4A3)
  09162 Cancer: specific types
   05222 Small cell lung cancer
    1285 (COL4A3)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    1285 (COL4A3)
  09174 Infectious disease: parasitic
   05146 Amoebiasis
    1285 (COL4A3)
  09167 Endocrine and metabolic disease
   04933 AGE-RAGE signaling pathway in diabetic complications
    1285 (COL4A3)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   04147 Exosome [BR:hsa04147]
    1285 (COL4A3)
   00536 Glycosaminoglycan binding proteins [BR:hsa00536]
    1285 (COL4A3)
Exosome [BR:hsa04147]
 Exosomal proteins
  Exosomal proteins of other cancer cells
   1285 (COL4A3)
Glycosaminoglycan binding proteins [BR:hsa00536]
 Heparan sulfate / Heparin
  Extracellular matrix molecules
   1285 (COL4A3)
SSDB
Motif
Pfam: Collagen C4
Other DBs
NCBI-GeneID: 1285
NCBI-ProteinID: NP_000082
OMIM: 120070
HGNC: 2204
Ensembl: ENSG00000169031
UniProt: Q01955
Structure
LinkDB
Position
2:227164624..227314792
AA seq 1670 aa
MSARTAPRPQVLLLPLLLVLLAAAPAASKGCVCKDKGQCFCDGAKGEKGEKGFPGPPGSP
GQKGFTGPEGLPGPQGPKGFPGLPGLTGSKGVRGISGLPGFSGSPGLPGTPGNTGPYGLV
GVPGCSGSKGEQGFPGLPGTLGYPGIPGAAGLKGQKGAPAKEEDIELDAKGDPGLPGAPG
PQGLPGPPGFPGPVGPPGPPGFFGFPGAMGPRGPKGHMGERVIGHKGERGVKGLTGPPGP
PGTVIVTLTGPDNRTDLKGEKGDKGAMGEPGPPGPSGLPGESYGSEKGAPGDPGLQGKPG
KDGVPGFPGSEGVKGNRGFPGLMGEDGIKGQKGDIGPPGFRGPTEYYDTYQEKGDEGTPG
PPGPRGARGPQGPSGPPGVPGSPGSSRPGLRGAPGWPGLKGSKGERGRPGKDAMGTPGSP
GCAGSPGLPGSPGPPGPPGDIVFRKGPPGDHGLPGYLGSPGIPGVDGPKGEPGLLCTQCP
YIPGPPGLPGLPGLHGVKGIPGRQGAAGLKGSPGSPGNTGLPGFPGFPGAQGDPGLKGEK
GETLQPEGQVGVPGDPGLRGQPGRKGLDGIPGTPGVKGLPGPKGELALSGEKGDQGPPGD
PGSPGSPGPAGPAGPPGYGPQGEPGLQGTQGVPGAPGPPGEAGPRGELSVSTPVPGPPGP
PGPPGHPGPQGPPGIPGSLGKCGDPGLPGPDGEPGIPGIGFPGPPGPKGDQGFPGTKGSL
GCPGKMGEPGLPGKPGLPGAKGEPAVAMPGGPGTPGFPGERGNSGEHGEIGLPGLPGLPG
TPGNEGLDGPRGDPGQPGPPGEQGPPGRCIEGPRGAQGLPGLNGLKGQQGRRGKTGPKGD
PGIPGLDRSGFPGETGSPGIPGHQGEMGPLGQRGYPGNPGILGPPGEDGVIGMMGFPGAI
GPPGPPGNPGTPGQRGSPGIPGVKGQRGTPGAKGEQGDKGNPGPSEISHVIGDKGEPGLK
GFAGNPGEKGNRGVPGMPGLKGLKGLPGPAGPPGPRGDLGSTGNPGEPGLRGIPGSMGNM
GMPGSKGKRGTLGFPGRAGRPGLPGIHGLQGDKGEPGYSEGTRPGPPGPTGDPGLPGDMG
KKGEMGQPGPPGHLGPAGPEGAPGSPGSPGLPGKPGPHGDLGFKGIKGLLGPPGIRGPPG
LPGFPGSPGPMGIRGDQGRDGIPGPAGEKGETGLLRAPPGPRGNPGAQGAKGDRGAPGFP
GLPGRKGAMGDAGPRGPTGIEGFPGPPGLPGAIIPGQTGNRGPPGSRGSPGAPGPPGPPG
SHVIGIKGDKGSMGHPGPKGPPGTAGDMGPPGRLGAPGTPGLPGPRGDPGFQGFPGVKGE
KGNPGFLGSIGPPGPIGPKGPPGVRGDPGTLKIISLPGSPGPPGTPGEPGMQGEPGPPGP
PGNLGPCGPRGKPGKDGKPGTPGPAGEKGNKGSKGEPGPAGSDGLPGLKGKRGDSGSPAT
WTTRGFVFTRHSQTTAIPSCPEGTVPLYSGFSFLFVQGNQRAHGQDLGTLGSCLQRFTTM
PFLFCNVNDVCNFASRNDYSYWLSTPALMPMNMAPITGRALEPYISRCTVCEGPAIAIAV
HSQTTDIPPCPHGWISLWKGFSFIMFTSAGSEGTGQALASPGSCLEEFRASPFLECHGRG
TCNYYSNSYSFWLASLNPERMFRKPIPSTVKAGELEKIISRCQVCMKKRH
NT seq 5013 nt   +upstreamnt  +downstreamnt
atgagcgcccggaccgcccccaggccgcaggtgctcctgctgccgctcctgctggtgctc
ctggcggcggcgcccgcagccagcaagggttgtgtctgtaaagacaaaggccagtgcttc
tgtgacggggccaaaggggagaagggggagaagggctttcctggaccccccggttctcct
ggccagaaaggattcacaggtcctgaaggcttgcctggaccgcagggacccaagggcttt
ccaggacttccaggactcacgggttccaaaggtgtaaggggaataagtggattgccagga
ttttctggttctcctggacttccaggcaccccaggcaataccgggccttacggacttgtc
ggtgtaccaggatgcagtggttctaagggtgagcaggggtttccaggactcccagggaca
ctgggctacccagggatcccgggtgctgctggtttgaaaggacaaaagggtgctcctgct
aaagaagaagatatagaacttgatgcaaaaggcgaccccgggttgccaggggctccagga
ccccagggtttgccaggccctccaggttttcctgggcctgttggcccacctggtcctccg
ggattctttggctttccaggagccatgggacctagaggacctaagggtcacatgggtgaa
agagtgataggacataaaggagagcggggtgtgaaagggttaacaggacccccgggacca
ccaggaacagttattgtgaccctaactggcccagataacagaacggacctcaagggggaa
aagggagacaagggagcaatgggcgagcctggacctcctggaccctcaggactgcctgga
gaatcatatggatctgaaaagggtgctcctggagaccctggcctgcagggaaaacccgga
aaagatggtgttcctggcttccctggaagtgagggagtcaagggcaacaggggtttccct
gggttaatgggtgaagatggcattaagggacagaaaggggacattggccctccaggattt
cgtggtccaacagaatattatgacacataccaggaaaagggagatgaaggcactccaggc
ccaccagggcccagaggagctcgtggcccacaaggtcccagtggtccccccggagttcct
ggaagtcctggatcatcaaggcctggcctcagaggagcccctggatggccaggcctgaaa
ggaagtaaaggggaacgaggccgcccaggaaaggatgccatggggactcctgggtcccca
ggttgtgctggttcaccaggtcttccaggatcaccgggacctccaggaccgccaggtgac
atcgtttttcgcaagggtccacctggagatcacggactgccaggctatctagggtctcca
ggaatcccaggagttgatgggcccaaaggagaaccaggcctcctgtgtacacagtgccct
tatatcccagggcctcccggtctcccaggattgccagggttacatggtgtaaaaggaatc
ccaggaagacaaggcgcagctggcttgaaaggaagcccagggtccccaggaaatacaggt
cttccaggatttccaggtttcccaggtgcccagggtgacccaggacttaaaggagaaaaa
ggtgaaacacttcagcctgaggggcaagtgggtgtcccaggtgacccggggctcagaggc
caacctgggagaaagggcttggatggaattcctggaactccgggagtgaaaggattacca
ggacctaaaggcgaactggctctgagtggtgagaaaggggaccaaggtcctccaggggat
cctggctcccctgggtccccaggacctgcaggaccagctggaccacctggctacggaccc
caaggagaacctggtctccagggcacgcaaggagttcctggagcccccggaccacccgga
gaagccggccctaggggagagctcagtgtttcaacaccagttccaggcccaccaggacct
ccagggccccctggccatcctggcccccaaggtccacctggtatccctggatccctgggg
aaatgtggagatcctggtcttccagggcctgatggtgaaccaggaattccaggaattgga
tttcctgggcctcctggacctaagggagaccaaggttttccaggtacaaaaggatcactg
ggttgtcctggaaaaatgggagagcctgggttacctggaaagccaggcctcccaggagcc
aagggagaaccagcagtagccatgcctggaggaccaggaacaccaggttttccaggagaa
agaggcaattctggggaacatggagaaattggactccctggacttccaggtctccctgga
actccaggaaatgaagggcttgatggaccacgaggagatccagggcagcctggaccacct
ggagaacaaggacccccaggaaggtgcatagagggtcccaggggagcccaaggacttcca
ggcttaaatggattgaaagggcaacaaggcagaagaggtaaaacggggccaaagggagac
ccaggaattccaggcttggatagatcaggatttcctggagaaactggatcaccaggaatt
ccaggtcatcaaggtgaaatgggaccactgggtcaaagaggatatccaggaaatccggga
attttagggccaccaggtgaagatggagtgattgggatgatgggctttcctggagccatt
ggccctccagggccccctgggaacccaggcacaccagggcagagggggagccctggaatt
ccaggagtaaagggccagagaggaaccccaggagccaagggggaacaaggagataaagga
aatcccgggccttcagagatatcccacgtaataggggacaaaggagaaccaggtctcaaa
ggattcgcaggaaatccaggtgagaaaggaaacagaggcgttccagggatgccaggttta
aagggcctcaaaggactacccggaccagcaggaccaccaggccccagaggagatttgggc
agcactgggaatcctggagaaccaggactgcgtggtataccaggaagcatggggaacatg
ggcatgccaggttctaaaggaaaaaggggaactttgggattcccaggtcgagcaggaaga
ccaggcctcccaggtattcatggtctccagggagataagggagagccaggttattcagaa
ggtacaaggccaggaccaccgggaccaacgggggatccaggactgccgggtgatatggga
aagaaaggagaaatggggcaacctggcccacctggacatttggggcctgctggacctgag
ggagcccctggaagtcctggaagtcctggcctcccaggaaagccaggtcctcatggtgat
ttgggttttaaaggaatcaaaggcctcctgggccctccaggaatcagaggccctccaggt
cttccaggatttccaggatctcctggaccaatgggtataagaggtgaccaaggacgtgat
ggaattcctggtccagccggagaaaagggagaaacgggtttattgagggcccctccaggc
ccaagagggaaccctggtgctcaaggagccaaaggagacaggggagccccaggttttcct
ggcctcccgggcagaaaaggggccatgggagatgctggacctcgaggacccacaggcata
gaaggattcccagggccaccaggtctgcccggtgcaattatccctggccagacaggaaat
cgtggtccaccaggctcaagaggaagcccaggtgcgcctggtccccctggacctccaggg
agtcatgtaataggcataaaaggagacaaagggtctatgggccaccctggcccaaaaggt
ccacctggaactgcaggagacatgggaccaccaggtcgtctgggagcaccaggtactcca
ggtcttccaggacccagaggtgatcctggattccaggggtttccaggcgtgaaaggagaa
aagggtaatcctggatttctaggatccattggacctccaggaccaattgggccaaaagga
ccacctggtgtacgtggagaccctggcacacttaagattatctcccttccaggaagccca
gggccacctggcacacctggagaaccagggatgcagggagaacctgggccaccagggcca
cctggaaacctaggaccctgtgggccaagaggtaagccaggcaaggatggaaaaccagga
actcctggaccagctggagaaaaaggcaacaaaggttctaaaggagagccaggaccagct
ggatcagatggattgccaggtttgaaaggaaaacgtggagacagtggatcacctgcaacc
tggacaacgagaggctttgtcttcacccgacacagtcaaaccacagcaattccttcatgt
ccagaggggacagtgccactctacagtgggttttcttttctttttgtacaaggaaatcaa
cgagcccacggacaagaccttggaactcttggcagctgcctgcagcgatttaccacaatg
ccattcttattctgcaatgtcaatgatgtatgtaattttgcatctcgaaatgattattca
tactggctgtcaacaccagctctgatgccaatgaacatggctcccattactggcagagcc
cttgagccttatataagcagatgcactgtttgtgaaggtcctgcgatcgccatagccgtt
cacagccaaaccactgacattcctccatgtcctcacggctggatttctctctggaaagga
ttttcattcatcatgttcacaagtgcaggttctgagggcaccgggcaagcactggcctcc
cctggctcctgcctggaagaattccgagccagcccatttctagaatgtcatggaagagga
acgtgcaactactattcaaattcctacagtttctggctggcttcattaaacccagaaaga
atgttcagaaagcctattccatcaactgtgaaagctggggaattagaaaaaataataagt
cgctgtcaggtgtgcatgaagaaaagacactga

KEGG   Homo sapiens (human): 1286
Entry
1286              CDS       T01001                                 
Symbol
COL4A4, ATS2, BFH, BFH1, CA44
Name
(RefSeq) collagen type IV alpha 4 chain
  KO
K06237  collagen type IV alpha
Organism
hsa  Homo sapiens (human)
Pathway
hsa04151  PI3K-Akt signaling pathway
hsa04510  Focal adhesion
hsa04512  ECM-receptor interaction
hsa04820  Cytoskeleton in muscle cells
hsa04926  Relaxin signaling pathway
hsa04933  AGE-RAGE signaling pathway in diabetic complications
hsa04974  Protein digestion and absorption
hsa05146  Amoebiasis
hsa05165  Human papillomavirus infection
hsa05200  Pathways in cancer
hsa05222  Small cell lung cancer
Network
nt06539  Cytoskeleton in muscle cells
  Element
N01814  Extracellular matrix - Basal lamina
Disease
H00581  Alport syndrome
H00582  Benign familial hematuria
Brite
KEGG Orthology (KO) [BR:hsa00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    1286 (COL4A4)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    1286 (COL4A4)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    1286 (COL4A4)
  09142 Cell motility
   04820 Cytoskeleton in muscle cells
    1286 (COL4A4)
 09150 Organismal Systems
  09152 Endocrine system
   04926 Relaxin signaling pathway
    1286 (COL4A4)
  09154 Digestive system
   04974 Protein digestion and absorption
    1286 (COL4A4)
 09160 Human Diseases
  09161 Cancer: overview
   05200 Pathways in cancer
    1286 (COL4A4)
  09162 Cancer: specific types
   05222 Small cell lung cancer
    1286 (COL4A4)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    1286 (COL4A4)
  09174 Infectious disease: parasitic
   05146 Amoebiasis
    1286 (COL4A4)
  09167 Endocrine and metabolic disease
   04933 AGE-RAGE signaling pathway in diabetic complications
    1286 (COL4A4)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   04147 Exosome [BR:hsa04147]
    1286 (COL4A4)
   00536 Glycosaminoglycan binding proteins [BR:hsa00536]
    1286 (COL4A4)
Exosome [BR:hsa04147]
 Exosomal proteins
  Exosomal proteins of other cancer cells
   1286 (COL4A4)
Glycosaminoglycan binding proteins [BR:hsa00536]
 Heparan sulfate / Heparin
  Extracellular matrix molecules
   1286 (COL4A4)
SSDB
Motif
Pfam: Collagen C4
Other DBs
NCBI-GeneID: 1286
NCBI-ProteinID: NP_000083
OMIM: 120131
HGNC: 2206
Ensembl: ENSG00000081052
UniProt: P53420
Structure
LinkDB
Position
2:complement(226967360..227164488)
AA seq 1690 aa
MWSLHIVLMRCSFRLTKSLATGPWSLILILFSVQYVYGSGKKYIGPCGGRDCSVCHCVPE
KGSRGPPGPPGPQGPIGPLGAPGPIGLSGEKGMRGDRGPPGAAGDKGDKGPTGVPGFPGL
DGIPGHPGPPGPRGKPGMSGHNGSRGDPGFPGGRGALGPGGPLGHPGEKGEKGNSVFILG
AVKGIQGDRGDPGLPGLPGSWGAGGPAGPTGYPGEPGLVGPPGQPGRPGLKGNPGVGVKG
QMGDPGEVGQQGSPGPTLLVEPPDFCLYKGEKGIKGIPGMVGLPGPPGRKGESGIGAKGE
KGIPGFPGPRGDPGSYGSPGFPGLKGELGLVGDPGLFGLIGPKGDPGNRGHPGPPGVLVT
PPLPLKGPPGDPGFPGRYGETGDVGPPGPPGLLGRPGEACAGMIGPPGPQGFPGLPGLPG
EAGIPGRPDSAPGKPGKPGSPGLPGAPGLQGLPGSSVIYCSVGNPGPQGIKGKVGPPGGR
GPKGEKGNEGLCACEPGPMGPPGPPGLPGRQGSKGDLGLPGWLGTKGDPGPPGAEGPPGL
PGKHGASGPPGNKGAKGDMVVSRVKGHKGERGPDGPPGFPGQPGSHGRDGHAGEKGDPGP
PGDHEDATPGGKGFPGPLGPPGKAGPVGPPGLGFPGPPGERGHPGVPGHPGVRGPDGLKG
QKGDTISCNVTYPGRHGPPGFDGPPGPKGFPGPQGAPGLSGSDGHKGRPGTPGTAEIPGP
PGFRGDMGDPGFGGEKGSSPVGPPGPPGSPGVNGQKGIPGDPAFGHLGPPGKRGLSGVPG
IKGPRGDPGCPGAEGPAGIPGFLGLKGPKGREGHAGFPGVPGPPGHSCERGAPGIPGQPG
LPGYPGSPGAPGGKGQPGDVGPPGPAGMKGLPGLPGRPGAHGPPGLPGIPGPFGDDGLPG
PPGPKGPRGLPGFPGFPGERGKPGAEGCPGAKGEPGEKGMSGLPGDRGLRGAKGAIGPPG
DEGEMAIISQKGTPGEPGPPGDDGFPGERGDKGTPGMQGRRGEPGRYGPPGFHRGEPGEK
GQPGPPGPPGPPGSTGLRGFIGFPGLPGDQGEPGSPGPPGFSGIDGARGPKGNKGDPASH
FGPPGPKGEPGSPGCPGHFGASGEQGLPGIQGPRGSPGRPGPPGSSGPPGCPGDHGMPGL
RGQPGEMGDPGPRGLQGDPGIPGPPGIKGPSGSPGLNGLHGLKGQKGTKGASGLHDVGPP
GPVGIPGLKGERGDPGSPGISPPGPRGKKGPPGPPGSSGPPGPAGATGRAPKDIPDPGPP
GDQGPPGPDGPRGAPGPPGLPGSVDLLRGEPGDCGLPGPPGPPGPPGPPGYKGFPGCDGK
DGQKGPVGFPGPQGPHGFPGPPGEKGLPGPPGRKGPTGLPGPRGEPGPPADVDDCPRIPG
LPGAPGMRGPEGAMGLPGMRGPSGPGCKGEPGLDGRRGVDGVPGSPGPPGRKGDTGEDGY
PGGPGPPGPIGDPGPKGFGPGYLGGFLLVLHSQTDQEPTCPLGMPRLWTGYSLLYLEGQE
KAHNQDLGLAGSCLPVFSTLPFAYCNIHQVCHYAQRNDRSYWLASAAPLPMMPLSEEAIR
PYVSRCAVCEAPAQAVAVHSQDQSIPPCPQTWRSLWIGYSFLMHTGAGDQGGGQALMSPG
SCLEDFRAAPFLECQGRQGTCHFFANKYSFWLTTVKADLQFSSAPAPDTLKESQAQRQKI
SRCQVCVKYS
NT seq 5073 nt   +upstreamnt  +downstreamnt
atgtggtctctgcacatagtactaatgaggtgctccttcagattgaccaagtccttggcc
acaggtccctggtcacttatactcattctcttttctgtacaatatgtatatgggagtgga
aagaaatacattggtccttgtggaggaagagattgctctgtttgccactgtgttcctgaa
aaggggtctcggggtccaccaggaccaccagggccacagggtccaattggacccctggga
gccccaggacccattgggctttcaggagagaaaggaatgagaggggaccgcggccctcct
ggagcagcaggggacaaaggagataagggtccaactggtgttcctggatttccaggttta
gatggcatacctgggcacccagggcctcctggacccagaggcaaacctggtatgagtggc
cacaatggctcaagaggtgacccagggtttccaggaggaagaggagctcttggcccagga
ggccccctaggccatcctggggaaaagggagaaaaaggaaattcagtgttcattttaggt
gccgttaaaggtattcagggagacagaggggacccaggactgcctggcttaccaggatct
tggggtgcaggaggaccggcaggtcccacaggatatcctggagagccagggttagtggga
cctccgggccaaccagggcgtccaggtttgaagggaaatcccggtgtgggagtaaagggg
caaatgggagacccgggtgaggttggtcagcaaggttctcctggacccaccctgttggta
gagccacctgacttttgtctctataaaggagaaaagggtataaaaggaattcctggaatg
gttggactgccaggaccaccaggacgcaagggagaatctggtattggggcaaaaggagaa
aaaggtattcctggatttccagggcctcggggggatcctggttcctatggatctccaggt
tttccaggattaaagggagaactaggactggttggagatcctgggctatttggattaatt
ggcccaaagggggatcctggaaatcgagggcacccaggaccaccaggtgttttggtgact
ccacctcttccactcaaaggcccaccaggggacccagggttccctggccgctatggagaa
acaggggatgttggaccacctggtcccccaggtctcttgggcagaccaggggaagcctgt
gcaggcatgataggaccccctgggccacaaggatttcctggtcttcctgggcttccagga
gaagctggtattcctgggagacctgattctgctccaggaaaaccagggaagccaggatca
cctggcttgcctggagcaccaggcctgcagggcctcccaggatcaagtgtgatatactgt
agtgttgggaaccccggaccacaaggaataaaaggcaaagttggtcccccaggaggaaga
ggcccaaaaggagaaaaaggaaatgaaggactctgtgcctgtgagcctggacccatgggc
ccccctggccctccaggacttcctgggaggcaggggagtaagggagacttggggctccct
ggctggcttggaacaaaaggtgacccaggacctcctggtgctgaaggacctccagggcta
ccaggaaagcatggtgcctctggaccacctggcaacaaaggggcgaagggtgacatggtt
gtatcaagagttaaagggcacaaaggagaaagaggtcctgatgggcccccaggatttcca
gggcagccaggatcacatggtcgggatggacatgctggagaaaaaggggatccaggacct
ccaggggatcatgaagatgcgaccccaggtggtaaaggatttcctggacctctgggcccc
ccaggcaaagcaggacctgtggggcccccaggactgggatttcctggtccaccaggagag
cgaggccacccaggagttccaggccacccaggtgtgaggggccctgatggcttgaagggt
cagaaaggtgacacaatttcttgcaacgtaacctaccctgggaggcatggccctccaggt
tttgatggacctccaggtccgaagggatttccaggtccccaaggtgcccctgggctgagt
ggttcagatgggcataaaggcagacctggcacaccaggaacagcggaaataccaggtcca
cctggttttcgtggtgacatgggagatccgggttttggaggtgaaaaggggtcctcccct
gttgggcccccaggccctcccggctcaccaggagtgaatggtcagaaaggaatcccggga
gaccctgcatttggtcacctgggacccccgggaaagaggggtctttcaggagtgccaggg
ataaaaggacccagaggtgatccgggatgtccaggggctgaagggccagctggcattcct
ggattcctaggtctcaaaggtcccaaaggcagagagggacatgctgggtttccaggtgtc
ccaggtccacctggccattcctgtgaaagaggtgctccagggataccagggcaaccggga
ctccctgggtatccaggtagcccaggtgctccaggtgggaaaggacagccgggagatgtg
gggcctcccgggccagctggaatgaaaggcctccccggactcccaggacggcctggggca
catggtcccccaggcctcccaggaatcccaggtccctttggagatgatgggctacctggt
cctccaggtccaaagggaccccgggggctgcctggtttcccaggttttcccggagaaaga
ggaaagcctggtgcagagggatgtcctggcgcaaagggagaacctggagagaagggcatg
tctggccttcctggagaccggggactgagaggggccaaaggagccataggacctcccgga
gatgaaggagaaatggctatcatttcacaaaagggaacacctggggaacctggacctcct
ggagatgatggattcccaggagaaagaggtgataaaggaactcccgggatgcaagggaga
agaggagagccgggaagatacggaccacctggatttcacagaggggaacctggtgagaaa
ggtcagccagggcctcctggacccccaggccctccaggctcaactggtctaagagggttc
attggttttccaggacttccaggtgaccagggtgagccaggttctccaggtccccctgga
ttttcaggaattgatggagcaagaggacctaaaggaaacaaaggtgaccctgccagtcac
tttggtccacctggtccaaagggtgagccaggtagccctggatgtccagggcattttgga
gcatccggagagcagggcttgcctggtattcaagggcccagaggatcacctggaaggcca
gggccacctggctcctctggaccaccagggtgcccaggtgatcacgggatgcctgggctg
aggggacagccaggagaaatgggagaccctgggccaagaggcctccagggggatccaggg
ataccaggtcctccgggaataaaaggtccctccggatcacctggcctgaacggcttgcat
ggattgaaaggtcagaaaggaactaaaggtgcttcaggtttgcatgatgtggggccacct
ggtccagtgggaatacctgggctaaaaggggagagaggagaccctgggagcccaggaatc
tctcctccaggtcctcgtggaaagaaaggtcccccaggacccccagggagttcaggacca
cctggtcctgcaggtgccacaggaagagctcctaaggacattcctgacccgggtccacct
ggagatcagggacctcctggtcctgatggcccaagaggagcacctgggcctccaggcctc
cctgggagtgttgaccttctgagaggggagccaggtgactgtggtctaccagggccacca
ggtccccctggcccaccaggccctccaggatacaaaggctttccaggatgtgatggaaaa
gatggccagaaaggaccagtgggattcccgggaccgcagggaccacatggatttcctggg
ccacctggagagaagggtttacctggacctccagggagaaaagggcccactggtcttccg
ggtcccagaggtgaaccggggccacctgcagatgtggatgactgtccccgaatcccaggc
cttcctggggcgccaggcatgagaggaccagaaggagccatggggctccctggaatgaga
ggcccctcaggaccagggtgcaaaggagagcctgggctggatggcaggaggggtgtggat
ggcgtccctgggtctcctgggcctcccggacgtaaaggtgacacaggagaagacggctac
cctggaggaccagggcctcctggtcccattggggatcctgggcccaaagggtttggccct
ggatacctcggtggcttcctcctggttctccacagtcagacggaccaggagcccacctgc
cccctgggcatgcccaggctctggactgggtatagtctgttatacctggaagggcaagag
aaagctcacaatcaagaccttggtctggcagggtcttgccttcccgtatttagcacgctg
ccctttgcctactgcaacatccaccaggtgtgccactatgcccagagaaacgacagatcc
tactggctggccagcgctgcgcccctccccatgatgccactctctgaagaggcgatccgc
ccctatgtcagccgctgtgcggtatgcgaggccccggcccaggcggtggcggtgcacagc
caggaccagtccatccccccatgtccgcagacctggaggagcctctggatcgggtattca
ttcctgatgcacacaggagctggggaccaaggaggagggcaggcccttatgtcacctggc
agctgcctggaagatttcagagcagcaccattccttgaatgccagggccggcagggaact
tgccactttttcgcaaataagtatagcttctggctcacaacggtgaaagcagacttgcag
ttttcctctgctccagcaccagacaccttaaaagaaagccaggcccaacgccagaaaatc
agccggtgccaggtctgcgtgaagtatagctag

KEGG   Homo sapiens (human): 1287
Entry
1287              CDS       T01001                                 
Symbol
COL4A5, ASLN, ATS, ATS1, CA54
Name
(RefSeq) collagen type IV alpha 5 chain
  KO
K06237  collagen type IV alpha
Organism
hsa  Homo sapiens (human)
Pathway
hsa04151  PI3K-Akt signaling pathway
hsa04510  Focal adhesion
hsa04512  ECM-receptor interaction
hsa04820  Cytoskeleton in muscle cells
hsa04926  Relaxin signaling pathway
hsa04933  AGE-RAGE signaling pathway in diabetic complications
hsa04974  Protein digestion and absorption
hsa05146  Amoebiasis
hsa05165  Human papillomavirus infection
hsa05200  Pathways in cancer
hsa05222  Small cell lung cancer
Network
nt06539  Cytoskeleton in muscle cells
  Element
N01814  Extracellular matrix - Basal lamina
Disease
H00581  Alport syndrome
H01640  Uterine leiomyoma
Brite
KEGG Orthology (KO) [BR:hsa00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    1287 (COL4A5)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    1287 (COL4A5)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    1287 (COL4A5)
  09142 Cell motility
   04820 Cytoskeleton in muscle cells
    1287 (COL4A5)
 09150 Organismal Systems
  09152 Endocrine system
   04926 Relaxin signaling pathway
    1287 (COL4A5)
  09154 Digestive system
   04974 Protein digestion and absorption
    1287 (COL4A5)
 09160 Human Diseases
  09161 Cancer: overview
   05200 Pathways in cancer
    1287 (COL4A5)
  09162 Cancer: specific types
   05222 Small cell lung cancer
    1287 (COL4A5)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    1287 (COL4A5)
  09174 Infectious disease: parasitic
   05146 Amoebiasis
    1287 (COL4A5)
  09167 Endocrine and metabolic disease
   04933 AGE-RAGE signaling pathway in diabetic complications
    1287 (COL4A5)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   04147 Exosome [BR:hsa04147]
    1287 (COL4A5)
   00536 Glycosaminoglycan binding proteins [BR:hsa00536]
    1287 (COL4A5)
Exosome [BR:hsa04147]
 Exosomal proteins
  Exosomal proteins of other cancer cells
   1287 (COL4A5)
Glycosaminoglycan binding proteins [BR:hsa00536]
 Heparan sulfate / Heparin
  Extracellular matrix molecules
   1287 (COL4A5)
SSDB
Motif
Pfam: Collagen C4
Other DBs
NCBI-GeneID: 1287
NCBI-ProteinID: NP_000486
OMIM: 303630
HGNC: 2207
Ensembl: ENSG00000188153
UniProt: P29400 Q49AM6 A7MBN3
Structure
LinkDB
Position
X:108439838..108697545
AA seq 1685 aa
MKLRGVSLAAGLFLLALSLWGQPAEAAACYGCSPGSKCDCSGIKGEKGERGFPGLEGHPG
LPGFPGPEGPPGPRGQKGDDGIPGPPGPKGIRGPPGLPGFPGTPGLPGMPGHDGAPGPQG
IPGCNGTKGERGFPGSPGFPGLQGPPGPPGIPGMKGEPGSIIMSSLPGPKGNPGYPGPPG
IQGLPGPTGIPGPIGPPGPPGLMGPPGPPGLPGPKGNMGLNFQGPKGEKGEQGLQGPPGP
PGQISEQKRPIDVEFQKGDQGLPGDRGPPGPPGIRGPPGPPGGEKGEKGEQGEPGKRGKP
GKDGENGQPGIPGLPGDPGYPGEPGRDGEKGQKGDTGPPGPPGLVIPRPGTGITIGEKGN
IGLPGLPGEKGERGFPGIQGPPGLPGPPGAAVMGPPGPPGFPGERGQKGDEGPPGISIPG
PPGLDGQPGAPGLPGPPGPAGPHIPPSDEICEPGPPGPPGSPGDKGLQGEQGVKGDKGDT
CFNCIGTGISGPPGQPGLPGLPGPPGSLGFPGQKGEKGQAGATGPKGLPGIPGAPGAPGF
PGSKGEPGDILTFPGMKGDKGELGSPGAPGLPGLPGTPGQDGLPGLPGPKGEPGGITFKG
ERGPPGNPGLPGLPGNIGPMGPPGFGPPGPVGEKGIQGVAGNPGQPGIPGPKGDPGQTIT
QPGKPGLPGNPGRDGDVGLPGDPGLPGQPGLPGIPGSKGEPGIPGIGLPGPPGPKGFPGI
PGPPGAPGTPGRIGLEGPPGPPGFPGPKGEPGFALPGPPGPPGLPGFKGALGPKGDRGFP
GPPGPPGRTGLDGLPGPKGDVGPNGQPGPMGPPGLPGIGVQGPPGPPGIPGPIGQPGLHG
IPGEKGDPGPPGLDVPGPPGERGSPGIPGAPGPIGPPGSPGLPGKAGASGFPGTKGEMGM
MGPPGPPGPLGIPGRSGVPGLKGDDGLQGQPGLPGPTGEKGSKGEPGLPGPPGPMDPNLL
GSKGEKGEPGLPGIPGVSGPKGYQGLPGDPGQPGLSGQPGLPGPPGPKGNPGLPGQPGLI
GPPGLKGTIGDMGFPGPQGVEGPPGPSGVPGQPGSPGLPGQKGDKGDPGISSIGLPGLPG
PKGEPGLPGYPGNPGIKGSVGDPGLPGLPGTPGAKGQPGLPGFPGTPGPPGPKGISGPPG
NPGLPGEPGPVGGGGHPGQPGPPGEKGKPGQDGIPGPAGQKGEPGQPGFGNPGPPGLPGL
SGQKGDGGLPGIPGNPGLPGPKGEPGFHGFPGVQGPPGPPGSPGPALEGPKGNPGPQGPP
GRPGLPGPEGPPGLPGNGGIKGEKGNPGQPGLPGLPGLKGDQGPPGLQGNPGRPGLNGMK
GDPGLPGVPGFPGMKGPSGVPGSAGPEGEPGLIGPPGPPGLPGPSGQSIIIKGDAGPPGI
PGQPGLKGLPGPQGPQGLPGPTGPPGDPGRNGLPGFDGAGGRKGDPGLPGQPGTRGLDGP
PGPDGLQGPPGPPGTSSVAHGFLITRHSQTTDAPQCPQGTLQVYEGFSLLYVQGNKRAHG
QDLGTAGSCLRRFSTMPFMFCNINNVCNFASRNDYSYWLSTPEPMPMSMQPLKGQSIQPF
ISRCAVCEAPAVVIAVHSQTIQIPHCPQGWDSLWIGYSFMMHTSAGAEGSGQALASPGSC
LEEFRSAPFIECHGRGTCNYYANSYSFWLATVDVSDMFSKPQSETLKAGDLRTRISRCQV
CMKRT
NT seq 5058 nt   +upstreamnt  +downstreamnt
atgaaactgcgtggagtcagcctggctgccggcttgttcttactggccctgagtctttgg
gggcagcctgcagaggctgcggcttgctatgggtgttctccaggatcaaagtgtgactgc
agtggcataaaaggggaaaagggagagagagggtttccaggtttggaaggacacccagga
ttgcctggatttccaggtccagaagggcctccggggcctcggggacaaaagggtgatgat
ggaattccagggccaccaggaccaaaaggaatcagaggtcctcctggacttcctggattt
ccagggacaccaggtcttcctggaatgccaggccacgatggggccccaggacctcaaggt
attcccggatgcaatggaaccaagggagaacgtggatttccaggcagtcccggttttcct
ggtttacagggtcctccaggaccccctgggatcccaggtatgaagggtgaaccaggtagt
ataattatgtcatcactgccaggaccaaagggtaatccaggatatccaggtcctcctgga
atacaaggcctacctggtcccactggtataccagggccaattggtcccccaggaccacca
ggtttgatgggccctcctggtccaccaggacttccaggacctaaggggaatatgggctta
aatttccagggacccaaaggtgaaaaaggtgagcaaggtcttcagggcccacctgggcca
cctgggcagatcagtgaacagaaaagaccaattgatgtagagtttcagaaaggagatcag
ggacttcctggtgaccgagggcctcctggacctccagggatacgtggtcctccaggtccc
ccaggtggtgagaaaggtgagaagggtgagcaaggagagccaggcaaaagaggtaaacca
ggcaaagatggagaaaatggccaaccaggaattcctggtttgcctggtgatcctggttac
cctggtgaacccggaagggatggtgaaaagggccaaaaaggtgacactggcccacctgga
cctcctggacttgtaattcctagacctgggactggtataactataggagaaaaaggaaac
attgggttgcctgggttgcctggagaaaaaggagagcgaggatttcctggaatacagggt
ccacctggccttcctggacctccaggggctgcagttatgggtcctcctggccctcctgga
tttcctggagaaaggggtcagaaaggtgatgaaggaccacctggaatttccattcctgga
cctcctggacttgacggacagcctggggctcctgggcttccagggcctcctggccctgct
ggccctcacattcctcctagtgatgagatatgtgaaccaggccctccaggccccccagga
tctccaggtgataaaggactccaaggagaacaaggagtgaaaggtgacaaaggtgacact
tgcttcaactgcattggaactggtatttcagggcctccaggtcaacctggtttgccaggt
ctcccaggtcctccaggatctcttggtttccctggacagaaaggggaaaaaggacaagct
ggtgcaactggtcccaaaggattaccaggcattccaggagctccaggtgctccaggcttt
cctggatctaaaggtgaacctggtgatatcctcacttttccaggaatgaagggtgacaaa
ggagagttgggttcccctggagctccagggcttcctggtttacctggcactcctggacag
gatggattgccagggcttcctggcccgaaaggagagcctggtggaattacttttaagggt
gaaagaggtccccctgggaacccaggtttaccaggcctcccagggaatatagggcctatg
ggtccccctggtttcggccctccaggcccagtaggtgaaaaaggcatacaaggtgtggca
ggaaatccaggccagccaggaataccaggtcctaaaggggatccaggtcagactataacc
cagccggggaagcctggcttgcctggtaacccaggcagagatggtgatgtaggtcttcca
ggtgaccctggacttccagggcaaccaggcttgccagggatacctggtagcaaaggagaa
ccaggtatccctggaattgggcttcctggaccacctggtcccaaaggctttcctggaatt
ccaggacctccaggagcacctgggacacctggaagaattggtctagaaggccctcctggg
ccacccggctttccaggaccaaagggtgaaccaggatttgcattacctgggccacctggg
ccaccaggacttccaggtttcaaaggagcacttggtccaaaaggtgatcgtggtttccca
ggacctccgggtcctccaggacgcactggcttagatgggctccctggaccaaaaggtgat
gttggaccaaatggacaacctggaccaatgggacctcctgggctgccaggaataggtgtt
cagggaccaccaggaccaccagggattcctgggccaataggtcaacctggtttacatgga
ataccaggagagaagggggatccaggacctcctggacttgatgttccaggacccccaggt
gaaagaggcagtccagggatccccggagcacctggtcctataggacctccaggatcacca
gggcttccaggaaaagcaggtgcctctggatttccaggtaccaaaggtgaaatgggtatg
atgggacctccaggcccaccaggacctttgggaattcctggcaggagtggtgtacctggt
cttaaaggtgatgatggcttgcagggtcagccaggacttcctggccctacaggagaaaaa
ggtagtaaaggagagcctggccttccaggccctcctggaccaatggatccaaatcttctg
ggctcaaaaggagagaagggggaacctggcttaccaggtatacctggagtttcagggcca
aaaggttatcagggtttgcctggagacccagggcaacctggactgagtggacaacctgga
ttaccaggaccaccaggtcccaaaggtaaccctggtctccctggacagccaggtcttata
ggacctcctggacttaaaggaaccatcggtgatatgggttttccagggcctcagggtgtg
gaagggcctcctggaccttctggagttcctggacaacctggctccccaggattacctgga
cagaaaggcgacaaaggtgatcctggtatttcaagcattggtcttccaggtcttcctggt
ccaaagggtgagcctggtctgcctggatacccagggaaccctggtatcaaaggttctgtg
ggagatcctggtttgcccggattaccaggaacccctggagcaaaaggacaaccaggcctt
cctggattcccaggaaccccaggccctcctggaccaaaaggtattagtggccctcctggg
aaccccggccttccaggagaacctggtcctgtaggtggtggaggtcatcctgggcaacca
gggcctccaggcgaaaaaggcaaacccggtcaagatggtattcctggaccagctggacag
aagggtgaaccaggtcaaccaggctttggaaacccaggaccccctggacttccaggactt
tctggccaaaagggtgatggaggattacctgggattccaggaaatcctggccttccaggt
ccaaagggcgaaccaggctttcacggtttccctggtgtgcagggtcccccaggccctcct
ggttctccgggtccagctctggaaggacctaaaggcaaccctgggccccaaggtcctcct
gggagaccaggtctaccaggtccagaaggtcctccaggtctccctggaaatggaggtatt
aaaggagagaagggaaatccaggccaacctgggctacctggcttgcctggtttgaaagga
gatcaaggaccaccaggactccagggtaatcctggccggccgggtctcaatggaatgaaa
ggagatcctggtctccctggtgttccaggattcccaggcatgaaaggacccagtggagta
cctggatcagctggccctgagggggaaccgggacttattggtcctccaggtcctcctgga
ttacctggtccttcaggacagagtatcataattaaaggagatgctggtcctccaggaatc
cctggccagcctgggctaaagggtctaccaggaccccaaggacctcaaggcttaccaggt
ccaactggccctccaggagatcctggacgcaatggactccctggctttgatggtgcagga
gggcgcaaaggagacccaggtctgccaggacagccaggtacccgtggtttggatggtccc
cctggtccagatggattgcaaggtcccccaggtccccctggaacctcctctgttgcacat
ggatttcttattacacgccacagccagacaacggatgcaccacaatgcccacagggaaca
cttcaggtctatgaaggcttttctctcctgtatgtacaaggaaataaaagagcccacggt
caagacttggggacggctggcagctgccttcgtcgctttagtaccatgcctttcatgttc
tgcaacatcaataatgtttgcaactttgcttcaagaaatgactattcttactggctctct
accccagagcccatgccaatgagcatgcaacccctaaagggccagagcatccagccattc
attagtcgatgtgcagtatgtgaagctccagctgtggtgatcgcagttcacagtcagacg
atccagattccccattgtcctcagggatgggattctctgtggattggttattccttcatg
atgcatacaagtgcaggggcagaaggctcaggtcaagccctagcctcccctggttcctgc
ttggaagagtttcgttcagctcccttcatcgaatgtcatgggaggggtacctgtaactac
tatgccaactcctacagcttttggctggcaactgtagatgtgtcagacatgttcagtaaa
cctcagtcagaaacgctgaaagcaggagacttgaggacacgaattagccgatgtcaagtg
tgcatgaagaggacataa

KEGG   Homo sapiens (human): 1288
Entry
1288              CDS       T01001                                 
Symbol
COL4A6, CXDELq22.3, DELXq22.3, DFNX6
Name
(RefSeq) collagen type IV alpha 6 chain
  KO
K06237  collagen type IV alpha
Organism
hsa  Homo sapiens (human)
Pathway
hsa04151  PI3K-Akt signaling pathway
hsa04510  Focal adhesion
hsa04512  ECM-receptor interaction
hsa04820  Cytoskeleton in muscle cells
hsa04926  Relaxin signaling pathway
hsa04933  AGE-RAGE signaling pathway in diabetic complications
hsa04974  Protein digestion and absorption
hsa05146  Amoebiasis
hsa05165  Human papillomavirus infection
hsa05200  Pathways in cancer
hsa05222  Small cell lung cancer
Network
nt06539  Cytoskeleton in muscle cells
  Element
N01814  Extracellular matrix - Basal lamina
Disease
H01209  Deafness, X-linked
H01640  Uterine leiomyoma
Brite
KEGG Orthology (KO) [BR:hsa00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    1288 (COL4A6)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    1288 (COL4A6)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    1288 (COL4A6)
  09142 Cell motility
   04820 Cytoskeleton in muscle cells
    1288 (COL4A6)
 09150 Organismal Systems
  09152 Endocrine system
   04926 Relaxin signaling pathway
    1288 (COL4A6)
  09154 Digestive system
   04974 Protein digestion and absorption
    1288 (COL4A6)
 09160 Human Diseases
  09161 Cancer: overview
   05200 Pathways in cancer
    1288 (COL4A6)
  09162 Cancer: specific types
   05222 Small cell lung cancer
    1288 (COL4A6)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    1288 (COL4A6)
  09174 Infectious disease: parasitic
   05146 Amoebiasis
    1288 (COL4A6)
  09167 Endocrine and metabolic disease
   04933 AGE-RAGE signaling pathway in diabetic complications
    1288 (COL4A6)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   04147 Exosome [BR:hsa04147]
    1288 (COL4A6)
   00536 Glycosaminoglycan binding proteins [BR:hsa00536]
    1288 (COL4A6)
Exosome [BR:hsa04147]
 Exosomal proteins
  Exosomal proteins of other cancer cells
   1288 (COL4A6)
Glycosaminoglycan binding proteins [BR:hsa00536]
 Heparan sulfate / Heparin
  Extracellular matrix molecules
   1288 (COL4A6)
SSDB
Motif
Pfam: Collagen C4
Other DBs
NCBI-GeneID: 1288
NCBI-ProteinID: NP_001838
OMIM: 303631
HGNC: 2208
Ensembl: ENSG00000197565
UniProt: Q14031
LinkDB
Position
X:complement(108155614..108439458)
AA seq 1691 aa
MLINKLWLLLVTLCLTEELAAAGEKSYGKPCGGQDCSGSCQCFPEKGARGRPGPIGIQGP
TGPQGFTGSTGLSGLKGERGFPGLLGPYGPKGDKGPMGVPGFLGINGIPGHPGQPGPRGP
PGLDGCNGTQGAVGFPGPDGYPGLLGPPGLPGQKGSKGDPVLAPGSFKGMKGDPGLPGLD
GITGPQGAPGFPGAVGPAGPPGLQGPPGPPGPLGPDGNMGLGFQGEKGVKGDVGLPGPAG
PPPSTGELEFMGFPKGKKGSKGEPGPKGFPGISGPPGFPGLGTTGEKGEKGEKGIPGLPG
PRGPMGSEGVQGPPGQQGKKGTLGFPGLNGFQGIEGQKGDIGLPGPDVFIDIDGAVISGN
PGDPGVPGLPGLKGDEGIQGLRGPSGVPGLPALSGVPGALGPQGFPGLKGDQGNPGRTTI
GAAGLPGRDGLPGPPGPPGPPSPEFETETLHNKESGFPGLRGEQGPKGNLGLKGIKGDSG
FCACDGGVPNTGPPGEPGPPGPWGLIGLPGLKGARGDRGSGGAQGPAGAPGLVGPLGPSG
PKGKKGEPILSTIQGMPGDRGDSGSQGFRGVIGEPGKDGVPGLPGLPGLPGDGGQGFPGE
KGLPGLPGEKGHPGPPGLPGNGLPGLPGPRGLPGDKGKDGLPGQQGLPGSKGITLPCIIP
GSYGPSGFPGTPGFPGPKGSRGLPGTPGQPGSSGSKGEPGSPGLVHLPELPGFPGPRGEK
GLPGFPGLPGKDGLPGMIGSPGLPGSKGATGDIFGAENGAPGEQGLQGLTGHKGFLGDSG
LPGLKGVHGKPGLLGPKGERGSPGTPGQVGQPGTPGSSGPYGIKGKSGLPGAPGFPGISG
HPGKKGTRGKKGPPGSIVKKGLPGLKGLPGNPGLVGLKGSPGSPGVAGLPALSGPKGEKG
SVGFVGFPGIPGLPGIPGTRGLKGIPGSTGKMGPSGRAGTPGEKGDRGNPGPVGIPSPRR
PMSNLWLKGDKGSQGSAGSNGFPGPRGDKGEAGRPGPPGLPGAPGLPGIIKGVSGKPGPP
GFMGIRGLPGLKGSSGITGFPGMPGESGSQGIRGSPGLPGASGLPGLKGDNGQTVEISGS
PGPKGQPGESGFKGTKGRDGLIGNIGFPGNKGEDGKVGVSGDVGLPGAPGFPGVAGMRGE
PGLPGSSGHQGAIGPLGSPGLIGPKGFPGFPGLHGLNGLPGTKGTHGTPGPSITGVPGPA
GLPGPKGEKGYPGIGIGAPGKPGLRGQKGDRGFPGLQGPAGLPGAPGISLPSLIAGQPGD
PGRPGLDGERGRPGPAGPPGPPGPSSNQGDTGDPGFPGIPGPKGPKGDQGIPGFSGLPGE
LGLKGMRGEPGFMGTPGKVGPPGDPGFPGMKGKAGPRGSSGLQGDPGQTPTAEAVQVPPG
PLGLPGIDGIPGLTGDPGAQGPVGLQGSKGLPGIPGKDGPSGLPGPPGALGDPGLPGLQG
PPGFEGAPGQQGPFGMPGMPGQSMRVGYTLVKHSQSEQVPPCPIGMSQLWVGYSLLFVEG
QEKAHNQDLGFAGSCLPRFSTMPFIYCNINEVCHYARRNDKSYWLSTTAPIPMMPVSQTQ
IPQYISRCSVCEAPSQAIAVHSQDITIPQCPLGWRSLWIGYSFLMHTAAGAEGGGQSLVS
PGSCLEDFRATPFIECSGARGTCHYFANKYSFWLTTVEERQQFGELPVSETLKAGQLHTR
VSRCQVCMKSL
NT seq 5076 nt   +upstreamnt  +downstreamnt
atgcttataaacaagttgtggctgctcctggttacgttgtgcctgaccgaggaactggca
gcagcgggagagaagtcttatggaaagccatgtgggggccaggactgcagtgggagctgt
cagtgttttcctgagaaaggagcgagaggacgacctggaccaattggaattcaaggccca
acaggtcctcaaggattcactggctctactggtttatcgggattgaaaggagaaaggggt
ttcccaggccttctgggaccttatggaccaaaaggagataagggtcccatgggagttcct
ggctttcttggcatcaatgggattccgggccaccctggacaaccaggccccagaggccca
cctggtctggatggctgtaatggaactcaaggagctgttggatttccaggccctgatggc
tatcctgggcttctcggaccacccgggcttcctggtcagaaaggatcaaaaggtgaccct
gtccttgctccaggtagtttcaaaggaatgaagggggatcctgggctgcctggactggat
ggaatcactggcccacaaggagcacccggatttcctggagctgtaggacctgcaggacca
ccaggattacaaggtcctccagggcctcctggtcctcttggtcctgatgggaatatgggg
ctaggttttcaaggagagaaaggagtcaagggggatgttggcctccctggcccagcagga
cctccaccatctactggagagctggaattcatgggattccccaaagggaagaaaggatcc
aagggtgaaccagggcctaagggttttccaggcataagtggccctccaggcttcccgggc
cttggaactactggagaaaagggagaaaagggagaaaagggaatccctggtttgccagga
cctaggggtcccatgggttcagaaggagtccaaggccctccagggcaacagggcaagaaa
gggaccctgggatttcctgggcttaatggattccaaggaattgagggtcaaaagggtgac
attggcctgccaggcccagatgttttcatcgatatagatggtgctgtgatctcaggtaat
cctggagatcctggtgtacctggcctcccaggccttaaaggagatgaaggcatccaaggc
ctacgtggcccttctggtgtccctggattgccagcattatcaggtgtcccaggagcccta
gggcctcagggatttccagggctgaagggggaccaaggaaacccaggccgtaccacaatt
ggagcagctggcctccctggcagagatggtttgccaggcccaccaggtccaccaggccca
cctagtccagaatttgagactgaaactctacacaacaaagagtcagggttccctggtctc
cgaggagaacaaggtccaaaaggaaacctaggcctcaaaggaataaaaggagactcaggt
ttctgtgcttgtgacggtggtgttcccaacactggaccacccggggaaccaggcccacct
ggtccatggggtctcataggccttccaggccttaaaggagccagaggagatcgaggctct
gggggtgcacagggcccagcaggggctccaggcttagttgggcctctgggtccttcagga
cccaaaggaaagaagggggaaccaattctcagtacaatccaaggaatgccaggagatcgg
ggtgattctggctcccagggcttccgtggtgtaataggagaaccaggcaaggacggagta
ccaggtttaccaggtctgccaggccttccgggtgatggtggacagggcttcccaggtgaa
aaggggttacctggacttcctggtgaaaaaggccatcctggtccacctggcctcccagga
aatgggttaccaggacttcctggaccccgtgggcttcctggagataaaggcaaggatgga
ttaccgggacaacaaggccttcccggatctaagggaatcaccctgccctgtattattcct
gggtcatacggtccatcaggatttccaggcactcccggattcccaggccctaaagggtct
cgaggcctccctgggaccccaggccagcctgggtcaagtggaagtaaaggagagccaggg
agtccaggattggttcatcttcctgaattaccaggatttcctggacctcgtggggagaag
ggcttgcctgggtttcctgggctccctggaaaagatggcttgcctgggatgattggcagt
ccaggcttacctggttccaagggagccactggtgacatctttggtgctgaaaatggtgct
ccgggggaacaaggcctacaaggattaacagggcacaaaggatttcttggagactctggc
cttccaggactcaagggtgtgcacgggaagcctggcttactaggccccaaaggtgagcgg
ggcagccctgggacaccaggacaggtgggacagccaggcaccccaggatctagtggtcca
tatggcatcaagggcaaatctgggctcccaggagcaccaggcttcccaggcatctcagga
catcctggaaagaaaggaacaagaggcaagaaaggtcctcctggatcaattgtaaagaaa
gggctgccagggctaaaaggccttcctggaaatccaggcctagtaggactgaaaggaagc
ccaggctctccaggggtcgctgggttgccagccctctctggacccaagggagagaagggg
tctgttggattcgtaggttttccaggaataccaggtctgcctggtattcctggaacaaga
ggattaaaaggaattccaggatcaactggaaaaatgggaccatctggacgtgctggtact
cctggtgaaaagggagacagaggcaatccggggccagtcggaatacctagtccaagacgt
ccaatgtcaaacctttggctcaaaggagacaaaggctctcaaggctcagccggatccaat
ggatttcctgggccaagaggtgacaaaggagaggctggtcgacctggaccaccaggccta
cctggagctcctggcctcccaggcattatcaaaggagttagtggaaagccagggccccct
ggcttcatgggaatccggggcttacctggcctgaaggggtcctctgggatcacaggtttc
ccaggaatgccaggagaaagtggttcacaaggtatcagagggtcgcctggactcccagga
gcatctggtctcccaggcctgaaaggagacaacggccagacagttgaaatttccggtagc
ccaggacccaagggacagcctggcgaatctggttttaaaggcacaaaaggaagagatgga
ctaataggcaatataggcttccctggaaacaaaggtgaagatggaaaagttggtgtttct
ggagatgttggccttcctggagctccaggatttccaggagttgccggcatgagaggagaa
ccaggacttccaggttcttctggtcaccaaggggcaattgggcctctaggatcccccgga
ttaataggacccaaaggcttccctggatttcctggtttacatggactgaatgggcttccg
ggcaccaagggtacccatggcactccaggacctagtatcaccggtgtgcctgggcctgct
ggtctccctggacccaaaggagaaaaaggatatccaggaattggcatcggagctccaggg
aagccgggcctgagagggcaaaaaggtgatcgaggtttcccaggtctccagggccctgct
ggtctccccggtgccccaggcatctccttgccctcactcatagcaggacagcctggtgac
cccgggcgaccaggcctagatggagaacgaggccgcccaggccccgctggacccccaggt
ccccctgggccatcctcgaatcaaggcgacaccggagaccctggcttccctggaattcct
ggacctaaagggcctaagggagaccaaggaattccaggtttttctggcctccctggagag
ctaggactgaaaggcatgagaggtgagcctggcttcatggggactccaggcaaggttggg
ccacctggagacccaggatttcccggaatgaaggggaaggcagggccaagaggctcttct
ggcctccaaggtgatcctggacaaacaccaactgcagaagctgtccaggttcctcctgga
cccttgggtctaccagggatcgatggcatccctggcctcactggggaccctggggctcaa
ggccctgtaggcctacaaggctccaaaggtttacctggcatccccggtaaagatggcccc
agtgggctcccaggcccacctggggctcttggtgatcctggtctgcctggactgcaaggc
cctccaggatttgaaggagctccagggcagcaaggccccttcgggatgcctggaatgcct
ggccagagcatgagagtgggctacacgttggtaaagcacagccagtcggaacaggtgccc
ccgtgtcccatcgggatgagccagctgtgggtggggtacagcttactgtttgtggagggg
caagagaaagcccacaaccaggacctgggctttgctggctcctgtctgccccgcttcagc
accatgcccttcatctactgcaacatcaacgaggtgtgccactatgccaggcgcaatgat
aaatcttactggctctccactaccgcccctatccccatgatgcccgtcagccagacccag
attccccagtacatcagccgctgctctgtgtgtgaggcaccctcgcaagccattgctgtg
cacagccaggacatcaccatcccgcagtgccccctgggctggcgcagcctctggattggg
tactctttcctcatgcacactgccgctggtgccgagggtggaggccagtccctggtctca
cctggctcctgcctagaggactttcgggccactcctttcatcgaatgcagtggtgcccga
ggcacctgccactactttgcaaacaagtacagtttctggttgaccacagtggaggagagg
cagcagtttggggagttgcctgtgtctgaaacgctgaaagctgggcagctccacactcga
gtcagtcgctgccaggtgtgtatgaaaagcctgtag

KEGG   Homo sapiens (human): 2192
Entry
2192              CDS       T01001                                 
Symbol
FBLN1, FBLN, FIBL1
Name
(RefSeq) fibulin 1
  KO
K17307  fibulin 1/2
Organism
hsa  Homo sapiens (human)
Pathway
hsa04820  Cytoskeleton in muscle cells
Network
nt06539  Cytoskeleton in muscle cells
  Element
N01814  Extracellular matrix - Basal lamina
Disease
H00459  Synpolydactyly
Brite
KEGG Orthology (KO) [BR:hsa00001]
 09140 Cellular Processes
  09142 Cell motility
   04820 Cytoskeleton in muscle cells
    2192 (FBLN1)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   04147 Exosome [BR:hsa04147]
    2192 (FBLN1)
Exosome [BR:hsa04147]
 Exosomal proteins
  Exosomal proteins of other cancer cells
   2192 (FBLN1)
SSDB
Motif
Pfam: EGF_CA cEGF Fibulin_C EGF_3 FXa_inhibition ANATO EGF hEGF
Other DBs
NCBI-GeneID: 2192
NCBI-ProteinID: NP_006477
OMIM: 135820
HGNC: 3600
Ensembl: ENSG00000077942
UniProt: P23142 Q8NBH6
LinkDB
Position
22:45502883..45601135
AA seq 703 aa
MERAAPSRRVPLPLLLLGGLALLAAGVDADVLLEACCADGHRMATHQKDCSLPYATESKE
CRMVQEQCCHSQLEELHCATGISLANEQDRCATPHGDNASLEATFVKRCCHCCLLGRAAQ
AQGQSCEYSLMVGYQCGQVFQACCVKSQETGDLDVGGLQETDKIIEVEEEQEDPYLNDRC
RGGGPCKQQCRDTGDEVVCSCFVGYQLLSDGVSCEDVNECITGSHSCRLGESCINTVGSF
RCQRDSSCGTGYELTEDNSCKDIDECESGIHNCLPDFICQNTLGSFRCRPKLQCKSGFIQ
DALGNCIDINECLSISAPCPIGHTCINTEGSYTCQKNVPNCGRGYHLNEEGTRCVDVDEC
APPAEPCGKGHRCVNSPGSFRCECKTGYYFDGISRMCVDVNECQRYPGRLCGHKCENTLG
SYLCSCSVGFRLSVDGRSCEDINECSSSPCSQECANVYGSYQCYCRRGYQLSDVDGVTCE
DIDECALPTGGHICSYRCINIPGSFQCSCPSSGYRLAPNGRNCQDIDECVTGIHNCSINE
TCFNIQGGFRCLAFECPENYRRSAATLQQEKTDTVRCIKSCRPNDVTCVFDPVHTISHTV
ISLPTFREFTRPEEIIFLRAITPPHPASQANIIFDITEGNLRDSFDIIKRYMDGMTVGVV
RQVRPIVGPFHAVLKLEMNYVVGGVVSHRNVVNVHIFVSEYWF
NT seq 2112 nt   +upstreamnt  +downstreamnt
atggagcgcgccgcgccgtcgcgccgggtcccgcttccgctgctgctgctcggcggcctt
gcgctgctggcggccggagtggacgcggatgtcctcctggaggcctgctgtgcggacgga
caccggatggccactcatcagaaggactgctcgctgccatatgctacggaatccaaagaa
tgcaggatggtgcaggagcagtgctgccacagccagctggaggagctgcactgtgccacg
ggcatcagcctggccaacgagcaggaccgctgtgccacgccccacggtgacaacgccagc
ctggaggccacatttgtgaagaggtgctgccattgctgtctgctggggagggcggcccag
gcccagggccagagctgcgagtacagcctcatggttggctaccagtgtggacaggtcttc
caggcatgctgtgtcaagagccaggagaccggagatttggatgtcgggggcctccaagaa
acggataagatcattgaggttgaggaggaacaagaggacccatatctgaatgaccgctgc
cgaggaggcgggccctgcaagcagcagtgccgagacacgggtgacgaggtggtctgctcc
tgcttcgtgggctaccagctgctgtctgatggtgtctcctgtgaagatgtcaatgaatgc
atcacgggcagccacagctgccggcttggagaatcctgcatcaacacagtgggctctttc
cgctgccagcgggacagcagctgcgggactggctatgagctcacagaggacaatagctgc
aaagatattgacgagtgtgagagtggtattcataactgcctccccgattttatctgtcag
aatactctgggatccttccgctgccgacccaagctacagtgcaagagtggctttatacaa
gatgctctaggcaactgtattgatatcaatgagtgtttgagtatcagtgccccgtgccct
atcgggcatacatgcatcaacacagagggctcctacacgtgccagaagaacgtgcccaac
tgtggccgtggctaccatctcaacgaggagggaacgcgctgtgttgatgtggacgagtgc
gcgccacctgctgagccctgtgggaagggacatcgctgcgtgaactctcccggcagtttc
cgctgcgaatgcaagacgggttactattttgacggcatcagcaggatgtgtgtcgatgtc
aacgagtgccagcgctaccccgggcgcctgtgtggccacaagtgcgagaacacgctgggc
tcctacctctgcagctgttccgtgggcttccggctctctgtggatggcaggtcatgtgaa
gacatcaatgagtgcagcagcagcccctgtagccaggagtgtgccaacgtctacggctcc
taccagtgttactgccggcgaggctaccagctcagcgatgtggatggagtcacctgtgaa
gacatcgacgagtgcgccctgcccaccgggggccacatctgctcctaccgctgcatcaac
atccctggaagcttccagtgcagctgcccctcgtctggctacaggctggcccccaatggc
cgcaactgccaagacattgatgagtgtgtgactggcatccacaactgctccatcaacgag
acctgcttcaacatccagggcggcttccgctgcctggccttcgagtgccctgagaactac
cgccgctccgcagccacgctccagcaggagaagacagacacggtccgctgcatcaagtcc
tgccgccccaacgatgtcacatgcgtgttcgaccccgtgcacaccatctcccacaccgtc
atctcgctgcctaccttccgcgagttcacccgccctgaagagatcatcttcctccgggcc
atcacgccaccgcatcctgccagccaggctaacatcatcttcgacatcacggaagggaac
ctgcgggactcttttgacatcatcaagcgttacatggacggcatgaccgtgggtgtcgtg
cgccaggtgcggcccatcgtgggcccatttcatgccgtcctgaagctggagatgaactat
gtggtcgggggcgtggtctcccaccgaaatgttgtcaacgtccacatcttcgtctctgag
tactggttctga

KEGG   Homo sapiens (human): 2199
Entry
2199              CDS       T01001                                 
Symbol
FBLN2
Name
(RefSeq) fibulin 2
  KO
K17307  fibulin 1/2
Organism
hsa  Homo sapiens (human)
Pathway
hsa04820  Cytoskeleton in muscle cells
Network
nt06539  Cytoskeleton in muscle cells
  Element
N01814  Extracellular matrix - Basal lamina
Brite
KEGG Orthology (KO) [BR:hsa00001]
 09140 Cellular Processes
  09142 Cell motility
   04820 Cytoskeleton in muscle cells
    2199 (FBLN2)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   04147 Exosome [BR:hsa04147]
    2199 (FBLN2)
Exosome [BR:hsa04147]
 Exosomal proteins
  Exosomal proteins of other cancer cells
   2199 (FBLN2)
SSDB
Motif
Pfam: EGF_CA cEGF Fibulin_C FXa_inhibition EGF_3 ANATO EGF hEGF
Other DBs
NCBI-GeneID: 2199
NCBI-ProteinID: NP_001989
OMIM: 135821
HGNC: 3601
Ensembl: ENSG00000163520
UniProt: P98095 Q9Y3V7 Q86V58
LinkDB
Position
3:13549125..13638404
AA seq 1184 aa
MVLLWEPAGAWLALGLALALGPSVAAAAPRQDCTGVECPPLENCIEEALEPGACCATCVQ
QGCACEGYQYYDCLQGGFVRGRVPAGQSYFVDFGSTECSCPPGGGKISCQFMLCPELPPN
CIEAVVVADSCPQCGQVGCVHAGHKYAAGHTVHLPPCRACHCPDAGGELICYQLPGCHGN
FSDAEEGDPERHYEDPYSYDQEVAEVEAATALGGEVQAGAVQAGAGGPPAALGGGSQPLS
TIQAPPWPAVLPRPTAAAALGPPAPVQAKARRVTEDSEEEEEEEEEREEMAVTEQLAAGG
HRGLDGLPTTAPAGPSLPIQEERAEAGARAEAGARPEENLILDAQATSRSTGPEGVTHAP
SLGKAALVPTQAVPGSPRDPVKPSPHNILSTSLPDAAWIPPTREVPRKPQVLPHSHVEED
TDPNSVHSIPRSSPEGSTKDLIETCCAAGQQWAIDNDECLEIPESGTEDNVCRTAQRHCC
VSYLQEKSCMAGVLGAKEGETCGAEDNDSCGISLYKQCCDCCGLGLRVRAEGQSCESNPN
LGYPCNHVMLSCCEGEEPLIVPEVRRPPEPAAAPRRVSEAEMAGREALSLGTEAELPNSL
PGDDQDECLLLPGELCQHLCINTVGSYHCACFPGFSLQDDGRTCRPEGHPPQPEAPQEPA
LKSEFSQVASNTIPLPLPQPNTCKDNGPCKQVCSTVGGSAICSCFPGYAIMADGVSCEDI
NECVTDLHTCSRGEHCVNTLGSFHCYKALTCEPGYALKDGECEDVDECAMGTHTCQPGFL
CQNTKGSFYCQARQRCMDGFLQDPEGNCVDINECTSLSEPCRPGFSCINTVGSYTCQRNP
LICARGYHASDDGTKCVDVNECETGVHRCGEGQVCHNLPGSYRCDCKAGFQRDAFGRGCI
DVNECWASPGRLCQHTCENTLGSYRCSCASGFLLAADGKRCEDVNECEAQRCSQECANIY
GSYQCYCRQGYQLAEDGHTCTDIDECAQGAGILCTFRCLNVPGSYQCACPEQGYTMTANG
RSCKDVDECALGTHNCSEAETCHNIQGSFRCLRFECPPNYVQVSKTKCERTTCHDFLECQ
NSPARITHYQLNFQTGLLVPAHIFRIGPAPAFTGDTIALNIIKGNEEGYFGTRRLNAYTG
VVYLQRAVLEPRDFALDVEMKLWRQGSVTTFLAKMHIFFTTFAL
NT seq 3555 nt   +upstreamnt  +downstreamnt
atggtgctgctctgggagcctgcaggagcctggcttgctctgggcctggccctggccctg
ggccccagcgtggccgcagctgcccctcggcaggactgcacgggcgtggagtgcccgccg
ctggagaactgcattgaggaggcgctggagccgggtgcctgctgtgccacgtgtgtgcag
cagggctgcgcctgcgagggctaccagtactatgactgcctacagggtggcttcgtgcgc
ggccgcgtgcccgccggtcagtcctattttgtggacttcgggagcactgagtgctcctgc
ccaccaggcggcggcaagatcagctgccagttcatgctgtgcccggagctgccgcccaac
tgcatcgaggctgtagtggtggctgacagctgcccacagtgcggccaggtgggctgcgtc
cacgcgggccacaagtacgccgctggccacactgttcacctgccgccctgccgggcctgc
cactgccctgacgccggtggagagctcatctgctaccagctccccggttgccacgggaac
ttctcagatgccgaggagggtgaccccgagcgacactacgaagacccctacagctatgac
caggaggtggccgaggtggaagcagcaacagccctggggggtgaggtccaggcgggtgca
gtccaggcaggcgcagggggccccccagctgctctgggaggtgggagtcagccactgtcc
accatccaggcacccccctggccagctgtcctccccaggcccacagcggctgctgccctg
ggtcccccagccccagtgcaggccaaagctaggagagtgaccgaggacagtgaggaggaa
gaagaggaggaggaggagagagaggaaatggctgtcactgagcagctggcagcaggtggc
cacagggggctggatgggctgcccactacagccccagctggacccagtcttcctatccag
gaggagagggcagaagctggggcaagggcagaagctggggcaaggcctgaagagaacctc
atcctggatgcccaagccacgtcccgcagcactgggccggagggcgtgacgcatgcaccg
agcctgggcaaggctgctctcgtcccaactcaggccgtgcctggctctcccagggaccca
gtcaagcccagcccccacaacatcctgtccacatcactgcctgatgcagcctggatccca
cccacccgagaagtgcccaggaagccgcaagttctgccccattcccacgtggaggaggac
acagaccccaactctgtccattctatccccagaagtagccctgaaggctccaccaaggac
ctgatcgagacttgctgcgcagccggacagcagtgggccattgacaatgacgagtgcctg
gagatccctgagagtggcactgaggacaacgtctgcaggacagcccagaggcactgctgt
gtctcctacttgcaggagaagagctgcatggccggcgtcctgggagccaaggagggtgag
acctgtggggctgaggacaacgacagctgcggcatctccctgtacaagcaatgctgtgac
tgctgtggcctgggcctccgcgtgcgggccgagggccagtcgtgtgagtccaatcctaac
ctgggctatccctgcaatcatgtcatgctctcctgctgtgagggtgaagagcctctcata
gtacctgaggttcgccgacctccagagcccgcagctgcaccacggagagtttcagaggca
gagatggcgggccgagaggccctgtcactgggcacagaggccgagctgccgaacagcctg
ccgggcgatgaccaggatgagtgccttctcctcccgggagagctgtgccagcacctttgc
atcaatactgtgggttcttaccactgtgcctgctttcctggcttctcactgcaggacgat
ggccgcacttgccgcccagagggtcaccctccacagccggaagccccacaggagcctgca
ctgaagtcagaattttcccaggtggcctctaacaccatcccgctgccactgccgcagccc
aatacctgcaaagacaatggaccctgcaagcaggtgtgcagcactgttgggggctcagcc
atatgctcctgttttcccggctatgccatcatggcggatggcgtgtcctgtgaagacatc
aacgagtgtgtgacggacctgcacacgtgcagccggggcgagcactgtgtgaacacactg
ggctccttccactgctacaaggcactcacctgtgagccaggctatgccctcaaggatggc
gagtgcgaagacgtggatgagtgtgcgatgggcacgcacacctgccagccgggcttcttg
tgccagaacaccaagggctccttctactgccaggccaggcagcgctgcatggatggcttc
ctgcaggatcctgaaggcaactgtgtggacatcaacgagtgcacgtcactgtccgagcca
tgtcggccaggcttcagctgcatcaacacggtgggctcctacacatgccagaggaacccg
ctgatctgcgcgcgcggctaccacgccagcgatgatgggaccaagtgtgtggacgtgaat
gagtgtgagacaggtgtgcaccgctgcggtgagggccaagtgtgccacaacctccctggc
tcctaccgctgtgactgcaaagccggctttcagcgggatgcctttggccggggctgcatc
gacgtgaatgagtgctgggcctcgccaggccgcctgtgccagcacacgtgtgagaacaca
ctcggctcctaccgctgttcctgcgcctccgggttcctgctagcagcggacggcaagcgc
tgtgaagacgtgaatgagtgtgaggcccagcgctgcagccaggagtgtgccaacatctat
ggctcctaccagtgctactgccgccagggctaccagctggctgaggatgggcacacctgc
acagacatcgacgagtgtgctcaaggcgccggcatcctctgcaccttccgctgtctcaac
gtgccagggagctaccagtgtgcatgccctgagcagggctacaccatgacggccaacggg
aggtcctgcaaggacgtggatgagtgtgcactgggtacccacaactgttccgaggctgag
acctgccacaacatccagggtagcttccgctgcctgcgcttcgagtgtcctcccaactat
gtccaagtctccaaaacgaagtgcgagcgcaccacgtgccatgacttcctggagtgccag
aactcgccagcgcgcatcacgcactaccagctcaacttccagacgggcctcctggtgcct
gcgcatatcttccgcattggccccgcgccagccttcacgggggacaccatcgccctgaac
atcatcaagggcaatgaggagggctactttggcacgcgcaggctcaatgcctacacgggt
gtggtctacctgcagcgggccgtgctggagccccgggactttgccctggacgtggagatg
aagctctggaggcagggctccgtcaccaccttcctggccaagatgcacatcttcttcacc
acctttgccctgtga

KEGG   Homo sapiens (human): 3339
Entry
3339              CDS       T01001                                 
Symbol
HSPG2, HSPG, PLC, PRCAN, SJA, SJS, SJS1
Name
(RefSeq) heparan sulfate proteoglycan 2
  KO
K06255  basement membrane-specific heparan sulfate proteoglycan core protein
Organism
hsa  Homo sapiens (human)
Pathway
hsa04512  ECM-receptor interaction
hsa04820  Cytoskeleton in muscle cells
hsa05161  Hepatitis B
hsa05205  Proteoglycans in cancer
Network
nt06539  Cytoskeleton in muscle cells
  Element
N01814  Extracellular matrix - Basal lamina
Disease
H00493  Heparan sulfate proteoglycan gene defects
H01777  Schwartz-Jampel syndrome
H02155  Dyssegmental dysplasia
Brite
KEGG Orthology (KO) [BR:hsa00001]
 09130 Environmental Information Processing
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    3339 (HSPG2)
 09140 Cellular Processes
  09142 Cell motility
   04820 Cytoskeleton in muscle cells
    3339 (HSPG2)
 09160 Human Diseases
  09161 Cancer: overview
   05205 Proteoglycans in cancer
    3339 (HSPG2)
  09172 Infectious disease: viral
   05161 Hepatitis B
    3339 (HSPG2)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   04147 Exosome [BR:hsa04147]
    3339 (HSPG2)
   00535 Proteoglycans [BR:hsa00535]
    3339 (HSPG2)
Exosome [BR:hsa04147]
 Exosomal proteins
  Exosomal proteins of other body fluids (saliva and urine)
   3339 (HSPG2)
Proteoglycans [BR:hsa00535]
 Extracellular matrix (ECM) proteoglycans
  Basement membrane proteoglycans
   3339 (HSPG2)
SSDB
Motif
Pfam: I-set Ig_3 Ig_2 ig Laminin_B V-set Laminin_G_1 Laminin_G_2 Laminin_EGF C2-set_2 Ldl_recept_a Ig_6 Laminin_G_3 C2-set EGF hEGF
Other DBs
NCBI-GeneID: 3339
NCBI-ProteinID: NP_005520
OMIM: 142461
HGNC: 5273
Ensembl: ENSG00000142798
UniProt: P98160
Structure
LinkDB
Position
1:complement(21822244..21937310)
AA seq 4391 aa
MGWRAAGALLLALLLHGRLLAVTHGLRAYDGLSLPEDIETVTASQMRWTHSYLSDDEDML
ADSISGDDLGSGDLGSGDFQMVYFRALVNFTRSIEYSPQLEDAGSREFREVSEAVVDTLE
SEYLKIPGDQVVSVVFIKELDGWVFVELDVGSEGNADGAQIQEMLLRVISSGSVASYVTS
PQGFQFRRLGTVPQFPRACTEAEFACHSYNECVALEYRCDRRPDCRDMSDELNCEEPVLG
ISPTFSLLVETTSLPPRPETTIMRQPPVTHAPQPLLPGSVRPLPCGPQEAACRNGHCIPR
DYLCDGQEDCEDGSDELDCGPPPPCEPNEFPCGNGHCALKLWRCDGDFDCEDRTDEANCP
TKRPEEVCGPTQFRCVSTNMCIPASFHCDEESDCPDRSDEFGCMPPQVVTPPRESIQASR
GQTVTFTCVAIGVPTPIINWRLNWGHIPSHPRVTVTSEGGRGTLIIRDVKESDQGAYTCE
AMNARGMVFGIPDGVLELVPQRGPCPDGHFYLEHSAACLPCFCFGITSVCQSTRRFRDQI
RLRFDQPDDFKGVNVTMPAQPGTPPLSSTQLQIDPSLHEFQLVDLSRRFLVHDSFWALPE
QFLGNKVDSYGGSLRYNVRYELARGMLEPVQRPDVVLMGAGYRLLSRGHTPTQPGALNQR
QVQFSEEHWVHESGRPVQRAELLQVLQSLEAVLIQTVYNTKMASVGLSDIAMDTTVTHAT
SHGRAHSVEECRCPIGYSGLSCESCDAHFTRVPGGPYLGTCSGCNCNGHASSCDPVYGHC
LNCQHNTEGPQCNKCKAGFFGDAMKATATSCRPCPCPYIDASRRFSDTCFLDTDGQATCD
ACAPGYTGRRCESCAPGYEGNPIQPGGKCRPVNQEIVRCDERGSMGTSGEACRCKNNVVG
RLCNECADGSFHLSTRNPDGCLKCFCMGVSRHCTSSSWSRAQLHGASEEPGHFSLTNAAS
THTTNEGIFSPTPGELGFSSFHRLLSGPYFWSLPSRFLGDKVTSYGGELRFTVTQRSQPG
STPLHGQPLVVLQGNNIILEHHVAQEPSPGQPSTFIVPFREQAWQRPDGQPATREHLLMA
LAGIDTLLIRASYAQQPAESRVSGISMDVAVPEETGQDPALEVEQCSCPPGYRGPSCQDC
DTGYTRTPSGLYLGTCERCSCHGHSEACEPETGACQGCQHHTEGPRCEQCQPGYYGDAQR
GTPQDCQLCPCYGDPAAGQAAHTCFLDTDGHPTCDACSPGHSGRHCERCAPGYYGNPSQG
QPCQRDSQVPGPIGCNCDPQGSVSSQCDAAGQCQCKAQVEGLTCSHCRPHHFHLSASNPD
GCLPCFCMGITQQCASSAYTRHLISTHFAPGDFQGFALVNPQRNSRLTGEFTVEPVPEGA
QLSFGNFAQLGHESFYWQLPETYQGDKVAAYGGKLRYTLSYTAGPQGSPLSDPDVQITGN
NIMLVASQPALQGPERRSYEIMFREEFWRRPDGQPATREHLLMALADLDELLIRATFSSV
PLAASISAVSLEVAQPGPSNRPRALEVEECRCPPGYIGLSCQDCAPGYTRTGSGLYLGHC
ELCECNGHSDLCHPETGACSQCQHNAAGEFCELCAPGYYGDATAGTPEDCQPCACPLTNP
ENMFSRTCESLGAGGYRCTACEPGYTGQYCEQCGPGYVGNPSVQGGQCLPETNQAPLVVE
VHPARSIVPQGGSHSLRCQVSGSPPHYFYWSREDGRPVPSGTQQRHQGSELHFPSVQPSD
AGVYICTCRNLHQSNTSRAELLVTEAPSKPITVTVEEQRSQSVRPGADVTFICTAKSKSP
AYTLVWTRLHNGKLPTRAMDFNGILTIRNVQLSDAGTYVCTGSNMFAMDQGTATLHVQAS
GTLSAPVVSIHPPQLTVQPGQLAEFRCSATGSPTPTLEWTGGPGGQLPAKAQIHGGILRL
PAVEPTDQAQYLCRAHSSAGQQVARAVLHVHGGGGPRVQVSPERTQVHAGRTVRLYCRAA
GVPSATITWRKEGGSLPPQARSERTDIATLLIPAITTADAGFYLCVATSPAGTAQARIQV
VVLSASDASPPPVKIESSSPSVTEGQTLDLNCVVAGSAHAQVTWYRRGGSLPPHTQVHGS
RLRLPQVSPADSGEYVCRVENGSGPKEASITVSVLHGTHSGPSYTPVPGSTRPIRIEPSS
SHVAEGQTLDLNCVVPGQAHAQVTWHKRGGSLPARHQTHGSLLRLHQVTPADSGEYVCHV
VGTSGPLEASVLVTIEASVIPGPIPPVRIESSSSTVAEGQTLDLSCVVAGQAHAQVTWYK
RGGSLPARHQVRGSRLYIFQASPADAGQYVCRASNGMEASITVTVTGTQGANLAYPAGST
QPIRIEPSSSQVAEGQTLDLNCVVPGQSHAQVTWHKRGGSLPVRHQTHGSLLRLYQASPA
DSGEYVCRVLGSSVPLEASVLVTIEPAGSVPALGVTPTVRIESSSSQVAEGQTLDLNCLV
AGQAHAQVTWHKRGGSLPARHQVHGSRLRLLQVTPADSGEYVCRVVGSSGTQEASVLVTI
QQRLSGSHSQGVAYPVRIESSSASLANGHTLDLNCLVASQAPHTITWYKRGGSLPSRHQI
VGSRLRIPQVTPADSGEYVCHVSNGAGSRETSLIVTIQGSGSSHVPSVSPPIRIESSSPT
VVEGQTLDLNCVVARQPQAIITWYKRGGSLPSRHQTHGSHLRLHQMSVADSGEYVCRANN
NIDALEASIVISVSPSAGSPSAPGSSMPIRIESSSSHVAEGETLDLNCVVPGQAHAQVTW
HKRGGSLPSHHQTRGSRLRLHHVSPADSGEYVCRVMGSSGPLEASVLVTIEASGSSAVHV
PAPGGAPPIRIEPSSSRVAEGQTLDLKCVVPGQAHAQVTWHKRGGNLPARHQVHGPLLRL
NQVSPADSGEYSCQVTGSSGTLEASVLVTIEPSSPGPIPAPGLAQPIYIEASSSHVTEGQ
TLDLNCVVPGQAHAQVTWYKRGGSLPARHQTHGSQLRLHLVSPADSGEYVCRAASGPGPE
QEASFTVTVPPSEGSSYRLRSPVISIDPPSSTVQQGQDASFKCLIHDGAAPISLEWKTRN
QELEDNVHISPNGSIITIVGTRPSNHGTYRCVASNAYGVAQSVVNLSVHGPPTVSVLPEG
PVWVKVGKAVTLECVSAGEPRSSARWTRISSTPAKLEQRTYGLMDSHAVLQISSAKPSDA
GTYVCLAQNALGTAQKQVEVIVDTGAMAPGAPQVQAEEAELTVEAGHTATLRCSATGSPA
PTIHWSKLRSPLPWQHRLEGDTLIIPRVAQQDSGQYICNATSPAGHAEATIILHVESPPY
ATTVPEHASVQAGETVQLQCLAHGTPPLTFQWSRVGSSLPGRATARNELLHFERAAPEDS
GRYRCRVTNKVGSAEAFAQLLVQGPPGSLPATSIPAGSTPTVQVTPQLETKSIGASVEFH
CAVPSDRGTQLRWFKEGGQLPPGHSVQDGVLRIQNLDQSCQGTYICQAHGPWGKAQASAQ
LVIQALPSVLINIRTSVQTVVVGHAVEFECLALGDPKPQVTWSKVGGHLRPGIVQSGGVV
RIAHVELADAGQYRCTATNAAGTTQSHVLLLVQALPQISMPQEVRVPAGSAAVFPCIASG
YPTPDISWSKLDGSLPPDSRLENNMLMLPSVRPQDAGTYVCTATNRQGKVKAFAHLQVPE
RVVPYFTQTPYSFLPLPTIKDAYRKFEIKITFRPDSADGMLLYNGQKRVPGSPTNLANRQ
PDFISFGLVGGRPEFRFDAGSGMATIRHPTPLALGHFHTVTLLRSLTQGSLIVGDLAPVN
GTSQGKFQGLDLNEELYLGGYPDYGAIPKAGLSSGFIGCVRELRIQGEEIVFHDLNLTAH
GISHCPTCRDRPCQNGGQCHDSESSSYVCVCPAGFTGSRCEHSQALHCHPEACGPDATCV
NRPDGRGYTCRCHLGRSGLRCEEGVTVTTPSLSGAGSYLALPALTNTHHELRLDVEFKPL
APDGVLLFSGGKSGPVEDFVSLAMVGGHLEFRYELGSGLAVLRSAEPLALGRWHRVSAER
LNKDGSLRVNGGRPVLRSSPGKSQGLNLHTLLYLGGVEPSVPLSPATNMSAHFRGCVGEV
SVNGKRLDLTYSFLGSQGIGQCYDSSPCERQPCQHGATCMPAGEYEFQCLCRDGFKGDLC
EHEENPCQLREPCLHGGTCQGTRCLCLPGFSGPRCQQGSGHGIAESDWHLEGSGGNDAPG
QYGAYFHDDGFLAFPGHVFSRSLPEVPETIELEVRTSTASGLLLWQGVEVGEAGQGKDFI
SLGLQDGHLVFRYQLGSGEARLVSEDPINDGEWHRVTALREGRRGSIQVDGEELVSGRSP
GPNVAVNAKGSVYIGGAPDVATLTGGRFSSGITGCVKNLVLHSARPGAPPPQPLDLQHRA
QAGANTRPCPS
NT seq 13176 nt   +upstreamnt  +downstreamnt
atggggtggcgggcggcgggcgcgctgctgctggcgctgctgctgcacgggcggctgctg
gcggtgacccatgggctgagggcatacgatggcttgtctctgcctgaggacatagagacc
gtcacagcaagccaaatgcgctggacacattcgtacctttctgatgatgaggacatgctg
gctgacagcatctcaggagacgacctgggcagtggggacctgggcagcggggacttccag
atggtttatttccgagccctggtgaatttcactcgctccatcgagtacagccctcagctg
gaggatgcaggctccagagagttccgagaggtgtccgaggctgtggtagacacgctggag
tcggagtacttgaaaattcccggagaccaggttgtcagtgtggtgttcatcaaggagctg
gatggctgggtttttgtggagctggatgtgggctcggaagggaatgcggatggggctcag
attcaggagatgctgctcagggtcatctccagcggctctgtggcctcctacgtcacctct
ccccagggattccagttccgacgcctgggcacagtgccccagttcccaagagcctgcacg
gaggccgagtttgcctgccacagctacaatgagtgtgtggccctggagtatcgctgtgac
cggcggcccgactgcagggacatgtctgatgagctcaattgtgaggagccagtcctgggt
atcagccccacattctctctccttgtggagacgacatctttaccgccccggccagagaca
accatcatgcgacagccaccagtcacccacgctcctcagcccctgcttcccggttccgtc
aggcccctgccctgtgggccccaggaggccgcatgccgcaatgggcactgcatccccaga
gactacctctgcgacggacaggaggactgcgaggacggcagcgatgagctagactgtggc
cccccgccaccctgtgagcccaacgagttcccctgcgggaatggacattgtgccctcaag
ctgtggcgctgcgatggtgactttgactgtgaggaccgaactgatgaagccaactgcccc
accaagcgtcctgaggaagtgtgcgggcccacacagttccgatgcgtctctaccaacatg
tgcatcccagccagcttccactgtgacgaggagagcgactgtcctgaccggagcgacgag
tttggctgcatgcccccccaggtggtgacacctccccgggagtccatccaggcttcccgg
ggccagacagtgaccttcacctgcgtggccattggcgtccccacccccatcatcaattgg
aggctcaactggggccacatcccctctcatcccagggtgacagtgaccagcgagggtggc
cgtggcacactgatcatccgtgatgtgaaggagtcagaccagggtgcctacacctgtgag
gccatgaacgcccggggcatggtgtttggcattcctgacggtgtccttgagctcgtccca
caacgaggcccctgccctgacggccacttctacctggagcacagcgccgcctgcctgccc
tgcttctgctttggcatcaccagcgtgtgccagagcacccgccgcttccgggaccagatc
aggctgcgctttgaccaacccgatgacttcaagggtgtgaatgtgacaatgcctgcgcag
cccggcacgccacccctctcctccacgcagctgcagatcgacccatccctgcacgagttc
cagctagtcgacctgtcccgccgcttcctcgtccacgactccttctgggctctgcctgaa
cagttcctgggcaacaaggtggactcctatggcggctccctgcgttacaacgtgcgctac
gagttggcccgtggcatgctggagccagtgcagcggccggacgtggtcctcatgggtgcc
gggtaccgcctcctctcccgaggccacacacccacccaacctggtgctctgaaccagcgc
caggtccagttctctgaggagcactgggtccatgagtctggccggccggtgcagcgcgcg
gagctgctgcaggtgctgcagagcctggaggccgtgctcatccagaccgtgtacaacacc
aagatggccagcgtgggacttagcgacatcgccatggataccaccgtcacccatgccacc
agccatggccgtgcccacagtgtggaggagtgcagatgccccattggctattctggcttg
tcctgcgagagctgtgatgcccacttcactcgggtgcctggtgggccctacctgggcacc
tgctctggttgcaattgcaatggccatgccagctcctgtgaccctgtgtatggccactgc
ctgaattgccagcacaacacggaggggccacagtgcaacaagtgcaaggctggcttcttt
ggggacgccatgaaggccacggccacttcctgccggccctgcccttgcccatacatcgat
gcctcccgcagattctcagacacttgcttcctggacacggatggccaagccacatgtgac
gcctgtgccccaggctacactggccgccgctgtgagagctgtgcccccggatacgagggc
aaccccatccagcccggcgggaagtgcaggcccgtcaaccaggagattgtgcgctgtgac
gagcgtggcagcatggggacctccggggaggcctgccgctgtaagaacaatgtggtgggg
cgcttgtgcaatgaatgtgctgacggctctttccacctgagtacccgaaaccccgatggc
tgcctcaagtgcttctgcatgggtgtcagtcgccactgcaccagctcttcatggagccgt
gcccagttgcatggggcctctgaggagcctggtcacttcagcctgaccaacgccgcaagc
acccacaccaccaacgagggcatcttctcccccacgcccggggaactgggattctcctcc
ttccacagactcttatctggaccctacttctggagcctcccttcacgcttcctgggggac
aaggtgacctcctatggaggagagctgcgcttcacagtgacccagaggtcccagccgggc
tccacacccctgcacgggcagccgttggtggtgctgcaaggtaacaacatcatcctagag
caccatgtggcccaggagcccagccccggccagcccagcaccttcattgtgcctttccgg
gagcaagcatggcagcggcccgatgggcagccagccacacgggagcacctgctgatggca
ctggcaggcatcgacaccctcctgatccgagcatcctacgcccagcagcccgctgagagc
agggtctctggcatcagcatggacgtggctgtgcccgaggaaaccggccaggaccccgcg
ctggaagtggaacagtgctcctgcccacccgggtaccgtgggccgtcctgccaggactgt
gacacaggctacacacgcacgcccagtggcctctacctgggtacctgtgaacgctgcagc
tgccatggccactcagaggcctgcgagccagaaacaggtgcctgccagggctgccagcat
cacacggagggccctcggtgtgagcagtgccagccaggatactacggggacgcccagcgg
gggacaccacaggactgccagctgtgcccctgctacggagaccctgctgccggccaggct
gcccacacttgttttctggacacagacggccaccccacctgtgatgcgtgctccccaggc
cacagtgggcgtcactgtgagaggtgcgcccctggctactatggcaaccccagccagggc
cagccatgccagagagacagccaggtgccagggcccataggctgcaactgtgacccccaa
ggcagcgtcagcagccagtgtgatgctgctggtcagtgccagtgcaaggcccaggtggaa
ggcctcacttgcagccactgccggccccaccacttccacctgagtgccagcaacccagac
ggctgcctgccctgcttctgtatgggcatcacccagcagtgcgccagctctgcctacaca
cgccacctgatctccacccactttgcccctggggacttccaaggctttgccctggtgaac
ccacagcgaaacagccgcctgacaggagaattcactgtggaacccgtgcccgagggtgcc
cagctctcttttggcaactttgcccaactcggccatgagtccttctactggcagctgccg
gagacataccagggagacaaggtggcggcctacggtgggaagttgcgatacaccctctcc
tacacagcaggcccacagggcagcccactctctgaccccgatgtgcagatcacgggcaac
aacatcatgctagtggcctcccagccagcgctgcagggccctgagaggaggagctacgag
atcatgttccgagaggaattctggcgccggcccgatgggcagccggccacacgcgagcac
ctcctgatggcactggccgacctggatgagctcctgatccgggccacgttctcctccgtg
ccgctggcggccagcatcagcgcagtcagcctggaggtcgcccagccggggccctcaaac
agaccccgcgccctcgaggtggaggagtgccgctgcccgccaggctacatcggtctgtcc
tgccaggactgtgcccccggctacacgcgcaccgggagtgggctctacctcggccactgc
gagctatgtgaatgcaatggccactcagacctgtgccacccagagactggggcctgctcg
caatgccagcacaacgccgcaggggagttctgcgagctttgtgcccctggctactacgga
gatgccacagccgggacgcctgaggactgccagccctgtgcctgcccactgaccaaccca
gagaacatgttttcccgcacctgtgagagcctgggagccggcgggtaccgctgcacggcc
tgcgaacccggctacactggccagtactgtgagcagtgtggcccaggttacgtgggtaac
cccagtgtgcaagggggccagtgcctgccagagacaaaccaagccccactggtggtcgag
gtccatcctgctcgaagcatagtgccccaaggtggctcccactccctgcggtgtcaggtc
agtgggagcccaccccactacttctattggtcccgtgaggatgggcggcctgtgcccagc
ggcacccagcagcgacatcaaggctccgagctccacttccccagcgtccagccctcggat
gctggggtctacatttgcacctgccgtaatctccaccaatccaataccagccgggcagag
ctgctggtcactgaggctccaagcaagcccatcacagtgactgtggaggagcagcggagc
cagagcgtgcgccccggagctgacgtcaccttcatctgcacagccaaaagcaagtcccca
gcctataccctggtgtggacccgcctgcacaacgggaaactgcccacccgagccatggat
ttcaatggcatcctgaccattcgcaacgtccagctgagtgatgcaggcacctacgtgtgc
accggctccaacatgtttgccatggaccagggcacagccactctacatgtgcaggcctcg
ggcaccttgtccgcccccgtggtctccatccatccgccacagctcacagtgcagcccggg
caactggcggagttccgctgcagcgccacagggagccccacgcccaccctcgagtggaca
gggggccccggcggccagctccctgcgaaggcacaaatccacggcggcatcctgcgcctg
ccagctgtcgagcccacggatcaggcccagtacttgtgccgagcccacagcagcgctggg
cagcaggtggccagggctgtgctccacgtgcatgggggcggtgggcccagagtccaagtg
agcccagagaggacccaggtccacgcaggccgcaccgtcaggctgtactgcagggctgca
ggcgtgcctagcgccaccatcacctggaggaaggaagggggcagcctcccaccacaggcc
cggtcagagcgcacagacatcgcgacactgctcatcccagccatcacgactgctgacgcc
ggcttctacctctgcgtggccaccagccctgcaggcactgcccaggcccggatccaagtg
gttgtcctttcagcctcagatgccagcccaccgccggtcaagattgagtcctcatcgcct
tctgtgacagaagggcaaacactcgacctcaactgtgtggtggcagggtcagcccatgcc
caggtcacctggtacaggcgagggggtagcctgcctccccacacccaggtgcacggctcc
cgtctgcggctcccccaggtctcaccagctgattctggagaatatgtgtgccgtgtggag
aatggatcgggccccaaggaggcctccattactgtgtctgtgctccacggcacccattct
ggccccagctacaccccagtgcccggcagcacccggcccatccgcatcgagccctcctcc
tcacacgtggcggaagggcagaccctggatctgaactgcgtggtgcccgggcaggcccac
gcccaggtcacgtggcacaagcgtgggggcagcctccctgcccggcaccagacccacggc
tcgctgctgcggctgcaccaggtgaccccggccgactcaggcgagtatgtgtgccatgtg
gtgggcacctccggccccctagaggcctcagtcctggtcaccatcgaagcctctgtcatc
cctggacccatcccacctgtcaggatcgagtcttcatcctccacagtggccgagggccag
accctggatctgagctgcgtggtggcagggcaggcccacgcccaggtcacatggtacaag
cgtgggggcagcctccctgcccggcaccaggttcgtggctcccgcctgtacatcttccag
gcctcacctgccgatgcgggacagtacgtctgccgggccagcaacggcatggaggcctcc
atcacggtcacagtaactgggacccagggggccaacttagcctaccctgccggcagcacc
cagcccatccgcatcgagccctcctcctcgcaagtggcggaagggcagaccctggatctg
aactgcgtggtgcccgggcagtcccatgcccaggtcacgtggcacaagcgtgggggcagc
ctccctgtccggcaccagacccacggctccctgctgagactctaccaagcgtcccccgcc
gactcgggcgagtacgtgtgccgagtgttgggcagctccgtgcctctagaggcctctgtc
ctggtcaccattgagcctgcgggctcagtgcctgcacttggggtcacccccacggtccgg
atcgagtcatcgtcttcgcaagtggccgaggggcagaccctggacctgaactgcctcgtt
gctggtcaggcccatgcccaggtcacgtggcacaagcgcgggggcagcctcccggcccgg
caccaggtgcatggctcgaggctacgcctgctccaggtgaccccagctgattcaggggag
tacgtgtgccgtgtggtcggcagctcaggtacccaggaagcctcagtccttgtcaccatc
cagcagcgccttagtggctcccactcccagggtgtggcgtaccccgtccgcatcgagtcc
tcctcagcctccctggccaatggacacaccctggacctcaactgcctggttgccagccag
gctccccacaccatcacctggtataagcgtggaggcagcttacccagccggcaccagatc
gtgggctcccggctgcggatccctcaggtgactccggcagactcgggcgagtacgtgtgt
cacgtcagtaacggtgcaggctcccgggagacctcgctcatcgtcaccatccagggcagc
ggttcctcccacgtgcccagcgtctccccaccgatcaggatcgagtcgtcttcccccacg
gtggtggaagggcagaccttggatctgaactgcgtggtcgccaggcagccccaggctatc
atcacatggtacaagcgtgggggcagccttccctcccgacaccagacccatggctcccac
ctgcggttgcaccaaatgtctgtggctgactcgggcgagtatgtgtgccgggccaacaac
aacatcgatgccctggaggcctccatcgtcatctccgtctcccctagcgccggcagcccc
tccgcccctggcagctccatgcccatcagaattgagtcatcctcctcacacgtggccgaa
ggggagaccctggatctgaactgcgtggtccccgggcaggcccatgcccaggtcacttgg
cacaagcgtgggggcagcctccccagtcaccatcagacccgcggctcacggctgcggctg
caccatgtgtccccggccgactcgggtgaatacgtgtgccgggtgatgggcagctctggc
cccctggaggcctcagtcctggtcaccatcgaagcctctggctcaagtgctgtccacgtc
cccgccccaggtggagccccacccatccgcatcgagccctcctcctcccgagtggcagaa
gggcagaccctggatctgaagtgcgtggtgcccgggcaggcccacgcccaggtcacgtgg
cacaagcgtggaggaaacctccctgcccggcaccaggtccacggcccactgctgaggctg
aaccaggtgtccccggctgactctggcgagtactcgtgccaagtgaccggaagctcaggc
accctggaggcatctgtcctggtcacaattgagccctccagcccaggacccattcctgct
ccaggactggcccagcccatctacatcgaggcctcctcttcacacgtgactgaagggcag
actctggatctgaactgtgtggtgcccgggcaggcccatgcccaggtcacgtggtacaag
cgcgggggcagcctccccgcccggcaccagacccatggctcccagctgcggctccacctc
gtctcccctgccgactcaggcgagtatgtgtgtcgtgcagccagcggcccaggccctgag
caagaagcctccttcacagtcaccgtcccgcccagtgaggggtcttcctaccgccttagg
agcccggtcatctccatcgacccgcccagcagcaccgtgcagcagggccaggatgccagc
ttcaagtgcctcatccatgacggggcagcccccatcagcctcgagtggaagacccggaac
caggagctggaggacaacgtccacatcagtcccaatggctccatcatcaccatcgtgggc
acccggcccagcaaccacggtacctaccgctgcgtggcctccaatgcctacggtgtggcc
cagagtgtggtgaacctcagtgtgcacgggccccctacagtgtccgtgctccccgagggc
cccgtgtgggtgaaagtgggaaaggctgtcaccctggagtgtgtcagtgccggggagccc
cgctcctctgctcgttggacccggatcagcagcacccctgccaagttggagcagcggaca
tatgggctcatggacagccacgcggtgctgcagatttcatcagctaaaccatcagatgcg
ggcacttatgtgtgccttgctcagaatgcactaggcacagcacagaagcaggtggaggtg
atcgtggacacgggcgccatggccccaggggcccctcaggtccaagctgaagaagctgag
ctgactgtggaggctggacacacggccaccttgcgctgctcagccacaggcagccccgcg
cccaccatccactggtccaagctgcgttccccactgccctggcagcaccggctggaaggt
gacacactcatcataccccgggtagcccagcaggactcgggccagtacatctgcaatgcc
actagccctgctgggcacgctgaggccaccatcatcctgcacgtggagagcccaccatat
gccaccacggtcccagagcacgcttcggtgcaggcaggggagacggtgcagctccagtgc
ctggctcacgggacacccccactcaccttccagtggagccgcgtgggcagcagccttcct
gggagggcgaccgccaggaacgagctgctgcactttgagcgtgcagcccctgaggactca
ggccgctaccgctgccgggtcaccaacaaggtgggctcagccgaggcctttgcccagctg
ctcgtccaaggccctcccggctctctccctgccacctccatcccagcagggtccacgccc
accgtgcaggtcacgcctcagctagagaccaagagcattggggccagcgttgagttccac
tgtgctgtgcccagcgaccggggtacccagctccgttggttcaaggaagggggtcagctg
cctccgggtcacagcgtgcaggatggggtgctccgaatccagaacttggaccagagctgc
caagggacgtatatatgccaggcccatggaccttgggggaaggcccaggccagtgcccag
ctggttatccaagccctgccctcggtgctcatcaacatccggacctctgtgcagaccgtg
gtggttggccacgccgtggagttcgaatgcctggcactgggtgaccccaagcctcaggtg
acatggagcaaagttggagggcacctgcggccaggcattgtgcagagcggaggtgtcgtc
aggatcgcccacgtagagctggctgatgcgggacagtatcgctgcactgccaccaacgca
gctggcaccacacaatcccacgtcctgctgcttgtgcaagccttgccccagatctcaatg
ccccaagaagtccgtgtgcctgctggttctgcagctgtcttcccctgcatagcctcaggc
taccccactcctgacatcagctggagcaagctggatggcagcctgccacctgacagccgc
ctggagaacaacatgctgatgctgccctcagtccgaccccaggacgcaggtacctacgtc
tgcaccgccactaaccgccagggcaaggtcaaagcctttgcccacctgcaggtgccagag
cgggtggtgccctacttcacgcagaccccctactccttcctaccgctgcccaccatcaag
gatgcctacaggaagttcgagatcaagatcaccttccggcccgactcagccgatgggatg
ctgctgtacaatgggcagaagcgagtcccagggagccccaccaacctggccaaccggcag
cccgacttcatctccttcggcctcgtggggggaaggcccgagttccggttcgatgcaggc
tcaggcatggccaccatccgccatcccacaccactggccctgggccatttccacaccgtg
accctgctgcgcagcctcacccagggctccctgattgtgggtgacctggccccggtcaat
gggacctcccagggcaagttccagggcctggatctgaacgaggaactctacctgggtggc
tatcctgactatggtgccatccccaaggcggggctgagcagcggcttcataggctgtgtc
cgggagctgcgcatccagggcgaggagatcgtcttccatgacctcaacctcacggcgcac
ggcatctcccactgccccacctgtcgggaccggccctgccagaatggcggtcagtgccat
gactctgagagcagcagctacgtgtgcgtctgcccagctggcttcaccgggagccgctgt
gagcactcgcaggccctgcactgccatccagaggcctgtgggcccgacgccacctgtgtg
aaccggcctgacggtcgaggctacacctgccgctgccacctgggccgctcggggttgcgg
tgtgaggaaggtgtgacagtgaccaccccctcgctgtcgggtgctggctcctacctggca
ctgcccgccctcaccaacacacaccacgagctacgcctggacgtggagttcaagccactc
gcccctgacggggtcctgctgttcagcggggggaagagcgggcctgtggaggacttcgtg
tccctggcgatggtgggcggccacctggagttccgctatgagttggggtcagggctggcc
gttctgcggagcgccgagccgctggccctgggccgctggcaccgtgtgtctgcagagcgt
ctcaacaaggacggcagcctgcgggtgaatggtggacgccctgtgctgcgctcctcgccc
ggcaagagccagggcctcaacctgcacaccctgctctacctggggggtgtggagccttcc
gtgccactgtccccggccaccaacatgagcgctcacttccgcggctgtgtgggcgaggtg
tcagtgaatggcaaacggctggacctcacctacagtttcctaggcagccagggcatcggg
caatgctatgatagctccccatgtgagcgccagccttgccaacatggtgccacgtgcatg
cccgctggcgagtatgagttccagtgcctgtgtcgagatggattcaaaggagacctgtgt
gagcacgaggagaacccctgccagctccgtgaaccctgtctgcatgggggcacctgccag
ggcacccgctgcctctgcctccctggcttctctggcccacgctgccaacaaggctctgga
catggcatagcagagtccgactggcatcttgaaggcagcgggggcaatgatgcccctggg
cagtacggagcctatttccacgatgatggcttcctcgccttccctggccatgtcttctcc
aggagcctgcccgaggtgcccgagaccatcgagctggaggttcggaccagcacagccagt
ggcctcctgctctggcagggtgtggaggtgggagaggccggccaaggcaaggacttcatc
agcctcgggcttcaagacgggcaccttgtcttcaggtaccagctgggtagtggggaggcc
cgcctggtctctgaggaccccatcaatgacggcgagtggcaccgggtgacagcactgcgg
gagggccgcagaggttccatccaagtcgacggtgaggagctggtcagcggccggtcccca
ggtcccaacgtggcagtcaacgccaagggcagcgtctacatcggcggagcccctgacgtg
gccacgctgaccgggggcagattctcctcaggcatcacaggctgtgtcaagaacctggtg
ctgcactcggcccgacccggcgccccgcccccacagcccctggacctgcagcaccgcgcc
caggccggggccaacacacgcccctgcccctcgtag

KEGG   Homo sapiens (human): 4811
Entry
4811              CDS       T01001                                 
Symbol
NID1, NID
Name
(RefSeq) nidogen 1
  KO
K06826  nidogen (entactin)
Organism
hsa  Homo sapiens (human)
Pathway
hsa04820  Cytoskeleton in muscle cells
Network
nt06539  Cytoskeleton in muscle cells
  Element
N01814  Extracellular matrix - Basal lamina
Brite
KEGG Orthology (KO) [BR:hsa00001]
 09140 Cellular Processes
  09142 Cell motility
   04820 Cytoskeleton in muscle cells
    4811 (NID1)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   04990 Domain-containing proteins not elsewhere classified [BR:hsa04990]
    4811 (NID1)
Domain-containing proteins not elsewhere classified [BR:hsa04990]
 EGF-like domain-containing proteins
  cEGF domain-containing proteins
   4811 (NID1)
SSDB
Motif
Pfam: G2F NIDO EGF_3 Ldl_recept_b Thyroglobulin_1 cEGF EGF FXa_inhibition EGF_CA EGF_MSP1_1 SGL Plasmod_Pvs28 hEGF
Other DBs
NCBI-GeneID: 4811
NCBI-ProteinID: NP_002499
OMIM: 131390
HGNC: 7821
Ensembl: ENSG00000116962
UniProt: P14543
LinkDB
Position
1:complement(235975830..236065090)
AA seq 1247 aa
MLASSSRIRAAWTRALLLPLLLAGPVGCLSRQELFPFGPGQGDLELEDGDDFVSPALELS
GALRFYDRSDIDAVYVTTNGIIATSEPPAKESHPGLFPPTFGAVAPFLADLDTTDGLGKV
YYREDLSPSITQRAAECVHRGFPEISFQPSSAVVVTWESVAPYQGPSRDPDQKGKRNTFQ
AVLASSDSSSYAIFLYPEDGLQFHTTFSKKENNQVPAVVAFSQGSVGFLWKSNGAYNIFA
NDRESVENLAKSSNSGQQGVWVFEIGSPATTNGVVPADVILGTEDGAEYDDEDEDYDLAT
TRLGLEDVGTTPFSYKALRRGGADTYSVPSVLSPRRAATERPLGPPTERTRSFQLAVETF
HQQHPQVIDVDEVEETGVVFSYNTDSRQTCANNRHQCSVHAECRDYATGFCCSCVAGYTG
NGRQCVAEGSPQRVNGKVKGRIFVGSSQVPIVFENTDLHSYVVMNHGRSYTAISTIPETV
GYSLLPLAPVGGIIGWMFAVEQDGFKNGFSITGGEFTRQAEVTFVGHPGNLVIKQRFSGI
DEHGHLTIDTELEGRVPQIPFGSSVHIEPYTELYHYSTSVITSSSTREYTVTEPERDGAS
PSRIYTYQWRQTITFQECVHDDSRPALPSTQQLSVDSVFVLYNQEEKILRYALSNSIGPV
REGSPDALQNPCYIGTHGCDTNAACRPGPRTQFTCECSIGFRGDGRTCYDIDECSEQPSV
CGSHTICNNHPGTFRCECVEGYQFSDEGTCVAVVDQRPINYCETGLHNCDIPQRAQCIYT
GGSSYTCSCLPGFSGDGQACQDVDECQPSRCHPDAFCYNTPGSFTCQCKPGYQGDGFRCV
PGEVEKTRCQHEREHILGAAGATDPQRPIPPGLFVPECDAHGHYAPTQCHGSTGYCWCVD
RDGREVEGTRTRPGMTPPCLSTVAPPIHQGPAVPTAVIPLPPGTHLLFAQTGKIERLPLE
GNTMRKTEAKAFLHVPAKVIIGLAFDCVDKMVYWTDITEPSIGRASLHGGEPTTIIRQDL
GSPEGIAVDHLGRNIFWTDSNLDRIEVAKLDGTQRRVLFETDLVNPRGIVTDSVRGNLYW
TDWNRDNPKIETSYMDGTNRRILVQDDLGLPNGLTFDAFSSQLCWVDAGTNRAECLNPSQ
PSRRKALEGLQYPFAVTSYGKNLYFTDWKMNSVVALDLAISKETDAFQPHKQTRLYGITT
ALSQCPQGHNYCSVNNGGCTHLCLATPGSRTCRCPDNTLGVDCIEQK
NT seq 3744 nt   +upstreamnt  +downstreamnt
atgttggcctcgagcagccggatccgggctgcgtggacgcgggcgctgctgctgccgctg
ctgctggcggggcctgtgggctgcctgagccgccaggagctctttcccttcggccccgga
cagggggacctggagctggaggacggggatgacttcgtctctcctgccctggagctgagt
ggggcgctccgcttctacgacagatccgacatcgacgcagtctacgtcaccacaaatggc
atcattgctacgagtgaacccccggccaaagaatcccatcccgggctcttcccaccaaca
ttcggtgcagtcgcccctttcctggcggacttggacacgaccgatggcctggggaaggtt
tattatcgagaagacttatccccctccatcactcagcgagcagcagagtgtgtccacaga
gggttcccggagatctctttccagcctagtagcgcggtggttgtcacttgggaatccgtg
gccccctaccaagggcccagcagggacccagaccagaaaggcaagagaaacacgttccag
gctgttctagcctcctctgattccagctcctatgccattttcctttatcctgaggatggt
ctgcagttccatacgacattctcaaagaaggaaaacaaccaagttcctgccgtggttgca
ttcagtcaaggttcagtgggattcttatggaagagcaacggagcttataacatatttgct
aatgacagggaatcagttgaaaatttggccaagagtagtaactctgggcagcagggtgtc
tgggtgtttgagattgggagtccagccaccaccaatggcgtggtgcctgcagacgtgatc
ctcggaactgaagatggggcagagtatgatgatgaggatgaagattatgacctggcgacc
actcgtctgggcctggaggatgtgggcaccacgcccttctcctacaaggctctgagaagg
ggaggtgctgacacatacagtgtgcccagcgtcctctccccgcgccgggcagctaccgaa
aggccccttggacctcccacagagagaaccaggtctttccagttggcagtggagactttt
caccagcagcaccctcaggtcatagatgtggatgaagttgaggaaacaggagttgttttc
agctataacacggattcccgccagacgtgtgctaacaacagacaccagtgctcggtgcac
gcagagtgcagggactacgccacgggcttctgctgcagctgtgtcgctggctatacgggc
aatggcaggcaatgtgttgcagaaggttccccccagcgagtcaatggcaaggtgaaagga
aggatctttgtggggagcagccaggtccccattgtctttgagaacactgacctccactct
tacgtagtaatgaaccacgggcgctcctacacagccatcagcaccattcccgagaccgtt
ggatattctctgcttccactggccccagttggaggcatcattggatggatgtttgcagtg
gagcaggacggattcaagaatgggttcagcatcaccgggggtgagttcactcgccaggct
gaggtgaccttcgtggggcacccgggcaatctggtcattaagcagcggttcagcggcatc
gatgagcatgggcacctgaccatcgacacggagctggagggccgcgtgccgcagattccg
ttcggctcctccgtgcacattgagccctacacggagctgtaccactactccacctcagtg
atcacttcctcctccacccgggagtacacggtgactgagcccgagcgagatggggcatct
ccttcacgcatctacacttaccagtggcgccagaccatcaccttccaggaatgcgtccac
gatgactcccggccagccctgcccagcacccagcagctctcggtggacagcgtgttcgtc
ctgtacaaccaggaggagaagatcttgcgctatgctctcagcaactccattgggcctgtg
agggaaggctcccctgatgctcttcagaatccctgctacatcggcactcatgggtgtgac
accaacgcggcctgtcgccctggtcccaggacacagttcacctgcgagtgctccatcggc
ttccgaggagacgggcgaacctgctatgatattgatgaatgttcagaacaaccctcagtg
tgtgggagccacacaatctgcaataatcacccaggaaccttccgctgcgagtgtgtggag
ggctaccagttttcagatgagggaacgtgtgtggctgtcgtggaccagcgccccatcaac
tactgtgaaactggccttcataactgcgacataccccagcgggcccagtgtatctacaca
ggaggctcctcctacacctgttcctgcttgccaggcttttctggggatggccaagcctgc
caagatgtagatgaatgccagccaagccgatgtcaccctgacgccttctgctacaacact
ccaggctctttcacgtgccagtgcaaacctggttatcagggagacggcttccgttgcgtg
cccggagaggtggagaaaacccggtgccagcacgagcgagaacacattctcggggcagcg
ggggcgacagacccacagcgacccattcctccggggctgttcgttcctgagtgcgatgcg
cacgggcactacgcgcccacccagtgccacggcagcaccggctactgctggtgcgtggat
cgcgacggccgcgaggtggagggcaccaggaccaggcccgggatgacgcccccgtgtctg
agtacagtggctcccccgattcaccaaggacctgcggtgcctaccgccgtgatccccttg
cctcctgggacccatttactctttgcccagactgggaagattgagcgcctgcccctggag
ggaaataccatgaggaagacagaagcaaaggcgttccttcatgtcccggctaaagtcatc
attggactggcctttgactgcgtggacaagatggtttactggacggacatcactgagcct
tccattgggagagctagtctacatggtggagagccaaccaccatcattagacaagatctt
ggaagtccagaaggtatcgctgttgatcaccttggccgcaacatcttctggacagactct
aacctggatcgaatagaagtggcgaagctggacggcacgcagcgccgggtgctctttgag
actgacttggtgaatcccagaggcattgtaacggattccgtgagagggaacctttactgg
acagactggaacagagataaccccaagattgaaacttcctacatggacggcacgaaccgg
aggatccttgtgcaggatgacctgggcttgcccaatggactgaccttcgatgcgttctca
tctcagctctgctgggtggatgcaggcaccaatcgggcggaatgcctgaaccccagtcag
cccagcagacgcaaggctctcgaagggctccagtatccttttgctgtgacgagctacggg
aagaatctgtatttcacagactggaagatgaattccgtggttgctctcgatcttgcaatt
tccaaggagacggatgctttccaaccccacaagcagacccggctgtatggcatcaccacg
gccctgtctcagtgtccgcaaggccataactactgctcagtgaacaatggcggctgcacc
cacctatgcttggccaccccagggagcaggacctgccgttgccctgacaacaccttggga
gttgactgtatcgaacagaaatga

DBGET integrated database retrieval system