KEGG   Mus musculus (house mouse): 12643
Entry
12643             CDS       T01002                                 
Symbol
Chad, SLRR4A
Name
(RefSeq) chondroadherin
  KO
K06248  chondroadherin
Organism
mmu  Mus musculus (house mouse)
Pathway
mmu04151  PI3K-Akt signaling pathway
mmu04510  Focal adhesion
mmu04512  ECM-receptor interaction
mmu05165  Human papillomavirus infection
Brite
KEGG Orthology (KO) [BR:mmu00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    12643 (Chad)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    12643 (Chad)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    12643 (Chad)
 09160 Human Diseases
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    12643 (Chad)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   00535 Proteoglycans [BR:mmu00535]
    12643 (Chad)
Proteoglycans [BR:mmu00535]
 Extracellular matrix (ECM) proteoglycans
  Small leucine-rich proteoglycan (SLRP) family
   class IV
    12643 (Chad)
Motif
Pfam: LRR_8 LRR_4 LRR_5 LRRNT LRR_14 LRR_9 LRRCT
Other DBs
NCBI-GeneID: 12643
NCBI-ProteinID: NP_031715
MGI: 1096866
Ensembl: ENSMUSG00000039084
UniProt: O55226 Q3TYW1
Position
11:94455873..94459953
AA seq 358 aa
MARALLFSLVFLAILLPALAACPQNCHCHGDLQHVICDKVGLQKIPKVSETTKLLNLQRN
NFPVLAANSFRTMPNLVSLHLQHCNIREVAAGAFRGLKQLIYLYLSHNDIRVLRAGAFDD
LTELTYLYLDHNKVSELPRGLLSPLVNLFILQLNNNKIRELRAGAFQGAKDLRWLYLSEN
ALSSLQPGSLDDVENLAKFHLDKNQLSSYPSAALSKLRVVEELKLSHNPLKSIPDNAFQS
FGRYLETLWLDNTNLEKFSDAAFSGVTTLKHVHLDNNRLNQLPSSFPFDNLETLTLTNNP
WKCTCQLRGLRRWLEAKASRPDATCSSPAKFKGQRIRDTDALRSCKSPTKRSKKAGRH
NT seq 1077 nt
atggcccgcgcgctcttattcagtctggtctttcttgccatcctcctgcctgcgctagcc
gcctgcccccaaaactgccactgccatggagatctgcagcatgtcatctgcgacaaggtg
gggctgcagaagatccccaaggtatcagagacaaccaaactgctcaatctccagcgcaac
aacttcccggtgctggctgccaactcgtttcggaccatgccgaacctggtctccctgcac
ctgcaacactgcaacatccgcgaggtggcggctggtgccttccgaggcctgaagcagctt
atctacctgtacctgtcccacaacgacatccgggtattgcgagctggagccttcgacgac
ctgactgaactcacttacctctatctagaccacaacaaagtgtcggaactgccccggggg
ttgctctctcctctggtcaacctcttcatcttgcaactcaacaacaacaaaatccgagag
ctgcgtgctggagctttccagggggccaaggacctgcgctggctctacctgtcagaaaat
gccctcagttccctgcagcctggttccctggatgatgtggagaacctagccaagttccac
ctggacaagaaccagctgtctagctacccctcagccgccctgagcaaacttcgggtggtg
gaggagctgaagctgtctcacaaccctctgaagagcatcccagacaatgccttccagtcc
ttcggtagatatctggagaccctctggctggataacaccaacctggagaagttctcagat
gctgccttctcgggtgtgaccacactgaaacacgtccatctggacaacaaccgcctgaac
caactgccttcctccttcccctttgacaacctggagaccctcactctcaccaacaaccca
tggaaatgcacctgccagctccgtggccttcggcggtggttggaagccaaggcttctcga
ccggatgctacctgctcctcgccagccaagttcaagggtcagcggattcgtgacacagat
gcccttcgcagctgcaaatccccgaccaagaggtccaagaaagctggccgccattaa

KEGG   Mus musculus (house mouse): 12824
Entry
12824             CDS       T01002                                 
Symbol
Col2a1, Col2, Col2a, Col2a-1, Del1, Dmm, Lpk, M100413, Rgsc413, Rgsc856
Name
(RefSeq) collagen, type II, alpha 1
  KO
K19719  collagen type II alpha
Organism
mmu  Mus musculus (house mouse)
Pathway
mmu04151  PI3K-Akt signaling pathway
mmu04510  Focal adhesion
mmu04512  ECM-receptor interaction
mmu04974  Protein digestion and absorption
mmu05165  Human papillomavirus infection
Brite
KEGG Orthology (KO) [BR:mmu00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    12824 (Col2a1)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    12824 (Col2a1)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    12824 (Col2a1)
 09150 Organismal Systems
  09154 Digestive system
   04974 Protein digestion and absorption
    12824 (Col2a1)
 09160 Human Diseases
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    12824 (Col2a1)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   00536 Glycosaminoglycan binding proteins [BR:mmu00536]
    12824 (Col2a1)
Glycosaminoglycan binding proteins [BR:mmu00536]
 Heparan sulfate / Heparin
  Extracellular matrix molecules
   12824 (Col2a1)
Motif
Pfam: Collagen COLFI VWC VWC2L_2nd Amnionless
Other DBs
NCBI-GeneID: 12824
NCBI-ProteinID: NP_112440
MGI: 88452
Ensembl: ENSMUSG00000022483
UniProt: P28481
Position
15:complement(97873483..97902525)
AA seq 1487 aa
MIRLGAPQSLVLLTLLIAAVLRCQGQDAQEAGSCLQNGQRYKDKDVWKPSSCRICVCDTG
NVLCDDIICEDPDCLNPEIPFGECCPICPADLATASGKLGPKGQKGEPGDIRDIIGPRGP
PGPQGPAGEQGPRGDRGDKGEKGAPGPRGRDGEPGTPGNPGPAGPPGPPGPPGLSAGNFA
AQMAGGYDEKAGGAQMGVMQGPMGPMGPRGPPGPAGAPGPQGFQGNPGEPGEPGVSGPMG
PRGPPGPAGKPGDDGEAGKPGKSGERGLPGPQGARGFPGTPGLPGVKGHRGYPGLDGAKG
EAGAPGVKGESGSPGENGSPGPMGPRGLPGERGRTGPAGAAGARGNDGQPGPAGPPGPVG
PAGGPGFPGAPGAKGEAGPTGARGPEGAQGSRGEPGNPGSPGPAGASGNPGTDGIPGAKG
SAGAPGIAGAPGFPGPRGPPGPQGATGPLGPKGQAGEPGIAGFKGDQGPKGETGPAGPQG
APGPAGEEGKRGARGEPGGAGPIGPPGERGAPGNRGFPGQDGLAGPKGAPGERGPSGLTG
PKGANGDPGRPGEPGLPGARGLTGRPGDAGPQGKVGPSGAPGEDGRPGPPGPQGARGQPG
VMGFPGPKGANGEPGKAGEKGLAGAPGLRGLPGKDGETGAAGPPGPSGPAGERGEQGAPG
PSGFQGLPGPPGPPGEGGKQGDQGIPGEAGAPGLVGPRGERGFPGERGSPGAQGLQGPRG
LPGTPGTDGPKGAAGPDGPPGAQGPPGLQGMPGERGAAGIAGPKGDRGDVGEKGPEGAPG
KDGGRGLTGPIGPPGPAGANGEKGEVGPPGPSGSTGARGAPGERGETGPPGPAGFAGPPG
ADGQPGAKGDQGEAGQKGDAGAPGPQGPSGAPGPQGPTGVTGPKGARGAQGPPGATGFPG
AAGRVGPPGANGNPGPAGPPGPAGKDGPKGVRGDSGPPGRAGDPGLQGPAGAPGEKGEPG
DDGPSGLDGPPGPQGLAGQRGIVGLPGQRGERGFPGLPGPSGEPGKQGAPGASGDRGPPG
PVGPPGLTGPAGEPGREGSPGADGPPGRDGAAGVKGDRGETGALGAPGAPGPPGSPGPAG
PTGKQGDRGEAGAQGPMGPSGPAGARGIAGPQGPRGDKGESGEQGERGLKGHRGFTGLQG
LPGPPGPSGDQGASGPAGPSGPRGPPGPVGPSGKDGSNGIPGPIGPPGPRGRSGETGPVG
PPGSPGPPGPPGPPGPGIDMSAFAGLGQREKGPDPMQYMRADEADSTLRQHDVEVDATLK
SLNNQIESIRSPDGSRKNPARTCQDLKLCHPEWKSGDYWIDPNQGCTLDAMKVFCNMETG
ETCVYPNPATVPRKNWWSSKSKEKKHIWFGETMNGGFHFSYGDGNLAPNTANVQMTFLRL
LSTEGSQNITYHCKNSIAYLDEAAGNLKKALLIQGSNDVEMRAEGNSRFTYTALKDGCTK
HTGKWGKTVIEYRSQKTSRLPIIDIAPMDIGGAEQEFGVDIGPVCFL
NT seq 4464 nt
atgatccgcctcggggctccccagtcgctggtgctgctgacgctgctcatcgccgcggtc
ctacggtgtcagggccaggatgcccaggaggctggcagctgtctgcagaatgggcagagg
tataaagataaggatgtatggaagccctcatcttgccgcatctgtgtgtgtgacactggg
aatgtcctctgcgatgacattatctgtgaagacccagactgcctcaaccccgagatcccc
ttcggagagtgctgtcccatctgcccagctgacctcgccactgccagtggaaaattaggg
ccaaaggggcagaaaggagaacctggagatatcagagatatcataggacccagaggacct
cctggccctcagggacctgcaggtgaacaaggacccagaggtgatcgtggtgacaaggga
gaaaagggtgcgcctggaccccgtggcagagatggagaacctggtacccctggaaatcct
ggccccgctggccctccaggtccccctggtccccctggccttagtgcaggaaacttcgcg
gctcagatggctggagggtatgacgagaaggctggtggtgcccagatgggagtcatgcaa
gggcccatgggccccatgggaccccgtggacccccaggccctgccggtgcccccggccct
caaggatttcaaggcaatcctggtgaacctggcgagcctggtgtctctggtcccatgggt
ccccgaggtcctcctggccctgctggaaaacctggtgacgacggtgaagctgggaagccc
ggaaagtctggggaaagaggcctccctggccctcagggtgctcgtggattcccaggaacc
ccgggtctccccggtgtcaagggtcacagaggttacccaggcctcgacggtgctaagggg
gaagctggtgctccgggtgtgaagggtgagagtggttcccctggtgagaacggatccccg
ggcccaatgggtccccgtggcctgcctggtgagagaggacggactggccctgctggtgct
gctggtgctcggggtaacgatggccagccaggccccgctggacctccgggtcctgtgggt
cccgcaggtggtcctggcttccctggtgctcctggtgccaagggcgaagctggtcccact
ggtgctcgcggtcctgaaggtgctcaaggttctcgtggcgagcctggcaatcctgggtcc
cctgggcctgcaggtgcttctggtaacccagggactgatggtattcctggagccaaagga
tccgctggtgctcctggaattgctggtgcccctggcttccctgggccccgtggccctccc
ggtcctcaaggtgcaactggtccccttggccccaaaggtcaggcgggtgaacctggcatt
gctggctttaaaggtgatcaaggccccaagggagagactggacctgctgggccccaagga
gcccctggccccgctggtgaagaaggcaaacgaggtgctcgaggagagccgggtggtgct
ggaccaatcggaccccctggagagagaggtgctcctggcaaccgtggattcccaggtcaa
gatggtctggcaggtcccaagggtgcccctggagagcgagggcccagtggcttgactggt
cccaagggagccaacggtgacccgggtcgtcctggagaacctggtcttcctggagccagg
ggtcttaccggtcgccctggtgacgctggtcctcaaggcaaagttggtccttctggagcc
cctggtgaagacggtcgccctggacctcctggtcctcagggagctcgtgggcagcctggc
gtcatgggtttccctggccccaaaggtgccaacggcgagcctggcaaagctggtgagaag
ggtctggctggcgctcctggtctgagaggtcttcctggaaaagacggtgagacgggagcc
gcaggaccccccggccccagtggacctgctggtgaacgaggcgagcagggcgctcctgga
ccatcagggttccagggacttcctggccctcccggtcccccaggtgaaggtggaaagcaa
ggtgaccagggtattcctggtgaagctggagctcctggccttgtgggtcctcggggcgag
cgaggtttcccaggtgaacgtggctctcctggtgctcagggccttcagggtccccgaggc
ctccctggcactcctggtactgatggtcccaaaggtgcagctggcccagatggcccccct
ggggctcaggggcctccaggtctacagggaatgcctggtgagagaggagccgctggcatt
gctgggcccaagggagacagaggcgatgttggcgagaaaggcccagagggagctcctggg
aaggatggcggccgaggtctgactgggcccatcggacccccaggcccagcaggggccaac
ggcgagaagggagaagtcggacctcctggcccgtcaggaagtaccggagctcgaggtgcc
ccgggtgaacgcggagagaccgggccacctggacctgctggattcgctggccctcctggt
gctgatggccagcctggtgccaagggtgatcaaggagaagccggacagaaaggagatgct
ggtgcccccggcccacaaggcccctcgggagcccctgggccacagggtcctactggagtg
actggtcctaagggagcccgaggtgcccaaggtcccccgggagccaccggattccctgga
gctgctggccgagttggacccccaggtgctaatggcaatcctggacccgccggtccccct
ggtcctgctggaaaagatggtcccaaaggtgttcgaggagacagtggcccccctggcaga
gctggtgaccccggtcttcaaggtcctgcaggagctcctggcgagaaaggagaacctgga
gatgatggtccctctggtcttgatggtcctccaggtccccaggggctggctggtcaaagg
ggcattgttggtctgcctggtcagcgtggtgagagaggattccccggccttcccggccca
tcgggtgagcccggcaagcagggtgcacctggcgcgtctggagacagaggtcctcctggt
cctgtggggcctcctggcctgacagggcctgcaggtgaacctggacgagagggcagccct
ggtgctgatggaccccctggaagagatggtgcagctggagtcaagggagatcgtggtgag
actggagcactgggtgcccctggagctcctgggcccccaggctctcctggtcctgctggc
ccaactggcaaacaaggagacagaggagaggctggtgcacaaggtcctatgggtccctca
ggacctgctggagcccgtgggattgcaggccctcaaggcccccgaggtgacaaaggagaa
tctggagagcagggcgagaggggactgaagggacaccgaggtttcactggactgcagggt
ctgcctggccctccgggtccttctggagatcagggtgcttctggccctgctggtccttct
ggccctagaggtccacctggccctgttggtccctctggcaaagatggctctaatggaatc
cctggccccatcgggcctccaggtccccgtggacgctcaggagaaacaggccctgttggt
ccccctggaagtcccggtcctcctggccctccaggtcctcctggtcctggcatcgacatg
tcagcctttgctggcttagggcagagagagaagggccccgaccccatgcagtacatgcgg
gccgacgaggcagacagtaccttgagacagcacgacgtggaggtggacgctacactcaag
tcactgaacaaccagattgagagcatccgcagccccgacggctcccgcaagaaccctgct
cgcacttgccaagacctgaaactctgccaccccgagtggaagagcggagactactggatt
gatcccaaccagggctgcaccttggacgccatgaaagttttctgcaacatggagaccggc
gagacttgcgtctaccccaacccagcgactgtccctcggaaaaactggtggagcagcaag
agcaaggaaaagaaacacatctggtttggagagaccatgaacggtggcttccacttcagc
tatggcgatggcaacctggctcccaacaccgctaacgtccagatgactttcctccgtcta
ctgtccactgagggctcccagaacatcacctaccactgtaagaacagcatcgcctacctg
gacgaagcggctggcaacctcaagaaggccttgctcatccagggctccaatgatgtagag
atgagggccgagggcaacagcaggttcacatacactgccctgaaggatggctgcacgaaa
cacactggtaagtggggcaagaccgtcatcgagtaccgatcacagaagacctcccgcctt
cccattattgacatcgcacccatggacattggaggggctgaacaggaatttggtgtggac
atagggcctgtctgcttcttgtaa

KEGG   Mus musculus (house mouse): 12826
Entry
12826             CDS       T01002                                 
Symbol
Col4a1, Bru, Col4a-1, Raw, Svc
Name
(RefSeq) collagen, type IV, alpha 1
  KO
K06237  collagen type IV alpha
Organism
mmu  Mus musculus (house mouse)
Pathway
mmu04151  PI3K-Akt signaling pathway
mmu04382  Cornified envelope formation
mmu04510  Focal adhesion
mmu04512  ECM-receptor interaction
mmu04820  Cytoskeleton in muscle cells
mmu04926  Relaxin signaling pathway
mmu04933  AGE-RAGE signaling pathway in diabetic complications
mmu04974  Protein digestion and absorption
mmu05146  Amoebiasis
mmu05165  Human papillomavirus infection
mmu05200  Pathways in cancer
mmu05222  Small cell lung cancer
Brite
KEGG Orthology (KO) [BR:mmu00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    12826 (Col4a1)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    12826 (Col4a1)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    12826 (Col4a1)
  09142 Cell motility
   04820 Cytoskeleton in muscle cells
    12826 (Col4a1)
 09150 Organismal Systems
  09152 Endocrine system
   04926 Relaxin signaling pathway
    12826 (Col4a1)
  09154 Digestive system
   04974 Protein digestion and absorption
    12826 (Col4a1)
  09158 Development and regeneration
   04382 Cornified envelope formation
    12826 (Col4a1)
 09160 Human Diseases
  09161 Cancer: overview
   05200 Pathways in cancer
    12826 (Col4a1)
  09162 Cancer: specific types
   05222 Small cell lung cancer
    12826 (Col4a1)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    12826 (Col4a1)
  09174 Infectious disease: parasitic
   05146 Amoebiasis
    12826 (Col4a1)
  09167 Endocrine and metabolic disease
   04933 AGE-RAGE signaling pathway in diabetic complications
    12826 (Col4a1)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   04147 Exosome [BR:mmu04147]
    12826 (Col4a1)
   00536 Glycosaminoglycan binding proteins [BR:mmu00536]
    12826 (Col4a1)
Exosome [BR:mmu04147]
 Exosomal proteins
  Exosomal proteins of other cancer cells
   12826 (Col4a1)
Glycosaminoglycan binding proteins [BR:mmu00536]
 Heparan sulfate / Heparin
  Extracellular matrix molecules
   12826 (Col4a1)
Motif
Pfam: Collagen C4
Other DBs
NCBI-GeneID: 12826
NCBI-ProteinID: NP_034061
MGI: 88454
Ensembl: ENSMUSG00000031502
UniProt: P02463
Position
8:complement(11248423..11362889)
AA seq 1669 aa
MGPRLSVWLLLLFAALLLHEERSRAAAKGDCGGSGCGKCDCHGVKGQKGERGLPGLQGVI
GFPGMQGPEGPHGPPGQKGDAGEPGLPGTKGTRGPPGAAGYPGNPGLPGIPGQDGPPGPP
GIPGCNGTKGERGPLGPPGLPGFSGNPGPPGLPGMKGDPGEILGHVPGTLLKGERGFPGI
PGMPGSPGLPGLQGPVGPPGFTGPPGPPGPPGPPGEKGQMGSSFQGPKGDKGEQGVSGPP
GVPGQAQVKEKGDFAPTGEKGQKGEPGFPGVPGYGEKGEPGKQGPRGKPGKDGEKGERGS
PGIPGDSGYPGLPGRQGPQGEKGEAGLPGPPGTVIGTMPLGEKGDRGYPGAPGLRGEPGP
KGFPGTPGQPGPPGFPTPGQAGAPGFPGERGEKGDQGFPGVSLPGPSGRDGAPGPPGPPG
PPGQPGHTNGIVECQPGPPGDQGPPGTPGQPGLTGEVGQKGQKGESCLACDTEGLRGPPG
PQGPPGEIGFPGQPGAKGDRGLPGRDGLEGLPGPQGSPGLIGQPGAKGEPGEIFFDMRLK
GDKGDPGFPGQPGMPGRAGTPGRDGHPGLPGPKGSPGSIGLKGERGPPGGVGFPGSRGDI
GPPGPPGVGPIGPVGEKGQAGFPGGPGSPGLPGPKGEAGKVVPLPGPPGAAGLPGSPGFP
GPQGDRGFPGTPGRPGIPGEKGAVGQPGIGFPGLPGPKGVDGLPGEIGRPGSPGRPGFNG
LPGNPGPQGQKGEPGIGLPGLKGQPGLPGIPGTPGEKGSIGGPGVPGEQGLTGPPGLQGI
RGDPGPPGVQGPAGPPGVPGIGPPGAMGPPGGQGPPGSSGPPGIKGEKGFPGFPGLDMPG
PKGDKGSQGLPGLTGQSGLPGLPGQQGTPGVPGFPGSKGEMGVMGTPGQPGSPGPAGTPG
LPGEKGDHGLPGSSGPRGDPGFKGDKGDVGLPGMPGSMEHVDMGSMKGQKGDQGEKGQIG
PTGDKGSRGDPGTPGVPGKDGQAGHPGQPGPKGDPGLSGTPGSPGLPGPKGSVGGMGLPG
SPGEKGVPGIPGSQGVPGSPGEKGAKGEKGQSGLPGIGIPGRPGDKGDQGLAGFPGSPGE
KGEKGSAGTPGMPGSPGPRGSPGNIGHPGSPGLPGEKGDKGLPGLDGVPGVKGEAGLPGT
PGPTGPAGQKGEPGSDGIPGSAGEKGEQGVPGRGFPGFPGSKGDKGSKGEVGFPGLAGSP
GIPGVKGEQGFMGPPGPQGQPGLPGTPGHPVEGPKGDRGPQGQPGLPGHPGPMGPPGFPG
INGPKGDKGNQGWPGAPGVPGPKGDPGFQGMPGIGGSPGITGSKGDMGLPGVPGFQGQKG
LPGLQGVKGDQGDQGVPGPKGLQGPPGPPGPYDVIKGEPGLPGPEGPPGLKGLQGPPGPK
GQQGVTGSVGLPGPPGVPGFDGAPGQKGETGPFGPPGPRGFPGPPGPDGLPGSMGPPGTP
SVDHGFLVTRHSQTTDDPLCPPGTKILYHGYSLLYVQGNERAHGQDLGTAGSCLRKFSTM
PFLFCNINNVCNFASRNDYSYWLSTPEPMPMSMAPISGDNIRPFISRCAVCEAPAMVMAV
HSQTIQIPQCPNGWSSLWIGYSFVMHTSAGAEGSGQALASPGSCLEEFRSAPFIECHGRG
TCNYYANAYSFWLATIERSEMFKKPTPSTLKAGELRTHVSRCQVCMRRT
NT seq 5010 nt
atggggccccggctcagcgtctggcttctgctgctcttcgccgcccttctgctccacgag
gagcgcagccgagcagctgcgaagggcgattgtggtggctctggctgtggaaaatgtgac
tgtcatggcgtgaagggacaaaagggagaaagaggcttgccagggttgcaaggtgtcatt
ggatttcccgggatgcaaggacctgaggggccacatggaccaccaggacaaaagggtgat
gctggagaacctggacttcctggcacaaaagggacgaggggaccccctggagcagctggc
taccctgggaatcccggacttcctggtattcctggccaagatggccctccaggtccccca
ggtatcccaggatgcaacggtacaaagggagagagagggccgctcggtcctcctggcttg
cctggattcagcggcaatcctggaccaccagggttaccaggaatgaagggagatcctggt
gaaattctcggccacgttcccggaacactgctgaaaggggagagaggatttcctggtatc
cccgggatgccgggctcaccagggttgccaggactccagggacccgtgggtcctccggga
tttactggaccaccgggtcctccaggccctcctggacctcctggagaaaagggccagatg
ggctccagcttccaaggaccaaaaggtgacaagggagagcaaggggtcagcggtcccccg
ggagttcctggacaggcacaagttaaggaaaaaggagactttgccccaacaggagaaaag
ggtcagaaaggtgaacctggatttccgggagtaccaggatatggagagaaaggtgaacct
ggcaagcaagggcctcggggaaaacctgggaaagacggtgaaaaaggagaaagggggagc
ccgggcattcctggcgattccgggtacccaggtctcccaggccggcagggcccgcaggga
gagaagggtgaagctggacttccaggcccccctggaactgtgataggcaccatgcctttg
ggagagaaaggagatcgaggctacccaggagcaccagggttgagaggagagcccggccct
aaaggtttccctggaacaccaggccaaccaggccctccaggtttccctactccaggccag
gctggtgctccaggcttcccaggtgaaaggggagaaaaaggtgaccagggatttccaggc
gtgtcattgccaggaccaagtggaagagatggagccccggggcctcccggccctcccggc
ccccctgggcagccaggccacacaaatggcattgtggagtgtcaacctggaccacccggg
gaccagggccctcctgggactccaggacagccaggtttgacaggtgaagttggacagaaa
ggtcagaaaggagagagctgccttgcctgtgacacagaaggacttcgtggccctccaggg
ccacaggggcctccgggagagattggtttccctggacaacctggggctaaaggtgaccga
ggcctgcccggcagagatggtcttgaaggattgccgggtccacaaggttcaccagggctt
ataggccagcctggagctaagggagagcctggagagatattttttgacatgcgactcaaa
ggtgacaaaggagacccaggttttccgggacagcctggcatgccaggaagagcaggaact
cccggaagagatggccacccaggactccctggacccaaaggctctccgggttcaatagga
ttgaaaggagagcgaggtcctcccggaggagttggcttccctggtagtcgtggagacatc
ggccctcctggacccccaggagttggccccattggtcctgttggtgaaaaaggacaagca
ggctttcctggaggccctgggtccccaggtcttccaggcccaaagggtgaggcaggaaag
gttgtcccactccctggccccccaggagctgcaggacttccaggatcccctggcttccca
gggccacaaggtgaccgaggcttcccaggaaccccaggacgtccaggcatcccgggagag
aaaggtgctgtgggccagccaggaattggatttcctgggcttcctggccccaaaggtgtt
gatggcttgcctggagagataggacggcctgggagtccaggtcgccctggatttaacggc
ttacctggcaacccaggacctcaaggtcaaaagggagaacctggcattgggcttccagga
ctcaaagggcaaccaggccttccaggcattcccggtacacctggagagaagggtagcatc
gggggacccggcgttccaggagaacaggggttgacaggccccccaggactccaggggatc
agaggtgacccagggcctcctggagttcaaggcccagcaggtccaccaggggtcccagga
atagggccacccggagctatgggccctcccggagggcaaggaccaccagggtcatcaggt
ccacctggaattaaaggagagaaagggttccctggattccccggactggatatgcctggc
cccaaaggcgataaaggctctcaaggacttcctggcctcacaggacagtcaggcctccct
ggccttcctggacagcaggggacacctggagttccagggttcccaggttctaaaggtgaa
atgggtgtcatgggaaccccgggacaaccaggctcgccaggaccagcaggcaccccaggg
ttacctggagaaaaaggggaccatggccttccgggctcctcaggacccaggggcgaccct
ggcttcaaaggtgataaaggtgacgttgggcttcctggcatgccaggatccatggagcat
gtggacatggggagcatgaagggacagaaaggagaccagggagagaaaggacaaatcgga
cccactggtgataaaggttcccgaggagaccctggaacaccaggagtacctgggaaggat
gggcaggcagggcatcccggacagccagggcctaaaggtgacccaggccttagtgggaca
ccaggatcccctggactccctggacccaaaggatcagttggaggaatgggcttgccaggt
tcgcctggagaaaaaggtgtgcctggcatccctggctcacagggtgtccctggctcacct
ggagagaagggagccaaaggagagaaagggcagtcaggtctacctggcattgggattccg
ggacggcctggtgacaagggagaccagggccttgctggcttcccaggcagccccggtgag
aagggagagaaaggcagtgccggaaccccagggatgccagggtccccaggcccgagaggt
tctccggggaacatcggccatccaggaagcccaggcctgcctggagagaaaggggataaa
ggcctcccaggactggatggcgttcctggtgtcaaaggagaagcaggtctccctgggact
cctggccccacaggcccagctggccagaagggagagccgggcagcgatggaatcccgggg
tcggcaggagagaagggtgaacaaggtgttccaggaagaggcttccccggcttccctggc
tccaaaggagacaaaggctccaagggtgaagtgggtttccctggcctagctggaagtcct
ggaattcctggagtcaaaggcgagcaagggttcatgggtcctcctggccctcaaggacaa
cctggcttacctggcactcctggtcaccctgtggaggggcccaaaggagaccgaggacct
cagggtcaacctggcctgccagggcatccgggacctatggggccgccagggttcccagga
atcaatgggccaaaaggtgacaagggaaatcaaggttggccaggagctccgggggttcca
ggccctaagggagacccaggattccaaggcatgccgggcattggcggctctccagggatc
acaggttcaaagggagatatgggactgcccggagttccaggatttcaaggtcagaaaggg
cttcctggtctgcagggagtgaaaggagaccagggagaccaaggtgtacccggccctaaa
ggtctccaaggtccccctgggcctccaggtccctacgatgtcatcaaaggagaaccaggg
ctccctggtcctgagggtccccctggtcttaaaggactccagggaccaccaggtccaaaa
ggacagcaaggtgtgacaggctcagtgggcttgcctggacctccaggtgtccctgggttc
gatggtgctcctggccagaaaggagagacaggaccctttggaccacctggtccaagaggg
tttcctggcccaccaggccccgatgggctgccaggatccatgggtcccccaggtacccca
tctgtggaccatggcttccttgtgaccaggcatagtcagacaacagatgacccactgtgt
cccccagggaccaaaattctttaccatggatactctctgctctatgtccaaggcaacgag
cgtgcccacgggcaggacttgggtacggctggcagctgcctgcgtaagttcagcaccatg
ccctttctcttctgcaacatcaacaacgtctgcaacttcgcctccaggaacgactactct
tactggctgtccacgccagagcccatgcccatgtccatggcacccatctctggggacaac
atccggcccttcattagcaggtgtgcggtttgtgaagcaccggccatggtgatggcggta
cacagtcagaccattcagattccgcagtgccctaacggttggtcctcactgtggatcggc
tattccttcgtgatgcacaccagcgctggtgctgaaggttccggccaagccctcgcatcc
cccgggtcctgtctggaagagtttagaagcgccccattcatcgagtgccacggcagagga
acgtgcaattactacgcaaatgcttacagcttttggctcgccaccatagagagaagcgag
atgttcaagaagcccacgccatccaccttgaaggcaggggagctgcgaacacacgtcagc
cgctgccaagtgtgcatgagaagaacataa

KEGG   Mus musculus (house mouse): 12827
Entry
12827             CDS       T01002                                 
Symbol
Col4a2, Col4a-2
Name
(RefSeq) collagen, type IV, alpha 2
  KO
K06237  collagen type IV alpha
Organism
mmu  Mus musculus (house mouse)
Pathway
mmu04151  PI3K-Akt signaling pathway
mmu04382  Cornified envelope formation
mmu04510  Focal adhesion
mmu04512  ECM-receptor interaction
mmu04820  Cytoskeleton in muscle cells
mmu04926  Relaxin signaling pathway
mmu04933  AGE-RAGE signaling pathway in diabetic complications
mmu04974  Protein digestion and absorption
mmu05146  Amoebiasis
mmu05165  Human papillomavirus infection
mmu05200  Pathways in cancer
mmu05222  Small cell lung cancer
Brite
KEGG Orthology (KO) [BR:mmu00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    12827 (Col4a2)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    12827 (Col4a2)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    12827 (Col4a2)
  09142 Cell motility
   04820 Cytoskeleton in muscle cells
    12827 (Col4a2)
 09150 Organismal Systems
  09152 Endocrine system
   04926 Relaxin signaling pathway
    12827 (Col4a2)
  09154 Digestive system
   04974 Protein digestion and absorption
    12827 (Col4a2)
  09158 Development and regeneration
   04382 Cornified envelope formation
    12827 (Col4a2)
 09160 Human Diseases
  09161 Cancer: overview
   05200 Pathways in cancer
    12827 (Col4a2)
  09162 Cancer: specific types
   05222 Small cell lung cancer
    12827 (Col4a2)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    12827 (Col4a2)
  09174 Infectious disease: parasitic
   05146 Amoebiasis
    12827 (Col4a2)
  09167 Endocrine and metabolic disease
   04933 AGE-RAGE signaling pathway in diabetic complications
    12827 (Col4a2)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   04147 Exosome [BR:mmu04147]
    12827 (Col4a2)
   00536 Glycosaminoglycan binding proteins [BR:mmu00536]
    12827 (Col4a2)
Exosome [BR:mmu04147]
 Exosomal proteins
  Exosomal proteins of other cancer cells
   12827 (Col4a2)
Glycosaminoglycan binding proteins [BR:mmu00536]
 Heparan sulfate / Heparin
  Extracellular matrix molecules
   12827 (Col4a2)
Motif
Pfam: Collagen C4
Other DBs
NCBI-GeneID: 12827
NCBI-ProteinID: NP_034062
MGI: 88455
Ensembl: ENSMUSG00000031503
UniProt: P08122 B2RQQ8
Position
8:11362878..11499287
AA seq 1707 aa
MDRVRFKASGPPLRGWLLLATVTVGLLAQSVLGGVKKLDVPCGGRDCSGGCQCYPEKGAR
GQPGAVGPQGYNGPPGLQGFPGLQGRKGDKGERGVPGPTGPKGDVGARGVSGFPGADGIP
GHPGQGGPRGRPGYDGCNGTRGDAGPQGPSGSGGFPGLPGPQGPKGQKGEPYALSKEDRD
KYRGEPGEPGLVGYQGPPGRPGPIGQMGPMGAPGRPGPPGPPGPKGQPGNRGLGFYGQKG
EKGDIGQPGPNGIPSDITLVGPTTSTIHPDLYKGEKGDEGEQGIPGVISKGEEGIMGFPG
IRGFPGLDGEKGVVGQKGSRGLDGFQGPSGPRGPKGERGEQGPPGPSVYSPHPSLAKGAR
GDPGFQGAHGEPGSRGEPGEPGTAGPPGPSVGDEDSMRGLPGEMGPKGFSGEPGSPARYL
GPPGADGRPGPQGVPGPAGPPGPDGFLFGLKGSEGRVGYPGPSGFPGTRGQKGWKGEAGD
CQCGQVIGGLPGLPGPKGFPGVNGELGKKGDQGDPGLHGIPGFPGFKGAPGVAGAPGPKG
IKGDSRTITTKGERGQPGIPGVHGMKGDDGVPGRDGLDGFPGLPGPPGDGIKGPPGDAGL
PGVPGTKGFPGDIGPPGQGLPGPKGERGFPGDAGLPGPPGFPGPPGPPGTPGQRDCDTGV
KRPIGGGQQVVVQPGCIEGPTGSPGQPGPPGPTGAKGVRGMPGFPGASGEQGLKGFPGDP
GREGFPGPPGFMGPRGSKGTTGLPGPDGPPGPIGLPGPAGPPGDRGIPGEVLGAQPGTRG
DAGLPGQPGLKGLPGETGAPGFRGSQGMPGMPGLKGQPGFPGPSGQPGQSGPPGQHGFPG
TPGREGPLGQPGSPGLGGLPGDRGEPGDPGVPGPVGMKGLSGDRGDAGMSGERGHPGSPG
FKGMAGMPGIPGQKGDRGSPGMDGFQGMLGLKGRQGFPGTKGEAGFFGVPGLKGLPGEPG
VKGNRGDRGPPGPPPLILPGMKDIKGEKGDEGPMGLKGYLGLKGIQGMPGVPGVSGFPGL
PGRPGFIKGVKGDIGVPGTPGLPGFPGVSGPPGITGFPGFTGSRGEKGTPGVAGVFGETG
PTGDFGDIGDTVDLPGSPGLKGERGITGIPGLKGFFGEKGAAGDIGFPGITGMAGAQGSP
GLKGQTGFPGLTGLQGPQGEPGRIGIPGDKGDFGWPGVPGLPGFPGIRGISGLHGLPGTK
GFPGSPGVDAHGDPGFPGPTGDRGDRGEANTLPGPVGVPGQKGERGTPGERGPAGSPGLQ
GFPGISPPSNISGSPGDVGAPGIFGLQGYQGPPGPPGPNALPGIKGDEGSSGAAGFPGQK
GWVGDPGPQGQPGVLGLPGEKGPKGEQGFMGNTGPSGAVGDRGPKGPKGDQGFPGAPGSM
GSPGIPGIPQKIAVQPGTLGPQGRRGLPGALGEIGPQGPPGDPGFRGAPGKAGPQGRGGV
SAVPGFRGDQGPMGHQGPVGQEGEPGRPGSPGLPGMPGRSVSIGYLLVKHSQTDQEPMCP
VGMNKLWSGYSLLYFEGQEKAHNQDLGLAGSCLARFSTMPFLYCNPGDVCYYASRNDKSY
WLSTTAPLPMMPVAEEEIKPYISRCSVCEAPAVAIAVHSQDTSIPHCPAGWRSLWIGYSF
LMHTAAGDEGGGQSLVSPGSCLEDFRATPFIECNGGRGTCHYFANKYSFWLTTIPEQNFQ
STPSADTLKAGLIRTHISRCQVCMKNL
NT seq 5124 nt
atggaccgagtgcggttcaaagcgtctggccctcccctgcgagggtggctgctgctagca
actgtgacagtgggactcctggctcagagcgtcttggggggtgtgaagaagttggatgtg
ccctgcggagggagagactgcagtgggggttgccagtgctacccggagaaaggagcaagg
ggtcagccaggggcagtgggcccccaggggtacaatggccccccagggttgcaaggattc
ccaggactacagggccgcaaaggagacaagggcgaacggggagttcctggaccaactgga
ccaaaaggagatgtgggagcgagaggcgtctctggattccccggtgcagatggaattcct
gggcatccagggcaaggcggaccccgaggaaggccaggctacgatggttgcaacggaacc
aggggagacgcgggtccacaggggccctctggatctgggggcttccctggcctccctggg
ccccaaggacccaaggggcagaaaggcgagccttacgcactctctaaagaagaccgtgac
aaataccggggtgaacctggagagcctggcctggttggctaccagggacctcctggccgc
ccagggcccataggacagatgggtcccatgggagcacctggaagaccgggaccaccggga
ccccctggacccaagggacaaccaggcaaccgagggctcggtttttacggacagaagggt
gaaaagggtgacataggacagccaggacccaacgggatcccatctgacatcacacttgtt
gggcccacaacatcaacgatccacccggatctgtacaagggtgaaaaaggagacgaagga
gagcaagggatacccggcgtaatctcaaaaggcgaggaaggcatcatggggttcccagga
atacggggttttcctggccttgatggagaaaaaggagtcgtaggacagaaaggaagcaga
ggcctggatggtttccaaggccccagtggacctcgaggacccaagggagaacggggggaa
caaggacccccaggaccctctgtctactcgccccatccatccctggcaaaaggtgcccga
ggtgacccaggattccaaggtgctcatggggagccaggaagccgaggcgaaccgggagag
cctggaaccgcgggtcctccaggcccatccgtcggagatgaagattcaatgagaggcctt
ccaggggagatgggacccaaaggcttctcaggagaaccagggtcccctgcacgctatctt
ggcccaccaggagcagatggaagaccaggtccccaaggagtccctggacctgcaggccca
cctggaccagatggcttcctgtttggcctgaaaggatcagaagggagagtgggctacccg
ggaccttcaggtttcccaggaacacgagggcagaaaggatggaaaggtgaagctggagac
tgccaatgtggccaggtcatcgggggtcttccgggactgccaggacccaaaggttttccc
ggtgtcaatggggagctcggaaagaaaggcgaccaaggagacccaggcctgcacggtatc
cctgggttcccaggattcaagggagccccgggcgttgctggagctcctggacccaaagga
ataaagggagactctagaacaatcaccaccaaaggggagcgaggacagccaggtatccct
ggtgtacacggaatgaaaggtgatgacggcgtcccaggacgggatgggctggatggattc
cctggcctgcctggccccccgggcgatggcatcaaaggccctccaggggatgcaggtctc
cctggtgtacccggaaccaagggctttccaggagacatcggtcctccaggacagggctta
ccaggtccgaaaggcgagcgtggtttccctggagacgctgggttgcctggacctccgggc
ttccctggtcctccaggtcccccaggaaccccaggacagagagattgtgacacgggtgtg
aaaagacctatcggaggcggccagcaagtggtcgttcagccaggttgcattgaaggcccc
acaggatcacctggccagccaggacccccaggccccacaggtgccaaaggcgtcagagga
atgccaggattcccaggagcttctggagaacaagggctcaagggctttccaggagaccca
ggtcgagaaggcttcccagggcccccagggttcatgggaccccgaggatccaaaggtacc
acaggcctccctggcccagatggacccccaggccccattggacttccaggaccagctgga
ccccctggggacagagggattccaggagaagtgctaggggcccagcctgggacacgagga
gatgctggacttccaggacagcctgggctgaaagggctcccaggagaaacaggagcaccg
ggattcagaggaagccaagggatgcccgggatgccagggcttaaaggtcagccaggcttc
ccaggaccttcgggacagccaggacagtctgggcccccaggacaacatggattcccagga
actcctggccgagaggggcctttggggcagccaggctcccctggccttggaggtttgcct
ggagacaggggcgagcctggtgaccctggtgtccctggccctgtgggcatgaaaggtctg
tctggggacagaggtgacgccggtatgtcaggcgagcgtggtcaccctggaagccctgga
tttaaaggaatggcagggatgcctggtatccctgggcagaaaggtgatcgagggtcgcct
ggaatggacggtttccagggcatgttgggacttaagggaagacagggcttcccagggacc
aaaggagaagctggattttttggagttcctggtttgaagggcctgcctggcgagccaggg
gttaaaggaaaccgaggagaccgggggcccccaggaccacctcctctcatcctgccgggg
atgaaggacatcaagggagaaaagggagatgaaggaccaatgggcctgaaagggtatctg
ggcttaaaaggcatccaaggaatgcccggagtccccggagtgtctggattccctgggcta
cctggaaggcctggcttcatcaaaggagtcaagggagacatcggagtccctggcacacca
ggcttgccgggattccctggggtgtctggccctcctggaattaccgggtttccaggattc
acaggcagccggggcgagaagggtactccaggagtagcaggagtttttggcgagaccggc
cctactggggactttggtgacattggggacactgtggacttaccagggagcccaggcctg
aagggggaacggggcatcacgggaattccaggtctcaagggattctttggagagaaagga
gctgcgggtgacattggatttcctgggataacgggcatggctggagctcagggctctcct
ggacttaaaggacagacaggctttccagggctcacagggctgcaagggccacagggagag
cctggacggatcggaatacccggtgacaagggtgatttcggctggccaggcgtaccaggt
ctaccaggttttcctgggatccggggcatcagcggattgcacggcctgccaggcaccaaa
ggcttccctggatcaccaggtgttgatgctcatggagacccagggttcccaggccccact
ggagacaggggtgaccggggagaggccaacacacttccaggccccgtgggagttccaggg
cagaaaggggagcggggaaccccaggggaacgtggcccagctggaagcccaggacttcag
ggtttccctggtatctctccaccatccaacatctctgggtcacctggtgatgtaggcgca
ccaggaatatttggcttgcaaggctaccaaggccctccaggaccacctgggccaaacgca
cttcctggaatcaaaggagatgaggggagctctggagctgcaggattccctggacagaaa
ggatgggttggggacccagggccccagggccagcctggtgtactcggtcttccgggggag
aaagggcccaagggtgaacaaggattcatgggcaacaccgggccctctggggccgtgggt
gacagaggccccaaaggacccaaaggcgaccaaggattcccaggtgctcctggctctatg
gggtccccaggaattcctggcatcccccagaagattgctgtccagcctggaacgctgggt
ccccagggcaggagaggccttcctggggccctgggagagatagggccgcagggccctcct
ggagatccaggcttccgtggggccccaggcaaggctgggcctcagggcagaggtggcgtg
tctgctgttccagggttccggggagaccaagggcccatgggacaccagggtccagttggc
caggaaggggagccaggccgtccagggagcccaggcctgccagggatgccaggccgcagt
gtgagcatcggctacctcctggtgaagcacagccaaacggaccaggaacccatgtgccct
gtgggcatgaacaagctctggagtgggtacagcctgctatattttgagggccaggagaaa
gcgcacaaccaggacctaggactggcaggctcctgcctggcacgcttcagcaccatgcct
ttcctgtactgcaatccgggtgacgtctgctactatgccagccgcaacgacaagtcctac
tggctctccaccacggcccctctgcccatgatgcccgtggctgaggaggaaatcaagccc
tacatcagccgctgctctgtgtgcgaggctccggccgtggccattgccgtgcacagccag
gatacctctataccccactgcccggctgggtggcggagtttgtggatcggatattcattc
ctcatgcacactgcagccggggatgaaggcggtggccagtcactggtgtcgccgggcagc
tgtctagaggacttccgtgcaacgccatttatcgagtgtaacgggggccgtggtacctgc
cactacttcgctaacaagtacagcttctggctgaccacgatcccagagcagaacttccag
agcacaccatccgctgacacgctcaaggctggcctcatccgcacgcacatcagccgctgc
caagtgtgcatgaagaatctgtga

KEGG   Mus musculus (house mouse): 12828
Entry
12828             CDS       T01002                                 
Symbol
Col4a3, [a]3(IV), alpha3(IV)
Name
(RefSeq) collagen, type IV, alpha 3
  KO
K06237  collagen type IV alpha
Organism
mmu  Mus musculus (house mouse)
Pathway
mmu04151  PI3K-Akt signaling pathway
mmu04382  Cornified envelope formation
mmu04510  Focal adhesion
mmu04512  ECM-receptor interaction
mmu04820  Cytoskeleton in muscle cells
mmu04926  Relaxin signaling pathway
mmu04933  AGE-RAGE signaling pathway in diabetic complications
mmu04974  Protein digestion and absorption
mmu05146  Amoebiasis
mmu05165  Human papillomavirus infection
mmu05200  Pathways in cancer
mmu05222  Small cell lung cancer
Brite
KEGG Orthology (KO) [BR:mmu00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    12828 (Col4a3)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    12828 (Col4a3)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    12828 (Col4a3)
  09142 Cell motility
   04820 Cytoskeleton in muscle cells
    12828 (Col4a3)
 09150 Organismal Systems
  09152 Endocrine system
   04926 Relaxin signaling pathway
    12828 (Col4a3)
  09154 Digestive system
   04974 Protein digestion and absorption
    12828 (Col4a3)
  09158 Development and regeneration
   04382 Cornified envelope formation
    12828 (Col4a3)
 09160 Human Diseases
  09161 Cancer: overview
   05200 Pathways in cancer
    12828 (Col4a3)
  09162 Cancer: specific types
   05222 Small cell lung cancer
    12828 (Col4a3)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    12828 (Col4a3)
  09174 Infectious disease: parasitic
   05146 Amoebiasis
    12828 (Col4a3)
  09167 Endocrine and metabolic disease
   04933 AGE-RAGE signaling pathway in diabetic complications
    12828 (Col4a3)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   04147 Exosome [BR:mmu04147]
    12828 (Col4a3)
   00536 Glycosaminoglycan binding proteins [BR:mmu00536]
    12828 (Col4a3)
Exosome [BR:mmu04147]
 Exosomal proteins
  Exosomal proteins of other cancer cells
   12828 (Col4a3)
Glycosaminoglycan binding proteins [BR:mmu00536]
 Heparan sulfate / Heparin
  Extracellular matrix molecules
   12828 (Col4a3)
Motif
Pfam: Collagen C4
Other DBs
NCBI-GeneID: 12828
NCBI-ProteinID: NP_031760
MGI: 104688
Ensembl: ENSMUSG00000079465
UniProt: Q9QZS0
LinkDB
Position
1:82564647..82699778
AA seq 1669 aa
MHSKTAPRFLVFLLLTLLLLLAASPVASKGCVCKGKGQCLCAGTKGEKGEKGVPGSPGFP
GQKGFPGPEGLPGPQGPKGSPGLPGLTGPKGIRGITGLPGFAGPPGLPGLPGHPGPRGLA
GLPGCNGSKGEQGFPGFPGTPGYAGLPGPDGLKGQKGEPAQGEDRGFNGKGDPGPPGVPG
FQGFPGLPGFPGPAGPPGPPGFFGLPGAMGPRGPKGHMGDSVIGQKGERGMKGLTGPPGP
PGTVIFTLTQPYNKSDFKGEKGDEGERGEPGPPGPSGPPGDSYGSEKGAPGEPGPRGKPG
KDGAPGFPGTEGAKGNRGFPGLRGEAGIKGRKGDIGPPGFPGPTEYYDAYLEKGERGMPG
LPGPKGARGPQGPSGPPGVPGSPGLSRPGLRGPIGWPGLKGSKGERGPPGKDTVGPPGPL
GCPGSPGPPGPPGPPGCPGDIVFKCSPGEHGMPGDTGPPGVPGLDGPKGEPGSPCTECHC
FPGPPGVPGFPGLDGIKGIPGGRGVPGLKGNPGSPGSAGLPGFAGFPGDQGHPGLKGDKG
DTPLPWGQVGNPGDPGLRGLPGRKGFDGTPGGPGAKGPPGPQGEPALSGRKGDQGPPGPP
GFPGPPGPAGPAGPPGYGPQGEPGPKGAQGVPGVLGPPGEAGLKGEPSTSTPDLGPPGPP
GPPGQAGPRGLPGLPGPVGKCDPGLPGPDGEPGIPEAGCPGPPGPKGNQGFPGTKGSPGC
PGEMGKPGRPGEPGIPGAKGEPSVGRPGKPGKPGFPGERGNAGENGDIGLPGLPGLPGTP
GRGGLDGPPGDPGQPGSPGAKGSPGRCIPGPRGTQGLPGLNGLKGQPGRRGDTGPKGDPG
IPGMDRSGVPGDPGPPGTPGCPGEMGPPGQKGYPGAPGFPGPPGEKGEVGMMGYPGTTGP
PGLPGKPGSQGQRGSLGIPGMKGEKGRPGAKGERGEKGKPGPSQTTLLKGDKGEPGLKGF
VGNPGEKGNRGNPGLPGPKGLEGLPGLPGPPGPRGDTGSRGNPGRPGPHGMPGSMGIMGV
PGPKGRKGTSGLPGLAGRPGLTGIHGPQGDKGEPGYSEGARPGPPGPKGDPGLPGDKGKK
GERGVPGPPGQSGPAGPDGAPGSPGSPGHPGKPGPAGDLGLKGQKGFPGPPGSTGPPGPP
GLPGLPGPMGMRGDQGRDGIPGPPGEKGETGLLGAYPGPKGSPGVPGAKGDRGVPGLSGL
PGRKGVMGDVGPQGPPGTAGLPGPPGLPGAIIPGPKGDRGLPGLRGNPGEPGPPGPPGPI
GKGIKGDKGFMGPPGPKGLPGTVGDMGPPGFPGAPGTPGLPGVRGDPGFPGFPGIKGEKG
NPGFLGPIGHPGPVGPKGPPGPRGKPGTLKVISLPGSPGPPGVPGQPGMKGDPGPLGLPG
IPGPCGPRGKPGKDGKPGTPGPAGTKGNKGLKGQQGPPGLDGLPGLKGNPGDRGTPATGT
RMRGFIFTRHSQTTAIPSCPEGTQPLYSGFSLLFVQGNKRAHGQDLGTLGSCLQRFTTMP
FLFCNINNVCNFASRNDYSYWLSTPALMPMDMAPISGRALEPYISRCTVCEGPAMAIAVH
SQTTAIPPCPQDWVSLWKGFSFIMFTSAGSEGAGQALASPGSCLEEFRASPFIECHGRGT
CNYYSNSYSFWLASLNPERMFRKPIPSTVKAGDLEKIISRCQVCMKKRH
NT seq 5010 nt
atgcactccaagactgctccaaggttcctggtgttcctgctgcttaccctgctgctactc
ctggctgcctctcctgtggctagcaagggctgtgtctgcaaaggcaaaggacaatgcctc
tgcgcggggaccaaaggggagaagggggagaaaggggttcctggttcccctggatttcct
ggccagaaaggatttccaggtcctgaaggcttgcctggaccacagggacccaagggctcc
ccaggacttccgggactcactggccccaaaggcatcaggggaataactggattaccaggg
tttgcaggtcctcctggacttccaggcctcccaggccaccctgggcctcgtgggctggct
ggtttaccaggatgcaacggatctaagggtgaacaaggattcccgggctttcccggcaca
ccaggctatgcagggctcccaggtcctgatggcttgaaaggacaaaagggtgagcctgct
caaggagaagacaggggattcaatggaaaaggtgaccctgggcctccaggggttccaggc
ttccagggttttccgggactcccaggttttccagggcctgccggtccaccaggacctccg
ggattttttggtttaccaggagcaatgggacccagaggacccaaaggccacatgggcgat
agcgtgataggacaaaaaggagaaaggggtatgaaaggattaacaggacctcctgggcca
ccaggaacagtgatttttacactcacccagccatacaacaaatcggacttcaagggagag
aaaggagatgagggagagaggggtgaacccggacctcctggaccctcggggccacctgga
gactcctatggatcagaaaagggtgcgcctggagagcctggtcctcggggcaaacctgga
aaagatggtgcccctggcttccctggcactgagggagccaaaggcaacaggggctttcct
gggttgaggggtgaagctggcattaagggaaggaaaggagacattggccccccaggattt
cctggtccaacagaatattatgacgcatacctggaaaagggagagagaggaatgccaggc
cttcctgggcccaaaggagcccgtggcccacagggtccaagtggtccccctggagtccct
ggaagtcctggactgtcaaggcctggcctcagaggacccattggatggccaggcttgaaa
gggagtaaaggagagagaggaccccctggaaaagacactgtgggccctcctggacccctg
ggatgtcctggctcaccaggtccaccaggccctccaggacctccaggatgtccaggtgac
attgtttttaaatgttcccctggtgagcatggaatgccaggtgacacaggacctccagga
gttccaggacttgatggacccaaaggagaaccaggcagtccatgcaccgagtgtcactgc
ttcccagggccccctggagtcccaggatttccaggattagatggtatcaaaggaatccca
ggaggacgaggtgtgcctggtctgaagggaaacccagggtccccaggaagtgcaggtctc
ccaggatttgcaggattcccaggtgaccaagggcatccaggacttaaaggggacaaaggg
gatacacctctaccctgggggcaagtgggtaatccaggtgatcctggactcagaggcctg
cccgggagaaagggcttcgatggaactcctggaggtccaggagcgaaaggaccaccagga
cctcagggcgaaccggccctgagtggaaggaaaggggaccaaggacctccaggacctcct
ggattccctggacccccaggacctgcaggaccagctggaccaccaggctatggacctcaa
ggagagccaggtccaaagggagcccaaggagtccccggcgtcctgggaccacctggagaa
gccggtcttaaaggagaacccagtacatcaactccagatctaggtcccccaggccctcca
gggccccctggtcaggctggccccagaggtctacctggtttgcctggaccggtgggaaaa
tgtgatcctggtcttcctggacctgatggtgagccaggaattccagaagctggatgccca
gggcctccaggacctaagggaaatcaaggtttcccagggacaaaaggatcacctggttgt
cctggggaaatgggaaagcctggccgtcctggagaaccaggcatcccaggagccaaggga
gaaccatccgtaggcaggcctgggaaaccagggaaaccaggctttccaggagaaagaggc
aatgctggggaaaatggagatattggactccctgggcttccaggcctccctggaactcca
ggaagaggcgggcttgacgggcctccaggagacccaggacagcctggatcacctggagca
aaaggatccccaggaaggtgcataccaggacccaggggcacccaaggacttcccggcttg
aatggattgaaagggcaaccaggtcgaagaggtgataccgggccaaagggagaccctggc
atcccaggcatggacaggtctggggtccccggagatcctggaccaccaggaacaccaggt
tgtcctggtgagatgggaccacctggacaaaaaggatatccgggagctccaggattccca
ggaccaccaggcgagaaaggagaggttggaatgatgggctatcctggaaccactggccct
ccggggcttcctgggaaaccaggctcacaggggcagcgaggtagcctcggaatcccagga
atgaaaggggaaaaaggacgcccaggagccaaaggcgaacgaggagagaaaggaaaacca
gggccttctcaaacaacactcttaaagggagacaaaggagagcctggacttaaaggattt
gttgggaatccaggtgagaaaggaaacagaggcaacccagggttaccaggtccgaaaggc
ctcgagggattgcctgggctaccaggtcctccaggccctagaggagatacgggaagcaga
ggaaatcctggaagaccaggaccacacggcatgccaggaagcatggggatcatgggggtg
ccaggccctaaagggagaaaaggaacttcagggcttccaggtctagctggaagaccaggc
ctgacagggatccatggtccccaaggagataagggggagccgggttattcagaaggtgca
agaccaggaccaccaggaccaaagggcgatccaggattgccaggtgacaaaggaaagaaa
ggagaaagaggcgtacctgggccacctggacaatcggggcctgccggacctgatggagcc
cctgggagtcctgggagtcctggtcacccaggaaaaccaggtcctgctggtgatttgggt
ctgaaaggacagaaaggcttcccaggccctccaggaagcactggccctccaggccctcca
ggactcccaggacttcctgggccaatgggtatgagaggtgaccaaggacgtgatggaatt
cctggtcctccaggagaaaaaggagaaacaggcttgctgggggcctacccaggcccaaaa
gggagccctggtgtaccaggtgccaaaggagacaggggagttccaggcttgtctggcctt
cccggcaggaagggggttatgggggatgttggaccccaaggacccccaggcactgctgga
ctcccagggccaccaggtctacctggggcaattatccctggccccaaaggagacagaggt
cttcctggcttaagaggaaatccaggtgagccaggtccccctggacctccaggacctatc
ggaaaaggcataaaaggtgacaaaggatttatgggcccacctggccccaaaggcctgcct
ggaactgtaggggacatgggcccaccaggtttcccgggagcaccaggtacccccggtctt
cccggtgtcagaggtgatccaggattccctggatttccaggcataaaaggagaaaagggt
aaccctggatttcttgggccaattggacatccaggaccagttgggcccaaaggaccacca
ggtccacgtggaaaacctggcactcttaaggtcatctctcttccgggaagcccagggcca
cctggtgtacctggacagccaggaatgaaaggagaccctggacccctgggactgccagga
atcccaggaccctgtgggccaagaggtaaaccaggcaaggatgggaaaccaggaactcca
ggaccagctggtacaaagggcaacaagggcttgaaaggacagcaaggcccacctggtctg
gatggattgccaggcttaaagggaaatcctggtgacaggggaacgccggccactggtaca
agaatgcgaggcttcatcttcacccgacacagtcaaaccacggccattccttcatgccct
gaaggaacacagccactctatagtgggttttctcttctttttgtacaaggaaacaaacgt
gcacatggacaagacctaggtactctgggcagctgcctgcagcgattcaccacaatgccg
ttcttattctgtaacatcaataatgtatgtaactttgcatcacgaaatgattattcatac
tggctgtcaacaccagctctgatgccaatggacatggctccaattagtggcagagctctc
gaaccctatattagcagatgcaccgtctgtgaaggtccagcaatggccatagctgttcac
agtcaaactactgctatccctccgtgtccccaggactgggtttctctctggaaaggtttt
tctttcattatgttcacaagtgcaggctctgagggtgctggacaagcacttgcctcgcct
ggctcctgcctggaagaattccgagccagtccatttatagaatgccatggacgagggaca
tgtaactactactcaaactcctacagtttctggctggcttcgctgaacccagaaagaatg
ttcagaaaacctattccatcaactgtgaaagctggagacttagagaaaatcataagccgc
tgtcaggtgtgcatgaagaaaagacattga

KEGG   Mus musculus (house mouse): 12829
Entry
12829             CDS       T01002                                 
Symbol
Col4a4, E130010M05Rik, [a]4(IV)
Name
(RefSeq) collagen, type IV, alpha 4
  KO
K06237  collagen type IV alpha
Organism
mmu  Mus musculus (house mouse)
Pathway
mmu04151  PI3K-Akt signaling pathway
mmu04382  Cornified envelope formation
mmu04510  Focal adhesion
mmu04512  ECM-receptor interaction
mmu04820  Cytoskeleton in muscle cells
mmu04926  Relaxin signaling pathway
mmu04933  AGE-RAGE signaling pathway in diabetic complications
mmu04974  Protein digestion and absorption
mmu05146  Amoebiasis
mmu05165  Human papillomavirus infection
mmu05200  Pathways in cancer
mmu05222  Small cell lung cancer
Brite
KEGG Orthology (KO) [BR:mmu00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    12829 (Col4a4)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    12829 (Col4a4)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    12829 (Col4a4)
  09142 Cell motility
   04820 Cytoskeleton in muscle cells
    12829 (Col4a4)
 09150 Organismal Systems
  09152 Endocrine system
   04926 Relaxin signaling pathway
    12829 (Col4a4)
  09154 Digestive system
   04974 Protein digestion and absorption
    12829 (Col4a4)
  09158 Development and regeneration
   04382 Cornified envelope formation
    12829 (Col4a4)
 09160 Human Diseases
  09161 Cancer: overview
   05200 Pathways in cancer
    12829 (Col4a4)
  09162 Cancer: specific types
   05222 Small cell lung cancer
    12829 (Col4a4)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    12829 (Col4a4)
  09174 Infectious disease: parasitic
   05146 Amoebiasis
    12829 (Col4a4)
  09167 Endocrine and metabolic disease
   04933 AGE-RAGE signaling pathway in diabetic complications
    12829 (Col4a4)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   04147 Exosome [BR:mmu04147]
    12829 (Col4a4)
   00536 Glycosaminoglycan binding proteins [BR:mmu00536]
    12829 (Col4a4)
Exosome [BR:mmu04147]
 Exosomal proteins
  Exosomal proteins of other cancer cells
   12829 (Col4a4)
Glycosaminoglycan binding proteins [BR:mmu00536]
 Heparan sulfate / Heparin
  Extracellular matrix molecules
   12829 (Col4a4)
SSDB
Motif
Pfam: Collagen C4
Other DBs
NCBI-GeneID: 12829
NCBI-ProteinID: NP_031761
MGI: 104687
Ensembl: ENSMUSG00000067158
UniProt: Q9QZR9
LinkDB
Position
1:complement(82426140..82564570)
AA seq 1682 aa
MRCFFRWTKSFVTAPWSLIFILFTIQYEYGSGKKYGGPCGGRNCSVCQCFPEKGSRGHPG
PLGPQGPIGPLGPLGPIGIPGEKGERGDSGSPGPPGEKGDKGPTGVPGFPGVDGVPGHPG
PPGPRGKPGVDGYNGSRGDPGYPGERGAPGPGGPPGQPGENGEKGRSVYITGGVKGIQGD
RGDPGPPGLPGSRGAQGSPGPMGHAGAPGLAGPIGHPGSPGLKGNPATGLKGQRGEPGEV
GQRGPPGPTLLVQPPDLSIYKGEKGVKGMPGMIGPPGPPGRKGAPGVGIKGEKGIPGFPG
PRGEPGSHGPPGFPGFKGIQGAAGEPGLFGFLGPKGDLGDRGYPGPPGILLTPAPPLKGV
PGDPGPPGYYGEIGDVGLPGPPGPPGRPGETCPGMMGPPGPPGVPGPPGFPGEAGVPGRL
DCAPGKPGKPGLPGLPGAPGPEGPPGSDVIYCRPGCPGPMGEKGKVGPPGRRGAKGAKGN
KGLCTCPPGPMGPPGPPGPPGRQGSKGDLGLPGWHGEKGDPGQPGAEGPPGPPGRPGAMG
PPGHKGEKGDMVISRVKGQKGERGLDGPPGFPGPHGQDGGDGRPGERGDPGPRGDHKDAA
PGERGLPGLPGPPGRTGPEGPPGLGFPGPPGQRGLPGEPGRPGTRGFDGTKGQKGDSILC
NVSYPGKPGLPGLDGPPGLKGFPGPPGAPGMRCPDGQKGQRGKPGMSGIPGPPGFRGDMG
DPGIKGEKGTSPIGPPGPPGSPGKDGQKGIPGDPAFGDPGPPGERGLPGAPGMKGQKGHP
GCPGAGGPPGIPGSPGLKGPKGREGSRGFPGIPGSPGHSCERGAPGIPGQPGLPGTPGDP
GAPGWKGQPGDMGPSGPAGMKGLPGLPGLPGADGLRGPPGIPGPNGEDGLPGLPGLKGLP
GLPGFPGFPGERGKPGPDGEPGRKGEVGEKGWPGLKGDLGERGAKGDRGLPGDAGEAVTS
RKGEPGDAGPPGDGGFSGERGDKGSSGMRGGRGDPGRDGLPGLHRGQPGIDGPPGPPGPP
GPPGSPGLRGVIGFPGFPGDQGDPGSPGPPGFPGDDGARGPKGYKGDPASQCGPPGPKGE
PGSPGYQGRTGVPGEKGFPGDEGPRGPPGRPGQPGSFGPPGCPGDPGMPGLKGHPGEVGD
PGPRGDAGDFGRPGPAGVKGPLGSPGLNGLHGLKGEKGTKGASGLLEMGPPGPMGMPGQK
GEKGDPGSPGISPPGLPGEKGFPGPPGRPGPPGPAGAPGRAAKGDIPDPGPPGDRGPPGP
DGPRGVPGPPGSPGNVDLLKGDPGDCGLPGPPGSRGPPGPPGCQGPPGCDGKDGQKGPMG
LPGLPGPPGLPGAPGEKGLPGPPGRKGPVGPPGCRGEPGPPADVDSCPRIPGLPGVPGPR
GPEGAMGEPGRRGLPGPGCKGEPGPDGRRGQDGIPGSPGPPGRKGDTGEAGCPGAPGPPG
PTGDPGPKGFGPGSLSGFLLVLHSQTDQEPACPVGMPRLWTGYSLLYMEGQEKAHNQDLG
LAGSCLPVFSTLPFAYCNIHQVCHYAQRNDRSYWLSSAAPLPMMPLSEEEIRSYISRCAV
CEAPAQAVAVHSQDQSIPPCPRTWRSLWIGYSFLMHTGAGDQGGGQALMSPGSCLEDFRA
APFVECQGRQGTCHFFANEYSFWLTTVNPDLQFASGPSPDTLKEVQAQRRKISRCQVCMK
HS
NT seq 5049 nt
atgaggtgctttttcagatggaccaagtcctttgtcacagccccctggtcactaatattt
atcctctttactatacaatatgaatatgggagtggcaagaagtatggcggcccctgcgga
ggaagaaactgctctgtctgccagtgttttcctgagaagggatctcggggtcacccagga
ccactgggaccacagggaccaatagggcccttgggacccctgggacctattgggattcca
ggagaaaaaggagagagaggagacagtggctcacctggaccaccaggagagaaaggagac
aagggtccaactggtgtccctggatttccaggtgtggatggtgtacctggtcacccaggg
cctcctggacccagaggcaaacctggtgtggacggctacaatggctcacgtggtgatccg
ggctatccaggagaaagaggagctccaggcccaggcggcccccctggtcagcctggggaa
aatggagaaaaaggaagatctgtgtacattactggtggcgttaaaggtattcagggagac
cgtggggacccaggaccacccggcttaccaggatctaggggtgcacaagggtcaccaggg
cccatggggcatgcaggcgcaccagggctagcaggacctataggtcatcctggaagccct
gggttgaagggaaatcctgccactggactgaagggacaaaggggagagccgggtgaagtt
ggccagcgtggtcctcctgggcccacgctgttggtacagccgcctgatttgagtatctat
aaaggagaaaagggtgtaaaaggaatgcctggaatgattggtcccccaggacctccaggt
cgcaagggagcgcctggtgtcgggatcaaaggagagaaaggtatccctgggttcccagga
ccccggggtgagcctggttcgcatgggcctccaggttttccaggattcaaggggatacaa
ggagcagctggagagcctgggctgttcggatttcttggtccaaagggggatctgggagat
cgcgggtacccgggaccaccaggcattttgttgactccagctccaccactcaaaggtgtt
ccaggggacccaggaccccctggctactatggggagattggagatgttggattaccaggt
cctccagggcccccaggcagaccaggggaaacttgtccaggcatgatgggacctcctgga
ccaccaggggttcctggtcctccaggatttccaggagaagctggggttcctggaagactc
gactgtgccccaggaaaacctgggaagccagggctacccggcctgcctggcgcaccagga
ccagaggggccccctggatcagatgtgatatactgtcgccctgggtgccctgggccaatg
ggtgaaaagggaaaagtgggtcctccaggaagaagaggagccaaaggagcaaaaggaaac
aagggactctgtacctgtccacctggtcccatgggaccccctggtcccccaggacctcct
ggaagacaaggaagtaaaggagacttagggctccctggttggcatggagaaaagggcgat
ccaggtcagcctggtgctgaaggaccgccagggccaccaggaagacctggtgccatgggg
ccccctggtcacaaaggggaaaagggtgacatggtcatatcaagagtgaaagggcaaaaa
ggagaaagaggactggatgggccaccaggatttccggggccacatggacaagatggtggg
gatggacgtcctggagaaagaggggatcctgggccacgaggggaccataaggatgcagcc
ccgggtgagagagggcttcccggactgccaggccctccaggaagaacaggacctgagggg
cctccaggcctgggatttccaggcccaccaggacagcgaggcctacccggagagcccggg
cgtccgggcaccagaggctttgatggcacgaaggggcagaaaggtgactccattctgtgt
aatgtaagctacccgggaaaaccaggtctgccaggtcttgatggacctccaggtctgaag
gggtttccaggccccccaggggctcctggtatgagatgtccagatgggcagaaaggccag
cggggcaaaccgggaatgtctgggattccaggtccacctggttttcgtggtgacatggga
gatccaggcatcaaaggtgaaaaggggacctcccctattggacccccaggcccacctggg
tctcctggaaaggatggccagaaaggaatcccaggagaccctgcatttggtgacccagga
ccccctggagagaggggtcttccaggagcaccaggcatgaaaggacagaaaggtcaccca
gggtgcccaggggctggaggccctcctggcatccctggatctccaggtctcaaaggtccc
aaaggtagagagggaagtcgtggctttccagggatcccaggatcccctggccattcctgt
gaaagaggtgctcccgggatcccaggacaaccgggacttcctggaactccaggtgatcca
ggggccccaggttggaaaggccagccaggagacatgggtccctctggaccagctgggatg
aagggcctccctggtctcccaggcctaccaggagctgatggtctccgagggcctcctggg
attccaggccccaatggggaagatggtctacctggtcttccaggcctgaagggactccca
gggctgcctggcttccctggttttccaggagagagaggaaagccagggccagatggagaa
cctggaagaaagggagaggttggagagaaaggctggcctggcttgaagggagacctggga
gagcgaggcgccaaaggagacagaggacttcctggtgatgcaggagaagcagtcacttcc
agaaagggtgaacctggggatgctgggcctccaggagatggtgggttctcaggagagaga
ggtgataaaggaagctcagggatgcgagggggaagaggggatcccggaagagatggacta
ccgggactccacagagggcagcctgggatagatgggcctcctgggcctcctgggccccct
ggccctccaggatcacctggactgagaggagtcattggtttcccaggatttccaggtgac
cagggtgatccaggttctccaggaccccctggattcccaggagatgatggagcaagagga
cctaaaggatacaaaggtgaccctgccagtcagtgtggtccacctggtccaaagggtgag
ccaggtagtcctggataccaaggacgtactggagtccccggagagaaaggctttcctgga
gatgaaggaccccgaggaccaccaggcagacctggacagcctggatcttttggaccacca
gggtgtccaggtgacccagggatgcccggactcaagggacatccaggagaagtgggagac
cctgggccaagaggtgatgcgggagatttcgggaggccaggtcctgctggagtgaaaggg
cctctcgggtctcctggtctgaatggcttacatggtctgaagggtgagaaaggaacaaaa
ggcgcctcaggtttgctcgaaatgggtccacctgggccaatggggatgcctgggcaaaaa
ggagagaaaggagaccctgggagtccagggatttctcctccaggtcttcctggagaaaag
ggcttcccaggacccccagggagaccaggaccacctggtcccgcaggtgccccaggaaga
gctgctaaaggtgacattcctgatccaggtccacctggagaccggggacctcctggcccc
gatggcccaagaggagtacctgggcctccgggttcccctgggaatgttgaccttctgaaa
ggggacccaggtgactgtggtttgccaggaccacctggctcccgaggcccaccaggccct
ccagggtgtcagggtcccccaggatgtgatggcaaagatggccagaaaggaccaatggga
cttccagggttgccagggccacctggtcttcctggggcacctggcgagaagggattacct
ggccctccaggcagaaaagggccagtaggtcctccaggctgcagaggtgaacctgggcct
cctgcagacgtggattcctgtcctcgaatcccaggacttcctggagtgccaggtcccaga
ggaccagaaggagctatgggggaacctggacggagaggcctgcctggaccagggtgcaaa
ggagagcctggaccggatggcagacggggccaggatggcatccctggatctcccgggcct
cctggacgcaaaggtgacactggagaggcagggtgtcctggagcaccaggccctcctggt
ccaactggtgatcctgggcccaaggggtttggccctggatcgctcagtggcttcctcctg
gttctccacagtcagactgaccaggaaccggcctgccccgtgggcatgcctcggctttgg
acagggtacagcttgttgtacatggaaggacaggagaaggcgcacaatcaagatcttggt
ttggcaggctcttgtcttcctgtgtttagcactctgccctttgcctactgcaacatccac
caagtgtgccactacgcccagagaaatgacaggtcctactggctgtccagtgcggctcct
cttcctatgatgccactctcagaggaagagatccgctcttacatcagccgctgtgcggtg
tgtgaggccccagcacaggccgtggcggtccacagtcaggaccaatccatcccaccgtgt
ccacggacatggaggagtctctggattgggtactcatttctgatgcacactggagctggg
gaccaaggaggtgggcaggcactgatgtcacctggcagctgcctagaggacttcagagct
gcaccgttcgttgaatgccagggccgacagggaacctgtcacttttttgcaaacgagtac
agcttctggctgaccaccgtgaacccagacctgcagtttgcctctggcccgtcaccagac
accttgaaagaagttcaggcccagcgccggaaaatcagcaggtgccaggtttgcatgaag
cacagctaa

KEGG   Mus musculus (house mouse): 12830
Entry
12830             CDS       T01002                                 
Symbol
Col4a5
Name
(RefSeq) collagen, type IV, alpha 5
  KO
K06237  collagen type IV alpha
Organism
mmu  Mus musculus (house mouse)
Pathway
mmu04151  PI3K-Akt signaling pathway
mmu04382  Cornified envelope formation
mmu04510  Focal adhesion
mmu04512  ECM-receptor interaction
mmu04820  Cytoskeleton in muscle cells
mmu04926  Relaxin signaling pathway
mmu04933  AGE-RAGE signaling pathway in diabetic complications
mmu04974  Protein digestion and absorption
mmu05146  Amoebiasis
mmu05165  Human papillomavirus infection
mmu05200  Pathways in cancer
mmu05222  Small cell lung cancer
Brite
KEGG Orthology (KO) [BR:mmu00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    12830 (Col4a5)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    12830 (Col4a5)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    12830 (Col4a5)
  09142 Cell motility
   04820 Cytoskeleton in muscle cells
    12830 (Col4a5)
 09150 Organismal Systems
  09152 Endocrine system
   04926 Relaxin signaling pathway
    12830 (Col4a5)
  09154 Digestive system
   04974 Protein digestion and absorption
    12830 (Col4a5)
  09158 Development and regeneration
   04382 Cornified envelope formation
    12830 (Col4a5)
 09160 Human Diseases
  09161 Cancer: overview
   05200 Pathways in cancer
    12830 (Col4a5)
  09162 Cancer: specific types
   05222 Small cell lung cancer
    12830 (Col4a5)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    12830 (Col4a5)
  09174 Infectious disease: parasitic
   05146 Amoebiasis
    12830 (Col4a5)
  09167 Endocrine and metabolic disease
   04933 AGE-RAGE signaling pathway in diabetic complications
    12830 (Col4a5)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   04147 Exosome [BR:mmu04147]
    12830 (Col4a5)
   00536 Glycosaminoglycan binding proteins [BR:mmu00536]
    12830 (Col4a5)
Exosome [BR:mmu04147]
 Exosomal proteins
  Exosomal proteins of other cancer cells
   12830 (Col4a5)
Glycosaminoglycan binding proteins [BR:mmu00536]
 Heparan sulfate / Heparin
  Extracellular matrix molecules
   12830 (Col4a5)
SSDB
Motif
Pfam: Collagen C4
Other DBs
NCBI-GeneID: 12830
NCBI-ProteinID: NP_001156627
MGI: 88456
Ensembl: ENSMUSG00000031274
UniProt: Q63ZW6
LinkDB
Position
X:140258367..140472232
AA seq 1691 aa
MQVRGVCLAAGWFLLALCLWGQPAEAAACHGCSSGSKCDCSGIKGEKGERGFPGLEGHPG
LPGFPGPEGPPGPRGQKGDDGIPGPPGPKGIRGPPGLPGFPGTPGLPGMPGHDGAPGPQG
IPGCNGTKGERGFPGSPGFPGLQGPPGPPGIPGMKGEPGSIIMSPLPGPKGNQGYPGPPG
IQGPPGPKGISGPSGPPGPPGLMGPPGPPGPPGPKGNMGLNFQGPKGEKGEQGLQGPPGP
PGQISEQKRPIDIEFQKGDQGVPGDRGPPGPPGIPGPPGPPGGVKGEKGEPGEAGKRGKP
GKDGETGQPGIPGLPGDPGYPGEPGRDGEKGQKGDIGPPGPPGLVIPAPGTGVTMGEKGN
IGLPGLPGEKGERGFPGIQGPPGFPGPPGTAVVGPPGPPGYPGERGQKGDEGPPGICIPG
SPGLDGQPGAPGLPGPPGPPGPQLPSRDEICKAGPPGPPGPPGDKGLQGERGVKGDKGDT
CFNCIGTGISGPPGQPGLPGLPGPPGSLGIPGEKGDKGQAGITGPKGLPGIPGPPGAPGF
PGSKGDPGDVLTLPGMKGDKGELGFPGAPGLPGLPGSPGKDGLPGLPGPKGEPGGITFKG
ERGPPGSPGLPGLPGNMGPTGPVGFGPPGPIGEKGIQGVAGNPGQPGLPGPKGDPGQTIT
QPGKPGLPGSPGRDGEVGLPGDPGLPGQPGLPGIPGSKGEPGIPGIGPPGPPGPKGFPGI
PGPPGAPGAPGRMGPDGPPGPPGFPGPKGEPGFGLPGPPGPPGFPGFKGTPGPKGDRGFP
GPSGPPGRTGLDGLPGPKGDVGPNGQPGPVGPPGLPGIGLQGPPGPPGTPGPIGQPGLHG
IPGEKGDPGPPGFDVPGLPGERGSPGLPGAPGLIGPPGSPGVPGKAGVPGFPGVKGEMGM
MGPPGPPGPLGIPGRSGAPGLKGDDGMQGQPGLPGPAGEKGGKGEPGLPGPPGPMHPDFL
GSKGEKGEPGLPGIPGLSGPKGYQGLPGDPGQPGLSGQPGSPGLPGPKGNPGLPGPPGLT
GPPGPKGNIGDMGFPGPQGVDGPPGPPGFPGQPGSPGLPGQKGSKGEPGVSGIGLPGLPG
PKGEPGLPGYPGNPGIKGSVGETGLPGLPGNPGAKGQPGLPGFPGTPGLPGPKGMNGPPG
NPGFPGEPGPVGAGGRPGPPGPPGEKGKPGQDGIPGPAGQKGEPGQPGFGIPGPPGLPGL
SGQKGDGGSPGLPGNPGLPGSKGEPGFQGFPGMPGPPGPPGSPGQALEGPKGNPGPQGPP
GRPGPPGFQGPPGTEGPRGPPGNGGIKGERGNPGPPGQPGLPGLKGDQGPPGLPGNPGRP
GLNGMKGDPGLPGVPGFPGMKGPIGVPGSTGPDGEPGLTGPPGPPGLPGPSGQSIVIKGD
PGPPGIPGQPGLKGLPGLPGPQGLPGPIGPPGDPGRNGLPGFDGAGGRKGDPGLPGQPGA
RGLDGPPGPDGLQGPPGPPGTTSVAHGFLITRHSQTTEAPQCPRGTVHIYEGFSLLYVQG
NKRAHGQDLGTAGSCLRRFSTMPFMFCNINNVCNFASRNDYSYWLSTPEPMPMNMEPLKG
QSIQPFISRCAVCEAPAVVIAVHSQTIQIPHCPQGWDSLWIGYSFMMHTSAGAEGSGQAL
ASPGSCLEEFRSAPFIECHGRGTCNYYANSYSFWLATVDMSDMFNKPQSETLKAGDLRTR
ISRCQVCMKRT
NT seq 5076 nt
atgcaagtgcgtggagtgtgcctggctgccggctggttcttactggccctgtgtctgtgg
gggcagcccgcagaggctgcggcctgccatgggtgttcttctggatcaaaatgtgactgc
agtggtataaaaggagaaaagggagaacgggggtttccaggtttagaaggtcatccaggt
ttaccaggatttccaggtccagaaggccctccaggacctcggggacaaaagggtgatgat
ggaattccagggccaccaggaccaaaaggaatcagaggtcctcctggacttcctggattt
ccagggacaccaggtcttcctgggatgccaggccatgatggggctccaggacctcaaggt
atccctggatgcaatggaaccaagggagaacgtggatttccaggcagtccgggctttcct
ggtttacaaggtcctccaggacctcctgggatcccaggtatgaagggggaaccaggcagt
ataatcatgtcaccactgccaggaccaaagggtaatcaaggatacccaggtcctcctgga
atacaaggcccacctggtcccaagggtatatcagggccaagtggtcctccaggaccacca
ggtttaatgggacctcctggcccaccaggcccaccgggaccaaaggggaatatgggctta
aatttccagggacccaaaggtgaaaaaggcgaacaaggtcttcagggcccacctggtcca
cctgggcagatcagcgaacagaaaagaccaattgacatagaatttcagaaaggagatcag
ggagttcctggtgatagagggcctcctggacctccaggaatacctggtcctcccggtcct
cctggtggtgtgaaaggtgaaaagggtgagccaggagaggcaggcaaaagaggtaaacca
ggcaaagatggagagactggccaaccaggaatcccaggtttgcctggtgatcctggttac
cctggagaaccaggaagagatggtgaaaagggtcaaaagggtgatattggcccacctgga
cctcccggacttgtaattcctgcacctggaactggtgtgactatgggagaaaaaggaaat
atcgggttacctggtttgcctggagaaaagggagagcgaggatttcctggaatacaaggt
ccacctggctttcctggacctccaggaacagcagttgtgggtccccctggtcctcctgga
tatcctggtgaaaggggccagaaaggtgatgaaggtccccctggaatttgtattcctgga
tctcctggacttgatggacagcctggggctcctggccttccaggacctcctggcccccct
ggcccccagttgccatccagagatgaaatctgtaaagcaggccctcctgggcctccagga
cctccaggtgataaaggactccaaggagagcgaggagtaaaaggtgacaaaggtgatact
tgcttcaactgtattggaactggcatttcagggcctccaggccaacctggtttaccaggt
ctcccaggtcctccaggatctcttggaatccctggagagaagggggacaaaggacaagct
gggataactggtccgaaaggattgccaggcatacctggacctccaggtgctccaggcttt
ccagggtctaagggtgaccctggggatgtcctcactcttccaggaatgaaaggtgacaaa
ggagagttgggtttccctggagctccagggcttcctggtctacctggtagtcctggaaag
gatggtttgccaggactccctggccccaaaggagagcctggtggaattacttttaagggt
gaaagaggtcctcctgggagcccaggtttaccaggtctcccaggaaatatgggacctaca
ggcccagttggttttgggcctccaggccccataggcgaaaaaggcatacaaggtgtggca
ggaaatccaggccagccgggattaccaggtcctaaaggagatccaggccagaccataacc
cagccaggaaagcctggtttgccaggtagtccaggcagagatggtgaagtgggtcttcca
ggtgatcctggacttccaggccaaccaggcctgccaggaatacctggtagcaaaggagaa
ccaggtatccctgggattgggcctcctggaccacctggtcctaaaggttttccaggaatt
ccgggacctccaggagctcctggggcccctggaagaatgggtccagatgggcctcctggg
ccaccaggcttcccaggaccaaagggagagccaggatttgggttacctgggccacctggg
ccaccaggattcccaggtttcaaaggaacacctggtccaaaaggtgatcgtggtttccca
ggaccttcaggtcctccaggacgcactggcttggatgggctacctggaccaaaaggtgat
gttggaccaaatggacaacctggcccagtagggcctccagggctgccaggaattggtctt
cagggaccaccaggaccaccagggactccagggccaataggccaacctggcttacatggg
attccaggagagaagggggacccaggacctcctgggtttgatgttccaggactccctgga
gagagaggtagtccagggctccctggagcccctggtcttataggacctccaggatcacct
ggtgtccctggaaaagcaggtgtccctggatttccaggtgtcaaaggcgaaatgggtatg
atgggacctccaggcccacctggacctttgggaattcctggcaggagtggtgctcctggt
cttaaaggtgatgatggcatgcaaggtcagccaggtctccctggccctgcaggagaaaag
ggcggtaagggagagcctggccttccaggtcctcctggaccaatgcatccagatttcctg
ggatcgaaaggagaaaagggtgagcccggcttaccaggtattcctggactttcaggacca
aaaggctaccagggattacctggagacccagggcaacctggactgagtggacaaccagga
tcaccaggactcccaggtcccaagggtaatcctggtctccctggtccaccaggacttaca
ggacctcctggaccaaaaggaaacattggtgacatgggttttcctggccctcaaggtgta
gatgggccccctggacctcctggatttcctggacaacctggctctcctgggttacctggc
cagaaaggaagtaaaggagagcctggggtttcaggcattggacttccaggtcttcctggc
ccaaagggtgaacctggtctgcctggatatccaggaaacccaggtatcaaagggtcggtg
ggagagactggtcttcctggattaccaggaaaccctggagcaaaaggtcaaccgggcctg
cctggattcccaggaacaccaggacttcctggaccaaaaggtatgaatggtcctcctggt
aaccctggctttcccggagaacctggtcctgtaggtgccggagggcgtcctggaccacca
ggacctcctggtgaaaaaggcaagccaggtcaagatggcattcctgggccagcaggacaa
aagggagaaccaggtcaaccaggctttggaatcccaggacctcctggactcccaggactt
tctggacaaaagggtgatggaggatcacctggacttccaggaaatcctggccttccaggt
tcaaagggtgagccaggctttcagggattccctggtatgccaggtcccccaggtcctcct
ggttctcctggtcaagctttggaaggtcccaaaggaaaccctgggcctcaaggccctcct
gggagaccaggtcctccaggttttcaaggtccaccaggtacagaaggtcctcgtggaccc
cctggaaatggaggtatcaaaggagagaggggaaatccaggcccacctgggcagccaggc
ttacctggtttgaaaggagatcaaggaccaccaggactcccgggtaaccctggccggcca
ggtcttaatggaatgaaaggagatcctggtctccctggtgttccaggattcccaggaatg
aaaggacccattggagtaccaggctccactggccctgatggggaaccaggtcttactggg
cccccaggtcctcctgggttgcctggtccttcaggacagagtattgtaatcaaaggagat
cctggacctccagggattcctggacaacctggattaaaaggtctaccaggacttccaggc
cctcaaggcctaccaggtccaattggccctccaggagatccaggacgcaatggactccca
ggctttgatggtgcaggagggcgcaaaggagacccagggctgccaggacagccaggtgcc
cgtggattggatggtcccccaggaccagatggattgcaaggacctccaggaccccctgga
accacctctgttgcccatggattcctcatcacacgtcacagtcaaacaacagaagcacca
caatgcccacggggaacagttcatatctatgaaggcttttctctcctgtatgtacaagga
aataaaagagcccatggtcaagacttgggtacggctggcagctgccttcgtcgcttcagt
accatgcctttcatgttctgcaacattaacaacgtttgtaactttgcttcaagaaatgac
tattcttactggctttccaccccagagcccatgccaatgaacatggaacccctgaaggga
cagagcatccagccattcattagtcgatgtgcagtgtgtgaggcgccagctgtggtgatc
gctgttcacagccagactattcagattccacactgtcctcaggggtgggactctctgtgg
attggctattccttcatgatgcatacaagtgcaggagcagaaggctctggccaagcccta
gcctctcctggctcctgtttggaagaatttcgttccgctcccttcatcgaatgtcatggg
cgagggacatgcaattactatgccaattcgtacagcttttggctggccactgttgatatg
tcagacatgttcaacaaacctcagtcagaaaccctgaaagcaggagacttgaggacgcgt
attagccgatgtcaagtatgcatgaagaggacataa

KEGG   Mus musculus (house mouse): 12833
Entry
12833             CDS       T01002                                 
Symbol
Col6a1, Col6a-1
Name
(RefSeq) collagen, type VI, alpha 1
  KO
K06238  collagen type VI alpha
Organism
mmu  Mus musculus (house mouse)
Pathway
mmu04151  PI3K-Akt signaling pathway
mmu04510  Focal adhesion
mmu04512  ECM-receptor interaction
mmu04820  Cytoskeleton in muscle cells
mmu04974  Protein digestion and absorption
mmu05165  Human papillomavirus infection
Brite
KEGG Orthology (KO) [BR:mmu00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    12833 (Col6a1)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    12833 (Col6a1)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    12833 (Col6a1)
  09142 Cell motility
   04820 Cytoskeleton in muscle cells
    12833 (Col6a1)
 09150 Organismal Systems
  09154 Digestive system
   04974 Protein digestion and absorption
    12833 (Col6a1)
 09160 Human Diseases
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    12833 (Col6a1)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   04147 Exosome [BR:mmu04147]
    12833 (Col6a1)
   00536 Glycosaminoglycan binding proteins [BR:mmu00536]
    12833 (Col6a1)
Exosome [BR:mmu04147]
 Exosomal proteins
  Exosomal proteins of other body fluids (saliva and urine)
   12833 (Col6a1)
Glycosaminoglycan binding proteins [BR:mmu00536]
 Heparan sulfate / Heparin
  Extracellular matrix molecules
   12833 (Col6a1)
 Hyaluronan
  Extracellular matrix or blood plasma proteins
   12833 (Col6a1)
SSDB
Motif
Pfam: VWA Collagen VWA_2 VWA_3
Other DBs
NCBI-GeneID: 12833
NCBI-ProteinID: NP_034063
MGI: 88459
Ensembl: ENSMUSG00000001119
UniProt: Q04857 B0LAD9
LinkDB
Position
10:complement(76544626..76561869)
AA seq 1025 aa
MRLAHALLPLLLQACWVATQDIQGSKAIAFQDCPVDLFFVLDTSESVALRLKPYGALVDK
VKSFTKRFIDNLRDRYYRCDRNLVWNAGALHYSDEVEIIRGLTRMPSGRDELKASVDAVK
YFGKGTYTDCAIKKGLEELLIGGSHLKENKYLIVVTDGHPLEGYKEPCGGLEDAVNEAKH
LGIKVFSVAITPDHLEPRLSIIATDHTYRRNFTAADWGHSRDAEEVISQTIDTIVDMIKN
NVEQVCCSFECQAARGPPGPRGDPGYEGERGKPGLPGEKGEAGDPGRPGDLGPVGYQGMK
GEKGSRGEKGSRGPKGYKGEKGKRGIDGVDGMKGETGYPGLPGCKGSPGFDGIQGPPGPK
GDAGAFGMKGEKGEAGADGEAGRPGNSGSPGDEGDPGEPGPPGEKGEAGDEGNAGPDGAP
GERGGPGERGPRGTPGVRGPRGDPGEAGPQGDQGREGPVGIPGDSGEAGPIGPKGYRGDE
GPPGPEGLRGAPGPVGPPGDPGLMGERGEDGPPGNGTEGFPGFPGYPGNRGPPGLNGTKG
YPGLKGDEGEVGDPGEDNNDISPRGVKGAKGYRGPEGPQGPPGHVGPPGPDECEILDIIM
KMCSCCECTCGPIDILFVLDSSESIGLQNFEIAKDFIIKVIDRLSKDELVKFEPGQSHAG
VVQYSHNQMQEHVDMRSPNVRNAQDFKEAVKKLQWMAGGTFTGEALQYTRDRLLPPTQNN
RIALVITDGRSDTQRDTTPLSVLCGADIQVVSVGIKDVFGFVAGSDQLNVISCQGLSQGR
PGISLVKENYAELLDDGFLKNITAQICIDKKCPDYTCPITFSSPADITILLDSSASVGSH
NFETTKVFAKRLAERFLSAGRADPSQDVRVAVVQYSGQGQQQPGRAALQFLQNYTVLASS
VDSMDFINDATDVNDALSYVTRFYREASSGATKKRVLLFSDGNSQGATAEAIEKAVQEAQ
RAGIEIFVVVVGPQVNEPHIRVLVTGKTAEYDVAFGERHLFRVPNYQALLRGVLYQTVSR
KVALG
NT seq 3078 nt
atgaggctggcccacgctctgctgcccctgctgctacaagcctgctgggtggccacacag
gacatccagggctccaaagcgattgccttccaagactgccctgtggatctattcttcgtg
ctcgacacctcggagagtgtggccttgaggctgaaaccttatggggccttggtggacaag
gtgaagtccttcactaagcgcttcattgacaacctgagagacaggtactaccggtgtgac
cgcaacctggtttggaatgcgggtgcgctgcactacagtgacgaggtggagatcatccga
gggctcacgcgcatgcccagtggccgcgatgagctcaaggccagcgtggatgcggtcaag
tacttcgggaaaggcacctacaccgactgcgccattaagaaggggctggaggagctgctc
atagggggctcccacctgaaggagaacaagtacttgatcgtggtgaccgacgggcatcct
ctagagggctacaaggaaccatgcgggggtctggaagatgcagtaaatgaggccaaacac
ctgggcatcaaggtcttttctgtggccatcacacctgaccacctggagccacgtctaagt
atcattgccacagaccacacataccggcgcaatttcacggcagctgactgggggcatagc
cgcgatgcagaagaggtcatcagccagaccattgacaccattgtggacatgattaaaaat
aacgtggaacaagtgtgttgttcttttgagtgccaggctgccagaggacctccagggccc
cgaggcgaccctgggtatgagggggagcgaggaaagccaggtcttccgggagagaaggga
gaagctggagaccctggacgacctggggatcttggaccagtcgggtaccagggtatgaag
ggagaaaaggggagccgtggagagaagggttccagaggaccgaaaggttacaagggcgag
aaaggcaagcgcggaatcgacggggtcgacggcatgaagggagagacggggtacccagga
ctaccgggctgcaagggctccccaggatttgatggcattcaaggacccccgggtcccaag
ggtgatgctggtgcctttgggatgaagggagaaaagggtgaagctggagcagacggtgag
gctgggagaccagggaactcagggtcacctggagatgagggtgatcctggagagcctggt
ccccccggagaaaaaggagaggccggtgatgaaggaaatgctggcccagacggtgcccct
ggagagaggggtggccctggtgaaagaggacctcgggggacccctggtgtgagaggacca
aggggagacccgggtgaagctggaccacagggtgaccaaggaagagaggggcccgtcggc
atccctggagactcgggtgaggctggccccattggacctaaaggataccgaggtgatgag
ggtcctccaggtcctgagggcctcagaggagccccaggacctgttggtcctcctggagac
cccggactgatgggtgagagaggtgaggatggaccaccaggaaacggcacggaaggtttc
cccggcttccctgggtatccaggcaacagaggccctcctgggctaaatggcacaaaaggc
taccctggcctcaagggggatgagggtgaagtgggagacccaggagaggataacaacgac
atttcaccccgtggggtcaaaggggcaaagggataccgaggcccagaaggaccccaggga
cctccaggacatgtgggaccacctgggccagatgagtgtgagatcctggatatcatcatg
aaaatgtgctcctgctgtgagtgcacatgtggacccattgacatcctcttcgtgctggac
agctcggagagcattggcctacagaactttgagattgccaaggacttcatcatcaaggtc
attgaccggttgagcaaggatgagctggtcaaatttgagccagggcagtctcacgcgggc
gtggtacagtacagccacaaccagatgcaagagcacgtggacatgcggagccccaacgtc
cgcaacgcccaggacttcaaagaagctgtcaagaagctacaatggatggctggtggcaca
ttcaccggagaagcgctgcagtacacccgggaccggctactcccacccacacagaacaac
cgaattgccctggtcattacggatggacgttctgacactcaacgggacacgacacctctc
agtgtgctctgtggtgcagacattcaggtagtttctgtgggaatcaaggatgtgtttggc
tttgtggcgggctccgaccagctcaatgtcatttcctgccaaggcttatcgcaaggtcgg
ccaggtatctccctggtgaaggagaactatgcagagcttctcgatgacggctttctgaag
aacataacagcccagatctgtatagataagaagtgtccggattatacctgtccaatcaca
ttctcctccccggctgacatcaccatcctgctagacagctcagccagtgtcggcagccac
aacttcgaaaccaccaaggtcttcgccaagcgcctagctgagcgattcctgtcagcaggc
agggcggatccttcccaggatgtgcgggtggccgtggtacagtatagtggccaggggcag
caacagccaggtcgggcggctcttcagttcttacagaattacacagtgctggccagctct
gtggacagcatggatttcatcaacgacgccacagacgtcaacgatgctctgagctacgtg
actcgtttctaccgggaagcctcgtcaggtgccaccaagaagagagtgctgttgttttca
gacggcaactctcagggggccacagcagaggccattgagaaggctgtgcaggaggcccag
cgtgcaggcattgagatctttgtggtggtggtgggaccccaggtgaacgagccccacatc
cgtgtgcttgtcactggcaagactgcagagtacgacgtggcctttggcgagcgccaccta
ttccgtgtaccaaactaccaggccctgctacgtggcgtactctaccagacagtctccagg
aaggtggcactgggctag

KEGG   Mus musculus (house mouse): 12834
Entry
12834             CDS       T01002                                 
Symbol
Col6a2, Col6a-2
Name
(RefSeq) collagen, type VI, alpha 2
  KO
K06238  collagen type VI alpha
Organism
mmu  Mus musculus (house mouse)
Pathway
mmu04151  PI3K-Akt signaling pathway
mmu04510  Focal adhesion
mmu04512  ECM-receptor interaction
mmu04820  Cytoskeleton in muscle cells
mmu04974  Protein digestion and absorption
mmu05165  Human papillomavirus infection
Brite
KEGG Orthology (KO) [BR:mmu00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    12834 (Col6a2)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    12834 (Col6a2)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    12834 (Col6a2)
  09142 Cell motility
   04820 Cytoskeleton in muscle cells
    12834 (Col6a2)
 09150 Organismal Systems
  09154 Digestive system
   04974 Protein digestion and absorption
    12834 (Col6a2)
 09160 Human Diseases
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    12834 (Col6a2)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   04147 Exosome [BR:mmu04147]
    12834 (Col6a2)
   00536 Glycosaminoglycan binding proteins [BR:mmu00536]
    12834 (Col6a2)
Exosome [BR:mmu04147]
 Exosomal proteins
  Exosomal proteins of other body fluids (saliva and urine)
   12834 (Col6a2)
Glycosaminoglycan binding proteins [BR:mmu00536]
 Heparan sulfate / Heparin
  Extracellular matrix molecules
   12834 (Col6a2)
 Hyaluronan
  Extracellular matrix or blood plasma proteins
   12834 (Col6a2)
Motif
Pfam: VWA Collagen VWA_2 Ins134_P3_kin_N
Other DBs
NCBI-GeneID: 12834
NCBI-ProteinID: NP_666119
MGI: 88460
Ensembl: ENSMUSG00000020241
UniProt: Q02788
Position
10:complement(76431590..76459464)
AA seq 1034 aa
MTTIKMLQGPLSVLLIGGLLGVLHAQQQEAISPQEQEAVSPDISTTERNNNCPEKADCPV
NVYFVLDTSESVAMQSPTDSLLYHMQQFVPQFISQLQNEFYLDQVALSWRYGGLHFSDQV
EVFSPPGSDRASFTKSLQGIRSFRRGTFTDCALANMTQQIRQHVGKGVVNFAVVITDGHV
TGSPCGGIKMQAERAREEGIRLFAVAPNRNLNEQGLRDIANSPHELYRNNYATMRPDSTE
IDQDTINRIIKVMKHEAYGECYKVSCLEIPGPHGPKGYRGQKGAKGNMGEPGEPGQKGRQ
GDPGIEGPIGFPGPKGVPGFKGEKGEFGSDGRKGAPGLAGKNGTDGQKGKLGRIGPPGCK
GDPGSRGPDGYPGEAGSPGERGDQGAKGDSGRPGRRGPPGDPGDKGSKGYQGNNGAPGSP
GVKGGKGGPGPRGPKGEPGRRGDPGTKGGPGSDGPKGEKGDPGPEGPRGLAGEVGSKGAK
GDRGLPGPRGPQGALGEPGKQGSRGDPGDAGPRGDSGQPGPKGDPGRPGFSYPGPRGTPG
EKGEPGPPGPEGGRGDFGLKGTPGRKGDKGEPADPGPPGEPGPRGPRGIPGPEGEPGPPG
DPGLTECDVMTYVRETCGCCDCEKRCGALDVVFVIDSSESIGYTNFTLEKNFVINVVNRL
GAIAKDPKSETGTRVGVVQYSHEGTFEAIRLDDERVNSLSSFKEAVKNLEWIAGGTWTPS
ALKFAYNQLIKESRRQKTRVFAVVITDGRHDPRDDDLNLRALCDRDVTVTAIGIGDMFHE
THESENLYSIACDKPQQVRNMTLFSDLVAEKFIDDMEDVLCPDPQIVCPELPCQTELYVA
QCTQRPVDIVFLLDGSERLGEQNFHKVRRFVEDVSRRLTLARRDDDPLNARMALLQYGSQ
NQQQVAFPLTYNVTTIHEALERATYLNSFSHVGTGIVHAINNVVRGARGGARRHAELSFV
FLTDGVTGNDSLEESVHSMRKQNVVPTVVAVGGDVDMDVLTKISLGDRAAIFREKDFDSL
AQPSFFDRFIRWIC
NT seq 3105 nt
atgactactatcaagatgctccagggtcctctttctgtgctcctgattgggggactcttg
ggggtcctccatgcccagcagcaggaagccatctcaccccaggagcaggaagctgtctca
ccagacatctccaccactgaaaggaacaacaattgtccagagaaggccgactgcccagtc
aacgtgtatttcgtgttggacacctcagagagcgtggccatgcagtccccgacagacagc
ctgctctatcatatgcagcagttcgtaccgcagtttatcagccagctgcagaacgagttc
tacctggaccaggtggccctgagctggcgctacggtggtctacacttctcggaccaagtg
gaggtgttcagcccaccgggcagtgaccgggcctccttcactaagagcctacaaggcatc
cgctccttccgcaggggcaccttcactgactgtgcattggctaacatgacgcagcagatc
cggcagcacgtaggcaagggggtggtcaacttcgccgtggtcatcactgacggccacgtc
acgggcagtccgtgtgggggcatcaagatgcaggctgagcgtgcccgtgaagagggcatc
cggctcttcgctgtggcccctaacaggaacctaaacgaacaaggcctgagggacatcgct
aactctccacatgagctctaccgtaacaactacgccaccatgcgacccgactctaccgag
attgaccaggacaccatcaaccgcatcatcaaggtcatgaaacatgaagcctatggagag
tgctacaaggtgagctgcctggagattcctggaccccacggacccaagggttaccgagga
cagaagggtgccaagggcaacatgggtgaaccaggagagcctggacagaaaggacgacag
ggagaccccggcatcgaaggccccattggattcccgggaccgaagggtgtgcctggcttc
aagggagagaagggtgaatttggatcggatggtcggaagggagcgcctggcctagctggc
aagaatggaacagatggacagaagggcaaactgggccgcattgggcctcctggttgcaag
ggagaccccggaagtcggggccccgatggataccctggagaagctggaagcccaggcgag
cgaggagaccagggtgccaagggggactctggccgcccaggacgcaggggaccaccagga
gatcctggagacaaaggaagcaagggatatcaaggcaacaacggagcccctggaagcccg
ggagtgaaaggaggcaagggagggcctggcccccgtggaccaaaaggagagcctggacgc
agaggagaccccgggaccaagggcggccccggcagcgatggtccaaagggagagaaggga
gaccctggtcctgaggggcctcgaggcctggctggagaagttggcagtaaaggagccaag
ggagacagaggtttgcctggacccagaggcccccagggggctcttggagagccaggaaag
cagggatctcgaggagaccctggtgacgccggacctcgaggggattcaggacagccgggc
cccaagggcgatcctggaaggcctggattcagctacccgggacctcgagggacacccggt
gaaaaaggcgagcccggtccaccaggccctgagggaggccgaggagactttggtctgaaa
ggaacacccggacggaagggagataaaggggagccagctgatcctggtccccctggtgaa
cctggccctcgggggccaagaggaatcccaggacctgagggagaacccggccctccagga
gaccctggtctcacggaatgtgatgtcatgacctatgtgagggagacctgtggatgctgc
gactgtgagaagcgctgtggtgccctggatgtggtcttcgtcatcgacagttctgagagt
attggctacaccaacttcaccttggagaagaactttgtcatcaatgtggtcaacaggcta
ggtgccattgccaaggaccccaagtcagaaacaggcacacgtgtgggtgtggtgcagtac
agccacgagggcacctttgaggccatccggctggacgacgagcgagtcaactccctgtct
agtttcaaggaggctgtcaaaaaccttgaatggatcgccggtggcacttggacgccctct
gccctcaagtttgcctataatcagctcatcaaagaaagccggcgccagaagacccgggtg
ttcgcagtggtcatcacggatgggcgccatgacccccgagatgatgacctcaatcttcgg
gcactgtgtgaccgagatgtcactgtgacagccattggcatcggtgacatgttccacgag
actcatgagagtgagaacctctactccattgcctgtgacaagccacagcaagtgcgcaac
atgacgctgttctctgacctggtggccgagaagttcatcgatgacatggaagacgtcctt
tgtccagacccccagatcgtgtgtccagaacttccctgccaaacagagctctatgtggcc
cagtgcacacaacggcccgtggacattgtcttcctgctggatggctcggagcggctgggc
gagcagaacttccacaaggtgcggcgcttcgtggaggacgtgtcccggcgcctgactctg
gcccggagggatgatgacccactcaacgcccgcatggctctgttgcaatatggcagccag
aatcagcaacaggtggccttcccactgacctacaacgtgaccaccatccacgaggccctg
gagagggccacctacctcaattccttttctcacgtgggcacgggcatcgtacacgccatc
aacaacgtggtgcggggggcacggggtggggcgcggcgccacgcagagctctccttcgtc
ttcctcacggacggtgtcaccggcaatgacagcctggaggagtcagtgcactctatgcgt
aagcagaacgtggtgcccactgtggtcgctgtgggcggcgacgtggacatggatgtgctt
actaagatcagcctgggtgacagggcggccatcttccgggagaaagactttgacagtctg
gcccagcccagcttctttgacaggttcatccgctggatctgttag

KEGG   Mus musculus (house mouse): 12835
Entry
12835             CDS       T01002                                 
Symbol
Col6a3, Col6a-3
Name
(RefSeq) collagen, type VI, alpha 3
  KO
K06238  collagen type VI alpha
Organism
mmu  Mus musculus (house mouse)
Pathway
mmu04151  PI3K-Akt signaling pathway
mmu04510  Focal adhesion
mmu04512  ECM-receptor interaction
mmu04820  Cytoskeleton in muscle cells
mmu04974  Protein digestion and absorption
mmu05165  Human papillomavirus infection
Brite
KEGG Orthology (KO) [BR:mmu00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    12835 (Col6a3)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    12835 (Col6a3)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    12835 (Col6a3)
  09142 Cell motility
   04820 Cytoskeleton in muscle cells
    12835 (Col6a3)
 09150 Organismal Systems
  09154 Digestive system
   04974 Protein digestion and absorption
    12835 (Col6a3)
 09160 Human Diseases
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    12835 (Col6a3)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   04147 Exosome [BR:mmu04147]
    12835 (Col6a3)
   00536 Glycosaminoglycan binding proteins [BR:mmu00536]
    12835 (Col6a3)
Exosome [BR:mmu04147]
 Exosomal proteins
  Exosomal proteins of other body fluids (saliva and urine)
   12835 (Col6a3)
Glycosaminoglycan binding proteins [BR:mmu00536]
 Heparan sulfate / Heparin
  Extracellular matrix molecules
   12835 (Col6a3)
 Hyaluronan
  Extracellular matrix or blood plasma proteins
   12835 (Col6a3)
Motif
Pfam: VWA VWA_2 Collagen VWA_3 Kunitz_BPTI fn3
Other DBs
NCBI-GeneID: 12835
NCBI-ProteinID: NP_001229937
MGI: 88461
Ensembl: ENSMUSG00000048126
UniProt: Q9Z0I9
Position
1:complement(90694582..90771710)
AA seq 3284 aa
MRKHRHLPLVAVFSLLLSGIATTHAQQHGDVKNGAAADIVFLVDSSWSAGKDRFLLVQEF
LSDVVESLAVGDNDFHFALVRLNGNPHTEFLLNTYHSKQEVLSHIVNMSYIGGSNQTGKG
LEYVIHSHLTEASGSRAADGVPQVIIVLTDGQSEDGFALPSAELKSADVNVFAVGVEGAD
ERALGEVASEPLSMHVFNLENVTSLHGLVGNLVSCIHSSVNPERAGDKESLKDITAQDSA
DIIFLIDGSQNTGNANFDVIRDFLVNVLERLSVGNQQVQVGVVQYSEEPITMFSLNSYPS
KAAVLDAVKGLSLVGGESANIGQALDFVVENHFTRAGGSRVEEGVPQVLVLISAGPSSDE
IRDSVVALKQASVFSFGLGAQAASRAELQHIATDDSLVFTVPEFRSFGDLQEQILPYLVG
VAQRHIVLQPPAIVTQVMEVNKRDIVFLVDGSSSLGPSNFNAIRDFVTRVIQRLEIGQDL
VQVSVAQYADTVKPEFYLNSYTNKRDAITAVRKMRALNGSALYTGSSLDFVRNNLFTSSA
GHRAAEGVPKLLVLITGGKSLDEVSQPAQELKRGSIMALAVGSKAADEDELKEIAFDSSL
VFIPAEFRPAPLQNMLPSLMAPLRTLTGTTEVHVNKRDIIFLLDGSDNVGKNNFPYVRDF
VTNLVNSLDVGSDNIRVGLVQFSDTPVTEFSLDTYQTKSELLAHLRRLQLKGGSGLNAGS
ALSYIHANHFTEAGGSRTREHVPQLLLLLMAGPSEDAYLQAANALVRSGVLTFCVGTNRA
DKAELEHIAFNPSLVYLMDDFRSLPSLPQQLIQPLTTYVSGGVQEVPLSQPESKRDILFL
FDGSVNVLGQFPAVRDFLYRIIEELDVKPDGTRVAIAQFSDDVRLESRFSEHQTKAEILN
LVKKMKLKTGKALNLGYALDYALRNIFVRSAGSRIEDNVQQFLVLLVAGRSSDAVAGPAS
SLKQRGVVPFIFQAKNANPSELEQIVLSPAFILAAESLPKIGDLQSQIVSLLKAEQGSGP
VSGEKDVVFLIDGSEGVRSGFPLLKDFVQRVVESLDVGPDRVRVALVQYSDRTRPEFYLN
SHMDQQGVISAIRRLTLLGGPTPNTGAALEFVLRNILTSSTGSRIAEGVPQLLIVLTAEP
SGDDVRGPSVVLKQGGAVPIGIGIGNADISEMQTISFIPDFAVAIPTFRELGTIQQVISE
RVIQLNREELSSLKPILTPSTGAGVGSKKDVVFLIDGSRNAGPEFQYIRTLIERIVEYLD
IGFDTTRVAVIQFSEDSKMEFPLNAHFSKDEVQNAVRRLRPKGGSQVYIGNALEYVLKNI
FQRPLGSRIEEGVPQFLVLISSGKSDDEVDDSAVELKQFGVAPLTIARHTDQEELVKISL
SPEYVYSVSTFRELPRLEQKLLTPITTLTSQQIHQILASTRYPPSVVESDAADIVFLIDS
SDAVKPDGIAHIRDFVSRIVRRLNIGPSKVRIGVVQFSNDVFPEFYLKTHKSQSSVLEAI
RRLRFKGGSPLNTGRALEFVARNLFVKSAGSRIEDGVPQHLVLFLGGKSQDDVARHAQVI
SSSGIVSLGIGDRNIDRTDLQTITNDPRLVFTVREFRELPNIEERVMLSFGPSGATPQPP
GVDLPSPSRPEKKKADIVFLLDGSINFRRDSFQEVLRFASEIVDTVYEDGDSIRVGLVQY
NSDPTDEFFLRDFSTKRQIIDAINKVVYKGGRHANTRVGIEHLLRNHFVPEAGSRLDERV
PQIAFVITGGKSVEDAQDVSLALTQKGVKVFAVGVRNIDSEEVGKIASNSATAFRVGSVQ
ELSELSETVLETLHDAMHETLCPGVTDVSKACNLEVILGFDGSRDQNVFVSQKGLESKVD
IILNRISQIQRISCSGNQLPTVRVSVMANTPSGPVEAFDFAEYQPELFEKFRNMRSQRPY
VLTADTLKLYQNKFRQSSPDTVKVVIHFTDGADGDMADLYRASEELRQAGAQALILVGLE
RVANLERLMHLEFGRGFMYDRPLRLNLLDLDYELAEQLDNIAEKACCGVPCKCSGERGDR
GPIGSIGPKGISGEDGYRGYPGDEGGPGERGPPGVNGTQGFQGCPGQRGVKGSRGFPGEK
GELGEIGLDGLDGEEGDKGLPGSSGEKGSPGRRGDKGPKGDKGERGDVGIRGDPGDSGRD
SQQRGPKGETGDIGPMGLPGRDGIPGSPGDPGKDGGSGRRGPAGAKGNRGGPGQPGFEGE
QGTRGSQGPPGPIGPPGLIGEQGIPGPRGGGGTAGAPGERGRTGPLGRKGEPGEPGPKGS
IGNRGPRGETGDDGRDGVGSEGRRGKKGERGFPGYPGPKGTPGEPGADGPPGPKGIRGRR
GNSGPPGATGQKGDPGYPGPSGHKGNRGDSVDQCALIQSIRDKCPCCYGPLECPVFPTEL
AFALDTSEGVTQDTFSRMREVLLGIVGDLTIAESNCPRGARVAVVTYNNEVTTEIRFADS
KKKSALLDSIQNLQVALTSKQQSLETAMSFVARNTFKRVRSGFLMRKVAVFFSNKPTRAS
PQLREAVLKLSDAGITPLFLTSQEDRQLINALQINNTAVGHALVLPARRDLTDFLKNVLT
CHVCLDICNIDPSCGFGSWRPSFRDRRAAGSDVDIDLAFILDSSEATTLFQFNEMKKYIG
YVIRQLDLSPDPKASQHFARVAVVQQSTYESVDNASVPPVKVEFSLTDYGAKEKLLDFLS
RRMTQLQGTMGLGNAIEYTIENIFESAPNPRDLKIMVLMLTGDMQRQQLEEAQRAILQAK
CKGYFFVVLGIGRKVNIKEVYSFASEPNDVFFKFVDKSTELNEEPLMRFGRLLPSFVSSE
NAFYLPPDLRKQCDWFQGDQPVKNGVKFGHKQINPPHTANSSLTSKVVTTMKPVTTTKPT
AIVNLPPAKPAPARPAPAQPVLAKPDPAKPAQARPAPAKPASAKLVPPQPVHVQPAPAQT
ASVRPAPAKPAPPQPAAAKPVPAKPAVPAQPAPPQPAAAKPVPAKPAVPAQPAPPQPAAA
KPVPAKPAVPAQPAAAQPMPAQPVLTKSAAVKPASANKPVAAKPVATNTATATARPALAA
KPAAAKPAATRPLAAAIRPVATKPEAPRQQAKPAATKPATTKPLARVSREVQVSEVTENS
ARLHWERPEPSSSFFYDLTVTSAHDQSLVLRQNLTVTDRVIGGLLAGQLYHVVVVSYLQS
QVRAIYQGSFNTKKTQPPPLQAAHRASSSTINLMVNTEPLFLTKTDICKLSRDAGTCVDF
KLLWHYDLESKSCKRFWYGGCGGNENRFHSQEECEKMCSPELTV
NT seq 9855 nt
atgaggaaacaccggcatctgcccctagtggctgtctttagccttctcctctcaggcatt
gccacgactcatgcccaacagcatggagatgtcaaaaacggtgcggcggctgacatagtg
tttctagtggattcctcttggagtgctgggaaggaccgtttcctgcttgttcaagagttt
ctgtctgatgttgtagagtctttagctgtgggagacaatgattttcactttgctctggtc
cggctcaacggaaacccacataccgagttcctgttaaatacataccactctaaacaagaa
gtcctttctcatattgtgaacatgtcttacattgggggaagcaaccaaactggaaaagga
ttagaatacgtaattcacagccacctcactgaggcttccggaagccgggctgctgacgga
gtccctcaggttatcatagtattaactgatggacaatcggaggatggcttcgctctgccc
tcagcggaacttaagtctgcggacgttaatgtgtttgcagtcggagttgagggtgcagat
gaaagagccctaggagaagtagcaagcgaaccactcagtatgcatgttttcaacctagag
aacgttacctcactgcatggcctagtagggaacttagtgtcctgtatacactcctctgtg
aatccagaaagggctggggacaaggaaagccttaaagacatcacagcacaagactcagct
gacattattttccttattgatggatcacagaacaccggaaatgccaatttcgatgtcatt
cgagacttccttgtaaatgtccttgagagactctcagttggaaaccagcaggtccaagtg
ggggtggtacagtatagcgaggagcccataaccatgttctccttgaatagctacccctcc
aaggctgcagttctggatgccgtgaaagggctcagcttggttgggggggagtcggccaat
ataggccaggctcttgactttgtagtggagaatcacttcactcgggcaggtggcagccgc
gtggaggaaggggtcccccaggtgctggtcctcataagtgctgggccttctagtgatgag
attcgagattcggtggtagctctgaagcaggctagtgtgttctcgtttggcctcggagcc
caggcagcctccagggcagagcttcagcacatagctaccgatgacagcttggtgtttact
gtcccggaattccgtagctttggggatctccaggagcaaatactgccgtaccttgttggt
gtggcccaaaggcacattgtcttgcaaccaccagctattgtcacacaagtcatggaagtc
aacaagagggacatagtcttcctggtggatggttcctccagcctggggccatccaacttc
aatgccatccgagacttcgttaccagagtcatccagaggttggaaattggacaggacctg
gttcaggtatcagtggctcagtatgcagacactgtcaagcctgagttctatttaaattcc
tataccaacaaaagggatgccataactgctgtacggaagatgagggccctgaatggctca
gccctgtacacgggatcttctctggactttgttcgaaacaacctcttcactagctcagcc
ggccatagggccgctgagggggttcctaagctcctggtgctgatcacaggcggtaagtcc
ctcgatgaagtcagccagcctgcccaggagctgaagagaggcagcatcatggccttggcc
gttggcagcaaggccgccgatgaggatgagctgaaagagatcgctttcgactcctccctg
gtcttcattcccgctgagttccgccctgctcccttgcaaaacatgctgcccagcctgatg
gcacctctcaggactctcactggaaccacggaagttcacgtaaacaaacgggatatcatc
ttccttttggatggctccgacaacgttgggaaaaacaatttcccttatgtgcgtgacttt
gtaaccaacctagttaacagcctcgatgttggaagtgacaatatccgtgttggcttggtg
cagtttagtgacactccggtcacggagttctctctagacacttaccagaccaagtccgag
ctgctcgctcacctgaggcgcttgcagctcaaggggggctccggcttgaacgcaggctct
gccttgagctatatccatgccaatcacttcacggaggcaggaggcagcaggactcgtgaa
cacgtgcctcagctgctgctcctgctcatggccgggccctctgaggatgcctatttgcaa
gctgccaatgccctggtgcgctcaggcgtgctgaccttctgtgtggggaccaatcgggcg
gataaggctgagttagagcatattgcttttaatcccagcctggtatacctcatggacgat
ttccgctccctgccatctttgcctcaacagctgattcagcccctaaccacatatgttagt
ggaggtgtacaggaagttccactatcccagccagagagcaagcgggacattcttttcctc
tttgacggctcagtcaatgtcttgggtcagttccctgctgtcagagactttctctacagg
atcattgaggaacttgatgtgaagccggatgggacccgggtagcaatagctcagttcagt
gatgacgtcaggctggagtcccgcttcagtgagcaccagacgaaagctgaaatccttaat
ctcgtgaagaagatgaagctaaaaacaggcaaagccctcaacctgggctacgccctggac
tacgcgcttaggaacatctttgtgaggtcggctggcagccgcatcgaggataacgtgcag
cagttcctggtgctgctggttgctggaaggtcgtcagatgctgtggccgggcctgcaagc
tccctgaagcagagaggagtggtacctttcatctttcaagccaagaacgccaaccccagt
gagctggagcagattgtgctgtcgcctgcattcatcctggctgcagagtcactccctaag
atcggagacctccaatcacagattgtgagccttctgaaagcagaacagggttcaggccct
gtatcaggtgaaaaggatgtggtgtttctcattgatggctctgagggtgtccgcagtggc
ttccccctgctaaaggactttgtgcagagggttgtggagagcttggatgtgggtcccgac
cgtgtgcgtgtggcactggtgcagtacagtgaccggaccaggccggagttctacctgaat
tcccacatggaccagcagggtgtcatcagcgccatccgcaggctgacactgctgggcggc
ccaacccccaacacaggggcggcactggagttcgtgttaaggaacattctgaccagttct
actgggagcaggatagcagaaggtgttcctcagctcctgattgtcctcacagcagagcca
tcaggggatgacgtgcgaggcccttcagtagtcctgaagcagggcggggctgtgcccatt
ggcattggcattgggaatgctgacatctctgaaatgcagaccatctccttcatccctgac
tttgctgtggccatccccaccttccgggagcttgggacgatacagcaggtcatctctgag
agggtgatccagcttaatcgtgaggagctgagctcgttgaaacccattttgacgccctca
acaggtgcaggtgttggcagcaagaaagatgtggtctttctcatcgatggatcccgaaat
gctggccctgagttccagtacatccgtaccctgattgagaggatagtcgagtacctggac
ataggctttgacaccacccgggtagccgtcatccagttcagtgaggactccaagatggag
tttcctctgaatgcccacttcagcaaggatgaagtacagaacgctgtgcgcaggctgcgg
cccaagggtgggagtcaggtctacatcggaaatgccctggagtacgtgctgaagaacatc
ttccagaggccactggggagccggatagaagagggcgttccgcagttcctagtcctcatt
tcgtctggaaagtctgatgatgaggtggacgactcagccgtggaactcaagcagttcggt
gtggcccctctcaccatagccagacacacagaccaagaggagctggtgaagatctcccta
agtcctgagtacgtgtactccgtgagcaccttcagggagctgcccagactggagcagaag
ctgctgacgcccatcaccactctaacctcacagcagatccatcagatcctggccagtacc
cgatatcccccttcagtggttgaaagcgatgcggctgacatcgtctttctgattgacagc
tctgatgccgtcaagcccgatggcatcgctcatatccgagactttgtcagcaggatcgtt
cgcagactcaacattggtcccagtaaagtgaggattggggtcgtgcagtttagcaatgat
gtcttccctgagttctacctgaagacccacaagtcccagagcagtgtgcttgaagccata
cggcgtctgaggttcaaaggagggtctcccctaaacactggcagagccctggagtttgtg
gcaagaaacctcttcgtgaagtctgctgggagccggatagaagatggggtccctcaacac
ctggtcttattcctgggtgggaagtctcaggacgatgtggccaggcatgcccaagtcata
agctcatcggggattgtgagcttaggaataggagatcggaacattgataggacggacttg
caaaccatcaccaatgaccccaggctggtcttcacagttcgggaattcagagagctgccc
aacatagaagagagggtcatgctttcctttggaccctctggggctactcctcagcctcct
ggggtagatttaccttctccttcccggccagagaagaagaaggccgacattgtgttcctg
ctggatggttccatcaattttaggagggacagtttccaggaagtgctccgttttgcatct
gaaatcgtggacacggtctatgaagacggagattccatcagagtgggactggtccagtac
aactcagaccccacggatgaattcttcctgagggacttttctaccaaaaggcagattatc
gatgccatcaacaaagtcgtctataaaggagggaggcatgccaacaccagggtgggcatc
gagcacctgctgcggaatcactttgtgccggaggctggtagccgcctggatgagcgggtt
cctcagatcgccttcgtgatcaccggagggaagtcagtggaggatgctcaggatgtgagc
ctggctctcacccagaaaggtgtcaaggtgtttgccgttggcgtgaggaacatcgactct
gaggaggtgggcaagatagcctccaacagtgccacagccttccgggtgggcagcgtgcag
gaactgtccgaactgagtgagacagtgctcgagactctgcatgatgcaatgcatgagacc
ctctgccctggtgtgaccgatgtctccaaagcctgcaatctggaagtcattctggggttt
gatggatctcgagatcagaatgtgtttgtgagtcagaagggccttgagtccaaggtggac
atcatcttgaacagaatcagccagatacaaaggatcagctgcagtggcaaccagctgccc
actgtgagggtatccgtgatggcgaacacgccctctgggccagtggaggcctttgacttt
gccgagtatcagccggagctgtttgagaagtttcgcaacatgcgcagtcagcgcccatac
gtcctcacagctgacacgctgaagttgtaccagaacaagttcaggcagtcgtcgcctgat
actgtgaaggtagtcattcacttcacggatggagcagatggagacatggctgacttatac
agagcatctgaggaactccggcaagcaggtgcccaagccctgatcctggtgggtcttgaa
cgtgtggctaacctggagcggctgatgcatctggagttcgggcggggcttcatgtatgac
aggcccctgaggctcaacctgcttgacttagactatgaactggcagagcagcttgacaat
attgctgagaaagcttgctgtggggttccatgcaagtgctctggagagaggggagacaga
gggcccattggcagcattgggccaaagggtatctccggggaagatggctaccgaggctac
ccgggagacgaaggcggacctggtgagcgaggtccacctggtgtgaatggcacacaaggt
ttccagggctgtccaggccagaggggagtgaagggttctcgaggattcccaggagagaag
ggtgaattgggagaaatcggcttggatggtctggatggtgaagagggagacaaggggctg
cctggttcttctggagaaaaggggagtcctggaagaaggggtgacaaaggacccaaagga
gacaaaggcgagagaggagatgtcgggattcgaggtgacccgggtgactcaggacgggac
agccagcagagaggacccaaaggagagacgggagacatcggccccatgggtctcccaggc
agagatggcatccctggcagcccaggagaccctgggaaggacggtggctctggccgaagg
ggccctgcaggagctaagggcaacagaggtggtcctggccagccaggcttcgaaggcgag
cagggtaccagaggctcacagggcccacctggtcccattggtcccccaggcctgattggg
gaacaaggaattcccggacctcgaggaggcggaggtacggctggggctcctggagaacgt
ggcagaactggtcctctggggaggaagggtgagccaggagagccagggccgaagggaagc
atcgggaaccgtggaccccgaggagagacgggagatgacgggagagatggggttggcagt
gaaggacgcagaggcaaaaaaggagaaagaggcttccctgggtatccgggaccaaagggc
acccctggtgagccaggggcagatggaccaccaggacccaaaggcatcagaggtcgaagg
ggaaattcaggacctccaggggcaactggacaaaagggagaccctggttacccaggacca
tctggtcacaagggcaaccgaggcgactccgttgatcaatgcgccctcatccagagcatc
agagacaaatgcccttgctgctacgggcccctggagtgccctgtattcccaacagaactc
gcctttgccctggatacctctgagggggtgactcaggacaccttcagccggatgcgagag
gtgcttctgggcattgtgggcgatctgaccattgcagagagcaactgtccgcggggtgcc
cgcgtggctgtggtcacctataacaatgaggtgactacagagatccggtttgctgactcc
aagaagaagtctgccctcttagacagtattcagaacctccaagtggccttgacctctaag
cagcagagcctggagaccgcaatgtcattcgtggccaggaatacattcaaacgtgtgagg
agtggcttcctgatgaggaaagtggctgttttcttcagcaacaagcccaccagggcatcc
ccccagctccgagaggccgtactcaagctttccgatgcggggatcacacccttgttcctt
acaagccaggaggaccgacagctcatcaacgctttgcagatcaataacacagcagtggga
catgccctggttctgcctgcaaggagggatctcacagacttcctgaagaatgtccttacc
tgccatgtttgcttggacatttgcaatatcgacccatcctgtggcttcggcagctggagg
ccttccttcagggaccggagggcagcaggcagtgatgtggacatagacctggctttcatc
ttggacagctcagaggctaccacactgttccagttcaatgagatgaagaaatacataggc
tatgtgatcagacagctggacctgagcccagacccgaaggcttcccaacactttgccagg
gtggctgttgtgcagcagtcaacctacgagtctgtggacaatgccagtgtgccgcctgtg
aaggtggaattctcactgacagactatggtgccaaggaaaagctgttggatttccttagc
cggaggatgacccaacttcagggaaccatgggcctgggcaatgccattgaatacactata
gagaacatctttgaaagtgctccaaacccacgggacctcaaaattatggtgctgatgttg
actggtgacatgcagaggcagcagctggaggaggcccagagagccatcctccaggccaag
tgcaagggttacttctttgtggttctgggcattggcaggaaggtgaatatcaaggaggtc
tacagctttgccagtgagcccaatgatgtcttcttcaaatttgtggacaaatcaactgag
ctcaacgaggagcctctgatgcgctttgggaggctgctcccatcctttgtcagcagtgaa
aatgcattttacctgcctccagatctcaggaaacagtgtgattggttccaaggggaccaa
ccagtgaagaatggcgtgaagtttggtcacaaacagataaaccctccacatactgctaat
tcaagtcttacatccaaagttgtgaccacaatgaagccagtgacaacgaccaaaccaaca
gcaattgtaaacctgccacctgctaagccagccccagcgaggccagcccctgctcagcca
gttcttgctaagccagaccccgcaaaaccagcccaagcaaggccagcccctgctaagcca
gcttctgctaagttggtgcctcctcagccagtccatgttcagccagcccctgcccagaca
gcttctgtaagaccagcccccgctaagccagcccctcctcagccagctgctgctaagcca
gtccctgctaagccagcagtccctgctcagccagcccctcctcagccagctgctgctaag
ccagtccctgctaagccagcagtccctgctcagccagcccctcctcagccagctgctgct
aagccagtccctgctaagccagcagtccctgctcagccagccgctgctcaaccaatgcct
gctcagccagtccttactaaatcagccgctgtgaaaccagcctctgccaacaagcctgtg
gcagccaaacctgtggccacgaacacagccacagccacagccagacctgcattagcagcg
aagcctgctgcagcgaagcctgcagcaacgagaccccttgctgctgccataaggccagtt
gctaccaagccagaggcccccaggcaacaggccaaaccagctgccaccaagccagccact
actaagcccttggcgagagtgtcccgagaagtccaggtctctgaagtcacggagaacagt
gccagactccactgggagcggcccgagccttccagctcctttttttatgacctcactgta
acctcggcccacgaccagtccttggttctgagacaaaacctcactgtcactgaccgtgtc
atcggaggcctactggctggacagctgtaccatgtcgtagtggtctcctacctgcagtct
caggtcagggccatctaccaaggaagtttcaatacgaagaaaacccagcctccacctctt
caggcagcacaccgagcatccagttcaaccatcaacctcatggtgaacacagaaccattg
tttctcactaaaacagatatatgtaagctgtccagagatgctgggacttgtgtggacttc
aagttactatggcactatgacctagagagcaaaagttgcaagagattctggtatggaggt
tgtggaggcaacgagaacagattccactcccaggaagaatgtgaaaagatgtgtagtcct
gagttaacagtttga
