KEGG   Pan troglodytes (chimpanzee): 452659
Entry
452659            CDS       T01005                                 
Symbol
COL4A1
Name
(RefSeq) collagen alpha-1(IV) chain isoform X1
  KO
K06237  collagen type IV alpha
Organism
ptr  Pan troglodytes (chimpanzee)
Pathway
ptr04151  PI3K-Akt signaling pathway
ptr04382  Cornified envelope formation
ptr04510  Focal adhesion
ptr04512  ECM-receptor interaction
ptr04820  Cytoskeleton in muscle cells
ptr04926  Relaxin signaling pathway
ptr04933  AGE-RAGE signaling pathway in diabetic complications
ptr04974  Protein digestion and absorption
ptr05146  Amoebiasis
ptr05165  Human papillomavirus infection
ptr05200  Pathways in cancer
ptr05222  Small cell lung cancer
Brite
KEGG Orthology (KO) [BR:ptr00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    452659 (COL4A1)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    452659 (COL4A1)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    452659 (COL4A1)
  09142 Cell motility
   04820 Cytoskeleton in muscle cells
    452659 (COL4A1)
 09150 Organismal Systems
  09152 Endocrine system
   04926 Relaxin signaling pathway
    452659 (COL4A1)
  09154 Digestive system
   04974 Protein digestion and absorption
    452659 (COL4A1)
  09158 Development and regeneration
   04382 Cornified envelope formation
    452659 (COL4A1)
 09160 Human Diseases
  09161 Cancer: overview
   05200 Pathways in cancer
    452659 (COL4A1)
  09162 Cancer: specific types
   05222 Small cell lung cancer
    452659 (COL4A1)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    452659 (COL4A1)
  09174 Infectious disease: parasitic
   05146 Amoebiasis
    452659 (COL4A1)
  09167 Endocrine and metabolic disease
   04933 AGE-RAGE signaling pathway in diabetic complications
    452659 (COL4A1)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   04147 Exosome [BR:ptr04147]
    452659 (COL4A1)
   00536 Glycosaminoglycan binding proteins [BR:ptr00536]
    452659 (COL4A1)
Exosome [BR:ptr04147]
 Exosomal proteins
  Exosomal proteins of other cancer cells
   452659 (COL4A1)
Glycosaminoglycan binding proteins [BR:ptr00536]
 Heparan sulfate / Heparin
  Extracellular matrix molecules
   452659 (COL4A1)
SSDB
Motif
Pfam: Collagen C4
Other DBs
NCBI-GeneID: 452659
NCBI-ProteinID: XP_016781016
Ensembl: ENSPTRG00000006031
VGNC: 10713
UniProt: K7C8P4 A0A6D2XL77
LinkDB
Position
14:complement(115766711..115925159)
AA seq 1669 aa
MGPRLSVWLLLLPAALLLHEEHSRAAAKGGCAGSGCGKCDCHGVKGQKGERGLPGLQGVI
GFPGMQGPEGPQGPPGQKGDTGEPGLPGTKGTRGPPGASGYPGNPGLPGIPGQDGPPGPP
GIPGCNGTKGERGPLGPPGLPGFAGNPGPPGLPGMKGDPGEILGHVPGMLLKGERGFPGI
PGTPGPPGLPGLQGPVGPPGFTGPPGPPGPPGPPGEKGQMGLSFQGPKGDKGDQGVSGPP
GVPGQAQVQEKGDFATKGEKGQKGEPGFQGMPGVGEKGEPGKPGPRGKPGKDGDKGEKGS
PGFPGEPGYPGLIGRQGPQGEKGEAGPPGPPGIVIGTGPLGEKGERGYPGTPGPRGEPGP
KGFPGLPGQPGPPGLPVPGQAGAPGFPGERGEKGDRGFPGTSLPGPSGRDGLPGPPGSPG
PPGQPGYTNGIVECQPGPPGDQGPPGIPGQPGFIGEIGEKGQKGESCLICDIDGYRGPPG
PQGPPGEIGFPGQPGAKGDRGLPGRDGVAGVPGPQGTPGLIGQPGAKGEPGEIYFDLRLK
GDKGDPGFPGQPGMPGRAGSPGRDGHPGLPGPKGSPGSVGLKGERGPPGGVGFPGSRGDT
GPPGPPGYGPTGPIGDKGQAGFPGGPGSPGLPGPKGEPGKVVPLPGPPGAEGLPGSPGFP
GPQGDRGLPGTPGRPGLPGEKGAVGQPGIGFPGPPGPKGVDGLPGDVGPPGTPGRPGFNG
LPGNPGVQGQKGEPGVGLPGLKGLPGLPGIPGTPGEKGSIGVPGVPGEHGAIGPPGLQGI
RGEPGPPGLPGSVGSPGVPGIGPPGARGPPGGQGPPGLSGPPGIKGEKGFPGFPGLDMPG
PKGDKGAQGLPGITGQSGLPGLPGQQGAPGIPGFPGSKGEMGVMGTPGQPGSPGPVGAPG
LPGEKGDHGFPGSSGPRGDPGLKGDKGDVGLPGKPGSMDKVDMGSMKGQKGDQGEKGQIG
PIGEKGSRGDPGTPGVPGKDGQAGQPGQPGPKGDPGISGTPGAPGLPGPKGSVGGMGLPG
TPGEKGVPGIPGPQGSPGLPGDKGAKGEKGQAGPPGIGIPGLRGEKGDQGIAGFPGSPGE
KGEKGTIGIPGMPGSPGLKGSPGSVGYPGSPGLPGEKGDKGLPGLDGIPGVKGEAGLPGT
PGPTGPAGQKGEPGSDGIPGSAGEKGEPGLPGRGFPGFPGAKGDKGSKGEVGFPGLAGSP
GIPGSKGEQGFMGPPGPQGQPGLPGSPGHATEGPKGDRGPQGQPGLPGLPGPMGPPGLPG
IDGVKGDKGNPGWPGAPGVPGPKGDPGFQGMPGIGGSPGITGSKGDMGPPGVPGFQGPKG
LPGLQGIKGDQGDQGIPGAKGLPGPPGPPGPYDIIKGEPGLPGPEGPPGLKGLQGLPGPK
GQQGVTGLVGIPGPPGIPGFDGAPGQKGEMGPAGPTGPRGFPGPPGPDGLPGSMGPPGTP
SVDHGFLVTRHSQTIDDPQCPSGTKILYHGYSLLYVQGNERAHGQDLGTAGSCLRKFSTM
PFLFCNINNVCNFASRNDYSYWLSTPEPMPMSMAPITGENIRPFISRCAVCEAPAMVMAV
HSQTIQIPPCPSGWSSLWIGYSFVMHTSAGAEGSGQALASPGSCLEEFRSAPFIECHGRG
TCNYYANAYSFWLATIERSEMFKKPTPSTLKAGELRTHVSRCQVCMRRT
NT seq 5010 nt   +upstreamnt  +downstreamnt
atggggccccggctcagcgtctggctgctgctgctgcccgccgcccttctgctccacgag
gagcacagccgggccgctgcgaagggtggctgtgctggctctggctgtggcaaatgtgac
tgccatggagtgaagggacaaaagggtgaaagaggcctcccggggttacaaggtgtcatt
gggtttcctggaatgcaaggacctgaggggccacagggaccaccaggacaaaagggtgat
actggagaaccaggactacctggaacaaaagggacaagaggacctccgggagcatctggc
taccctggaaacccaggacttcccggaattcctggccaagacggcccgccaggcccccca
ggtattccaggatgcaatggcacaaagggagagagagggccgctcgggcctcctggcttg
cccggtttcgctggaaatcccggaccaccagggttaccagggatgaagggtgatccaggt
gagatacttggccatgtgcccgggatgctgttgaaaggtgaaagaggatttcccggaatc
ccagggactccaggcccaccaggactgccagggcttcaaggtcctgttgggcctccagga
tttaccggaccaccaggtcccccaggccctcccggccctccaggtgaaaagggacaaatg
ggcttaagttttcaaggaccaaaaggtgacaagggtgaccaaggggtcagtgggcctccg
ggagtaccaggacaagctcaagttcaagaaaaaggagacttcgccaccaagggagaaaag
ggccaaaaaggtgaacctggatttcaggggatgccaggggtcggagagaaaggtgaaccc
ggaaaaccaggacccagaggcaaacccggaaaagatggtgacaaaggggaaaaagggagt
cccggttttcctggtgaacccgggtacccaggactcataggccgccagggcccgcaggga
gaaaagggtgaagcaggtcctcctggcccccctggaattgttataggcacaggacctttg
ggagaaaaaggagagaggggctaccctggaactccggggccaagaggagagccaggccca
aaaggtttcccaggactaccaggccaacccggacctccaggcctccctgtacctgggcag
gctggtgcccctggcttccctggtgaaagaggagaaaaaggtgaccgaggatttcctggt
acatctctgccaggaccaagtggaagagatgggctcccgggtcctcctggttcccccggg
ccccctgggcagcctggctacacaaatggaattgtggaatgtcagcccggacctccaggt
gaccagggtcctcctggaattccagggcagccaggatttataggtgaaattggagagaaa
ggtcaaaaaggagagagttgcctcatctgtgatatagacggatatcgggggcctcctggg
ccacagggacccccgggagaaataggtttcccaggacagccaggggccaagggcgacaga
ggtttgcctggcagagatggtgttgcgggagtgccaggccctcaaggtacaccagggctg
ataggccagccaggagccaagggggagcctggtgagatttatttcgacttgcggctcaaa
ggtgacaaaggagacccaggctttccaggacagcccggcatgccagggagagcgggttct
cctggaagagatggccatccgggtcttcctggccccaagggctcgccgggttctgtagga
ttgaaaggagagcgtggcccccctggaggagttggattcccaggcagtcgtggtgacacc
ggcccccctgggcctccaggctatggtcctactggtcccattggtgacaaaggacaagca
ggctttcctggaggccctggatccccaggcctgccaggtccaaagggtgaaccaggaaaa
gttgttcctttaccaggtccccctggagcagaaggactgccggggtccccaggcttccca
ggtccccaaggagaccgaggccttcccggaaccccaggaaggccaggcctgccaggagag
aagggcgctgtgggccagccgggcattggatttccagggccccccggccccaaaggtgtt
gacggcttacctggagacgtggggcctccggggactccaggtcgcccgggatttaatggc
ttacctgggaacccaggtgtgcagggccagaagggagagcctggagttggtctaccggga
ctcaaaggtttgccaggtcttcccggcattcctggcacacccggggagaaggggagcatt
ggggtaccaggcgttcctggagaacatggagcgatcggaccccctgggcttcaggggatc
agaggtgaaccgggacctcctggattgccaggctccgtggggtctccaggagttccagga
ataggcccccctggagctaggggtccccctggaggacagggaccaccggggttgtcaggc
cctcctggaataaaaggagagaagggtttccccggattccctggactggacatgccgggc
cctaaaggagataaaggggctcaaggactccctggcataacgggacagtcggggctccct
ggccttcctggacagcagggggctcctgggattcctgggtttccaggttccaagggagaa
atgggcgtcatggggacccccgggcagccgggctcaccaggaccagtgggtgctcctgga
ttaccgggtgaaaaaggggaccatggctttccgggctcctcaggacccaggggagaccct
ggcttgaaaggtgataagggggatgtcggtctccctggcaagcctggctccatggataag
gtggacatgggcagcatgaagggccagaaaggagaccaaggagagaaaggacaaattgga
ccaattggtgagaagggatcccgaggagaccctgggaccccaggagtgcctggaaaggac
gggcaggcaggacagcctgggcagccaggacctaaaggtgatccaggtataagtggaacc
ccaggtgctccaggacttccgggaccaaaaggatctgttggtggaatgggcttgccagga
acacctggagagaaaggtgtgcctggcatccctggcccacaaggctcacctggcttacct
ggagacaaaggtgcaaaaggagagaaaggacaggcaggcccacctggcataggcatccca
gggctgcgtggtgaaaagggagatcaagggatagcgggtttcccaggaagccctggagag
aagggagaaaaaggaaccattgggatcccaggaatgccagggtccccaggccttaaaggg
tctcccgggagtgttggctatccaggaagtcctgggctgcctggagaaaaaggtgacaaa
ggcctcccaggattggatggcatccctggtgtcaaaggagaagcaggtcttcctgggact
cctggccccacaggcccagctggccagaaaggggagccaggcagtgatggaatcccgggg
tcagcaggagagaagggtgaaccaggtctaccaggaagaggattcccagggtttccgggg
gccaaaggagacaaaggttcaaagggtgaggtgggtttcccaggattagccgggagccca
ggaattcctggatccaaaggagagcaaggattcatgggtcctccggggccccagggacag
ccggggttaccgggatccccaggccatgccacggaggggcccaaaggagaccgcggacct
cagggccagcctggcctgccaggacttccgggacccatggggcctccagggcttcctggg
attgatggagttaaaggtgacaaaggaaatccaggctggccaggagcacccggtgtccca
gggcccaagggagaccctggattccagggcatgcctggtattggtggctctccaggaatc
acaggctctaagggtgatatggggcctccaggagttccaggatttcaaggcccaaaaggt
cttcctggcctccagggaattaaaggtgatcaaggcgatcaaggcatcccgggagctaaa
ggtctcccgggtcctcctggccccccaggtccttatgacatcatcaaaggggagcctggg
ctccctggtcctgagggccccccagggctgaaagggcttcagggactgccaggcccgaaa
ggccagcaaggtgttacaggattggtgggtatacctggacctccaggtattcctgggttt
gacggtgcccctggccagaaaggagagatggggcctgccgggcctactggtccaagagga
tttccaggtccaccaggccccgatgggttgccaggatccatggggcccccaggcacccca
tctgttgatcacggcttccttgtgaccaggcatagtcaaacaatagatgacccacagtgt
ccttctgggaccaaaattctttaccacgggtactctttgctctacgtgcaaggcaatgaa
cgggcccacggccaggacttgggcacggccggcagctgcctgcgcaagttcagcacaatg
cccttcctgttctgcaatattaacaacgtgtgcaactttgcatcacgaaatgactactcg
tactggctgtccacccctgagcccatgcccatgtcaatggcacccatcacgggggaaaac
ataagaccatttattagtaggtgtgctgtgtgtgaggcgcctgccatggtgatggccgtg
cacagtcagaccattcagatcccaccgtgtcccagcgggtggtcctcgctgtggatcggc
tactcttttgtgatgcacaccagcgctggtgcagaaggctctggccaagccctggcgtcc
cccggctcctgtctggaggagtttagaagtgcgccattcatcgagtgtcacggccgtggg
acctgtaattactacgcaaatgcttacagcttttggctcgccaccatagagaggagcgag
atgttcaagaagcccacgccgtccaccttgaaggcaggggagctgcgcacgcacgtcagc
cgctgccaagtctgtatgagaagaacataa

DBGET integrated database retrieval system