KEGG   Pongo abelii (Sumatran orangutan): 100444160
Entry
100444160         CDS       T01416                                 
Symbol
COL4A1
Name
(RefSeq) collagen alpha-1(IV) chain
  KO
K06237  collagen type IV alpha
Organism
pon  Pongo abelii (Sumatran orangutan)
Pathway
pon04151  PI3K-Akt signaling pathway
pon04382  Cornified envelope formation
pon04510  Focal adhesion
pon04512  ECM-receptor interaction
pon04820  Cytoskeleton in muscle cells
pon04926  Relaxin signaling pathway
pon04933  AGE-RAGE signaling pathway in diabetic complications
pon04974  Protein digestion and absorption
pon05146  Amoebiasis
pon05165  Human papillomavirus infection
pon05200  Pathways in cancer
pon05222  Small cell lung cancer
Brite
KEGG Orthology (KO) [BR:pon00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    100444160 (COL4A1)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    100444160 (COL4A1)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    100444160 (COL4A1)
  09142 Cell motility
   04820 Cytoskeleton in muscle cells
    100444160 (COL4A1)
 09150 Organismal Systems
  09152 Endocrine system
   04926 Relaxin signaling pathway
    100444160 (COL4A1)
  09154 Digestive system
   04974 Protein digestion and absorption
    100444160 (COL4A1)
  09158 Development and regeneration
   04382 Cornified envelope formation
    100444160 (COL4A1)
 09160 Human Diseases
  09161 Cancer: overview
   05200 Pathways in cancer
    100444160 (COL4A1)
  09162 Cancer: specific types
   05222 Small cell lung cancer
    100444160 (COL4A1)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    100444160 (COL4A1)
  09174 Infectious disease: parasitic
   05146 Amoebiasis
    100444160 (COL4A1)
  09167 Endocrine and metabolic disease
   04933 AGE-RAGE signaling pathway in diabetic complications
    100444160 (COL4A1)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   04147 Exosome [BR:pon04147]
    100444160 (COL4A1)
   00536 Glycosaminoglycan binding proteins [BR:pon00536]
    100444160 (COL4A1)
Exosome [BR:pon04147]
 Exosomal proteins
  Exosomal proteins of other cancer cells
   100444160 (COL4A1)
Glycosaminoglycan binding proteins [BR:pon00536]
 Heparan sulfate / Heparin
  Extracellular matrix molecules
   100444160 (COL4A1)
SSDB
Motif
Pfam: Collagen C4
Other DBs
NCBI-GeneID: 100444160
NCBI-ProteinID: XP_024086740
Ensembl: ENSPPYG00000005496
LinkDB
Position
14:complement(116410338..116567744)
AA seq 1669 aa
MGPRLSVWLLLLPAALLLHEEHSRAAAKGGCAGSGCGKCDCHGVKGQKGERGLPGLQGVI
GFPGMQGPEGPQGPPGQKGDTGEPGLPGTKGTRGPPGASGYPGNPGLPGIPGQDGPPGPP
GIPGCNGTKGERGPLGPPGLPGFAGNPGPPGLPGMKGDPGEILGHVPGMLLKGERGFPGI
PGNPGPPGLPGLQGPVGPPGFTGPPGPPGPPGPPGEKGQMGLSFQGPKGDKGDQGVSGPP
GVPGQAQVQEKGDFATKGEKGQKGEPGFQGMPGVGEKGEPGKPGPRGKPGKDGDKGEKGS
PGFPGEPGYPGLIGRQGPQGEKGEAGPPGPPGIVIGTGPLGEKGERGYPGTPGPRGEPGP
KGFPGLPGQPGPPGLPVPGQAGAPGFPGERGEKGDRGFPGTSLPGPSGRDGLPGPPGSPG
PPGQPGYTNGIVECQPGPPGDQGPPGIPGQPGLIGEIGEKGQKGESCLICDIDGYRGPPG
PQGPPGEIGFPGQPGAKGDRGLPGRDGVAGVPGPQGTPGLIGQPGAKGEPGEIYFDLRLK
GDKGDPGFPGQPGMPGRAGSPGRDGHPGLPGPKGSPGSVGLKGERGPPGGVGFPGSRGDT
GPPGPPGYGPTGPIGDKGQAGFPGGPGSPGLPGPKGEPGKVVPLPGLPGAEGLPGSPGFP
GPQGDRGFPGTPGRPGLPGEKGAVGQPGIGFPGPPGPKGVDGLPGDMGPPGTPGRPGFDG
LPGNPGVQGQKGEPGVGLPGLKGLPGLPGIPGTPGEKGSIGVPGVPGEHGAIGPPGLQGI
RGEPGPPGLPGSVGSPGVPGIGPPGARGPPGGQGPPGLSGPPGIKGEKGFPGFPGLDMPG
PKGDKGAQGLPGITGQSGLPGLPGQQGAPGIPGFPGSKGEMGVMGTPGQPGSPGPVGAPG
LPGEKGDHGFPGSSGPRGDPGFKGDKGDVGLPGKPGSMDKVDMGSMKGQKGDQGEKGQIG
PIGEKGSRGDPGTPGVPGKDGQAGQPGQPGPKGDPGISGTPGAPGLPGPKGSVGGMGLPG
TPGEKGVPGIPGLQGSPGLPGDKGAKGEKGQAGPPGIGIPGLRGEKGDQGIAGFPGSPGE
KGEKGSIGIPGMPGSPGLKGSPGSVGYPGSPGLPGEKGDKGLPGLDGIPGVKGEAGLPGT
PGPTGPAGQKGEPGSDGIPGSAGEKGEPGLPGRGFPGFPGAKGDKGSKGEVGFPGLAGSP
GIPGSKGEQGFMGPPGPQGQPGLPGSPGHATEGPKGDRGPQGQPGLPGLPGPMGPPGLPG
IDGVKGDKGNPGWPGAPGVPGPKGDPGFQGMPGIGGSPGITGSKGDMGPPGVPGFQGPKG
LPGLQGIKGDQGDQGVPGAKGLPGPPGPPGPYDIIKGEPGLPGPEGPPGLKGLQGLPGPK
GQQGVTGLVGIPGPPGIPGFDGAPGQKGEMGPAGPTGPRGFPGPPGPDGLPGSMGPPGTP
SVDHGFLVTRHSQTIDDPQCPSGTKILYHGYSLLYVQGNERAHGQDLGTAGSCLRKFSTM
PFLFCNINNVCNFASRNDYSYWLSTPEPMPMSMAPITGDNIRPFISRCAVCEAPAMVMAV
HSQTIQIPPCPSGWSSLWIGYSFVMHTSAGAEGSGQALASPGSCLEEFRSAPFIECHGRG
TCNYYANAYSFWLATIERSEMFKKPTPSTLKAGELRTHVSRCQVCMRRT
NT seq 5010 nt   +upstreamnt  +downstreamnt
atggggccccggctcagcgtctggcttctgctgctgcccgccgcccttctgctccacgag
gagcacagccgggccgctgcgaagggtggctgtgctggctctggctgtggcaaatgtgac
tgccatggagtgaagggacaaaagggtgaaagaggcctcccggggttacaaggtgtcatt
gggtttcctggaatgcaaggacctgaggggccgcagggaccaccaggacaaaagggtgat
actggagaaccaggactacctggaacaaaagggacaagaggacctccgggagcatctggc
taccctggaaacccaggacttcccggaattcctggccaagacggcccgccaggcccccca
ggtattccaggatgcaatggcacaaagggagagagagggccactcgggcctcctggcttg
cctggtttcgctggaaatcccggaccaccagggttaccgggaatgaagggtgatccaggt
gagatacttggccatgtgcctgggatgctgttgaaaggtgaaagaggatttcccggaatc
ccagggaatccaggcccaccaggactgccagggcttcaaggtcctgttgggcctccagga
tttacgggaccaccaggtccccccggccctcccggccctccaggtgaaaagggacaaatg
ggcttaagttttcaaggaccgaaaggtgacaagggtgaccaaggggtcagtgggcctccg
ggagtaccaggacaagctcaagttcaagaaaaaggagacttcgccaccaagggagaaaag
ggccaaaaaggtgaacctggatttcaggggatgccaggggtcggagagaaaggtgaaccc
ggaaaaccaggacccagaggaaaacccggaaaagatggtgacaaaggggaaaaagggagt
cccggttttcctggtgaacccgggtacccaggactcataggccgccagggcccgcaggga
gaaaagggtgaagcaggtcctcccggcccacctggaattgttataggcacaggacctttg
ggagaaaaaggagagaggggctaccctggaactccagggccaagaggagagccaggccca
aaaggtttcccaggactaccaggccaacccggacctccaggcctcccggtacctgggcag
gctggtgcccctggcttccctggtgaaagaggagaaaaaggtgaccgaggatttcctggt
acatctctgccaggaccaagtggaagagatgggctcccaggtcctcctggttcccccggg
ccccctgggcagcctggctacacaaatggaattgtggaatgtcagcccggacctccaggt
gaccagggtcctcctggaattccagggcagccaggattgataggcgaaattggagagaaa
ggtcaaaaaggagagagttgcctcatctgtgatatagacggatatcgggggcctcccggg
ccacagggacccccaggagaaataggtttcccaggacagccaggggccaagggcgacaga
ggtttgcctggcagagatggtgttgcgggagtgccagggcctcaaggtacaccagggctg
ataggccagccaggagccaagggggagcctggtgagatttatttcgacttgcggctcaaa
ggtgacaaaggagacccaggcttcccaggacagcccggcatgccagggagagcgggttct
cctggaagagatggccatccgggtcttcctggccccaagggctcgccgggttctgtagga
ttgaaaggagagcgtggcccccctggaggagttggattcccaggcagtcgtggtgacacc
ggcccccctgggcctccaggatatggtcctactggtcccattggtgacaaaggacaagca
ggctttcctggaggccctggatccccaggcctgccaggtccaaagggtgaaccaggaaaa
gttgttcctttaccaggcctccctggagcagaaggactgccggggtccccaggcttccca
ggtccccaaggagaccgaggctttcccggaaccccaggaaggccagggctgccaggagag
aagggtgctgtgggccagccaggaattggatttccagggccccctggccccaaaggtgtt
gacggcttacctggagacatggggcctccggggactccaggtcgcccgggatttgatggc
ttacctgggaacccaggtgtgcagggccagaagggagagcctggagttggtctgccggga
ctcaaaggtttgccaggtcttcccggcattcctggcacacccggggagaaggggagcatt
ggggtaccaggcgttcctggagaacacggagcgatcggaccccctgggcttcaggggatc
agaggtgaaccgggacctcctggattgccaggctccgtggggtctccaggagttccagga
ataggcccccctggagctaggggtccccctggaggacagggaccaccggggttgtcaggc
cctcctggaataaaaggagagaagggtttccccggattccctggactggacatgccgggc
cctaaaggagataaaggggctcaaggacttcctggcataacgggacagtcagggctccct
ggccttcctggacagcagggggctcctgggattcctgggtttccaggttccaagggagaa
atgggcgtcatggggacccctgggcagccgggctcaccaggaccagtgggtgctccggga
ttaccgggtgaaaaaggggaccatggctttccgggctcctcaggacccaggggagaccct
ggctttaaaggtgataagggggatgtcggtctccctggcaagcctggctccatggataag
gtggacatgggcagcatgaagggccagaaaggagaccaaggagagaaaggacaaattgga
ccaattggtgagaagggatcccgaggagaccctgggaccccaggagtgcctggaaaggat
gggcaggcaggacagcctgggcagccaggacctaaaggtgatccaggtataagtggaacc
ccaggtgctccaggacttccgggaccaaaaggatctgttggtggaatgggcttgccagga
acacctggagagaaaggtgtgcccggcatccctggcctgcaaggttcacctggcttacct
ggagacaaaggtgcaaaaggagagaaagggcaggcaggcccacctggcataggcatccca
gggctgcgtggtgaaaagggagatcaagggatagcgggtttcccaggaagccctggtgag
aagggagaaaaaggaagcatcgggatcccaggaatgccagggtccccaggccttaaaggg
tctcccgggagtgttggctatccaggaagccctgggctgcctggagaaaaaggtgacaaa
ggcctcccaggattggatggcatccctggtgtcaaaggagaagcaggtcttcctgggact
cctggccccacaggcccagctggccagaaaggggagccaggcagtgacggaatcccgggg
tcagcaggagagaagggtgaaccaggtctaccaggaagaggattcccagggtttccaggg
gccaaaggagacaaaggttcaaagggtgaggtgggtttcccaggattagctgggagccca
ggaattcctggatccaaaggagagcaaggattcatgggtcctccagggccccagggacag
ccggggttaccgggatccccaggccatgccacggaggggcccaaaggagaccgtggacct
cagggccagcctggcctgccaggacttccgggacctatggggcctccagggcttcctggg
attgatggagttaaaggtgacaaaggaaatccaggctggccaggagcacccggtgtccca
gggcccaagggagaccctggattccagggcatgcctggtattggtggctctccaggaatc
acaggctctaagggtgatatggggcctccaggagttccaggatttcaaggtccaaaaggt
cttcctggcctccagggaattaaaggtgatcaaggcgatcagggtgtcccgggagctaaa
ggtctcccgggtcctcctggacccccaggtccttacgacatcatcaaaggggagcccggg
ctccctggtcctgagggccccccagggctgaaagggcttcagggacttccaggcccgaaa
ggccagcaaggtgttacaggattggtgggtatacctggacctccaggtattcctgggttt
gacggtgcccctggccagaaaggagagatgggacctgctgggcctactggtccaagagga
tttccaggtccaccaggccctgatgggttgccaggatccatggggcccccaggcacccca
tctgttgatcacggcttccttgtgaccaggcacagtcaaacaatagatgacccacagtgt
ccttctgggaccaaaattctttaccatgggtactctttgctctacgtgcaaggcaatgaa
cgggcccatggccaggacctgggcacggccggcagctgcctgcgcaagttcagcacaatg
cccttcctgttctgcaatattaacaacgtgtgcaactttgcatcacgaaatgactattcg
tactggctgtccacccccgagcccatgcccatgtcaatggcacccatcacgggggacaac
ataagaccatttattagtaggtgtgctgtgtgtgaggcgcccgccatggtgatggccgtg
cacagtcagaccattcagatcccaccgtgccccagcgggtggtcctcgctgtggatcggc
tactcttttgtgatgcacaccagtgctggtgcagaaggctctggccaagccctggcgtcc
cccggctcctgtctggaggagtttagaagtgcgccattcatcgagtgtcacggccgtggg
acctgtaattactacgcaaacgcttacagcttttggctcgccaccatagagaggagcgag
atgttcaagaagcccacgccgtccaccttgaaggcaggggagctgcgcacgcacgtcagc
cgctgccaagtctgtatgagaagaacataa

DBGET integrated database retrieval system