KEGG   Cercocebus atys (sooty mangabey): 105584910
Entry
105584910         CDS       T07242                                 
Symbol
COL4A1
Name
(RefSeq) collagen alpha-1(IV) chain
  KO
K06237  collagen type IV alpha
Organism
caty  Cercocebus atys (sooty mangabey)
Pathway
caty04151  PI3K-Akt signaling pathway
caty04510  Focal adhesion
caty04512  ECM-receptor interaction
caty04820  Cytoskeleton in muscle cells
caty04926  Relaxin signaling pathway
caty04933  AGE-RAGE signaling pathway in diabetic complications
caty04974  Protein digestion and absorption
caty05146  Amoebiasis
caty05165  Human papillomavirus infection
caty05200  Pathways in cancer
caty05222  Small cell lung cancer
Brite
KEGG Orthology (KO) [BR:caty00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    105584910 (COL4A1)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    105584910 (COL4A1)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    105584910 (COL4A1)
  09142 Cell motility
   04820 Cytoskeleton in muscle cells
    105584910 (COL4A1)
 09150 Organismal Systems
  09152 Endocrine system
   04926 Relaxin signaling pathway
    105584910 (COL4A1)
  09154 Digestive system
   04974 Protein digestion and absorption
    105584910 (COL4A1)
 09160 Human Diseases
  09161 Cancer: overview
   05200 Pathways in cancer
    105584910 (COL4A1)
  09162 Cancer: specific types
   05222 Small cell lung cancer
    105584910 (COL4A1)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    105584910 (COL4A1)
  09174 Infectious disease: parasitic
   05146 Amoebiasis
    105584910 (COL4A1)
  09167 Endocrine and metabolic disease
   04933 AGE-RAGE signaling pathway in diabetic complications
    105584910 (COL4A1)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   04147 Exosome [BR:caty04147]
    105584910 (COL4A1)
   00536 Glycosaminoglycan binding proteins [BR:caty00536]
    105584910 (COL4A1)
Exosome [BR:caty04147]
 Exosomal proteins
  Exosomal proteins of other cancer cells
   105584910 (COL4A1)
Glycosaminoglycan binding proteins [BR:caty00536]
 Heparan sulfate / Heparin
  Extracellular matrix molecules
   105584910 (COL4A1)
SSDB
Motif
Pfam: Collagen C4
Other DBs
NCBI-GeneID: 105584910
NCBI-ProteinID: XP_011913415
Ensembl: ENSCATG00000031204
UniProt: A0A2K5LN90
LinkDB
Position
Unknown
AA seq 1669 aa
MGPRLSVWLLLLPAALLLHEEHSRAAAKGGCAGSGCGKCDCHGVKGQKGERGLPGLQGVI
GFPGMQGPEGPQGPPGQKGDTGEPGLPGTKGTRGPPGASGYPGNPGLPGIPGQDGPPGPP
GIPGCNGTKGERGPLGPPGLPGFAGNPGPPGLPGMKGDPGEILGHVPGMLLKGERGFPGL
PGNPGPPGLPGLQGPVGPPGFTGPPGPPGPPGPPGEKGQMGLSFQGPKGDKGDQGVSGPP
GVPGQAQVQEKGDFATKGEKGQKGEPGFQGMPGVGEKGEPGRPGPRGKPGKDGDKGEKGS
PGFPGEPGYPGLVGRQGPQGEKGEAGPPGPPGIVIGTGPLGEKGERGYPGTPGPRGEPGP
KGFPGLPGQPGPPGPPLPGQAGAPGFPGERGEKGDRGFPGASLPGPSGRDGLPGPPGSPG
PPGQPGYTNGIVECQPGPPGDQGPPGIPGQPGLIGEIGEKGQKGESCLICDTTGYRGPPG
PQGPPGEIGFPGQPGAKGDRGLPGRDGVAGVPGPQGTPGLIGQPGAKGEPGEIYFDLRLK
GDKGDPGFPGQPGMPGRAGSPGRDGHAGLPGPKGSPGSVGLKGERGPPGGVGFPGSRGDT
GPPGPPGYGPTGPIGDKGQAGFPGGPGSPGLPGPKGEPGKVVPLPGPPGAEGLPGSPGFP
GPQGDRGFPGTPGRPGLPGEKGAVGQPGIGFPGPPGPKGVDGLPGDMGPPGTPGRPGFNG
LPGNPGVQGQKGEPGVGLPGLKGLPGLPGIPGTPGEKGSIGVPGIPGEHGAIGPPGLQGI
RGEPGPPGLPGSVGSPGVPGIGPPGARGPPGGQGPPGLSGPPGIKGEKGFPGFPGLDMPG
PKGDKGSQGLPGVTGQSGLPGLPGQQGSPGIPGFPGSKGEMGVMGTPGQPGSPGPVGVPG
LPGEKGDHGFPGSSGPRGDPGLKGDKGDVGLPGKPGSMDKVDMGSMKGQKGDQGEKGQIG
PIGEKGSRGDPGTPGVPGKDGQAGQPGQPGPKGDPGISGTPGAPGLPGPKGSVGGMGLPG
TPGEKGVPGIPGPQGSPGLPGEKGAKGAKGQAGPPGIGIPGLPGEKGDQGIAGFPGSPGE
KGEKGSIGIPGMPGSPGLKGSPGSVGYPGSPGLPGEKGDKGLPGLDGIPGVKGEAGLPGT
PGPSGPAGQKGEPGSDGIPGSAGEKGEPGLPGRGFPGFPGAKGDKGSKGEVGFPGLAGSP
GIPGSKGEQGFMGPPGPQGQPGLPGSPGHATEGPKGDRGPQGQPGLPGLPGPMGPPGLPG
IDGVKGDKGNPGWPGAPGVPGPKGDPGFQGMPGIGGSPGITGSKGDMGPPGVPGFQGPKG
VPGLQGIKGDQGDQGVPGAKGLPGPPGPPGPYDIIKGEPGLPGPEGPAGLKGLQGPPGPK
GQQGVTGLVGIPGPPGIPGFDGAPGQKGEMGPTGPTGPRGFPGPPGPDGLPGSMGPPGTP
SVDHGFLVTRHSQTIDDPQCPSGTKILYHGYSLLYVQGNERAHGQDLGTAGSCLRKFSTM
PFLFCNINNVCNFASRNDYSYWLSTPEPMPMSMAPITGDNIRPFISRCAVCEAPAMVMAV
HSQTIQIPPCPSGWSSLWIGYSFVMHTSAGAEGSGQALASPGSCLEEFRSAPFIECHGRG
TCNYYANAYSFWLATIERSEMFKKPTPSTLKAGELRTHVSRCQVCMRRT
NT seq 5010 nt   +upstreamnt  +downstreamnt
atggggccccggctcagcgtctggctgctgctgctgcccgccgcccttctgctccacgag
gagcacagccgggccgctgcgaagggtggctgtgctggctctggctgtggcaaatgtgac
tgccatggagtgaagggacaaaagggcgaaagaggccttccagggttacaaggtgtcatt
gggtttcctggaatgcaaggacctgaggggccgcagggaccaccaggacaaaagggtgat
actggagaaccaggactacctggaacaaaagggacaagaggacccccgggagcatctggc
taccctggaaacccgggacttcccggtattcctggccaagacggcccaccaggcccccca
ggtattccaggatgcaatggcacaaagggggagagagggccactcgggcctcctggcttg
cctggtttcgctggaaatcccggaccaccagggttaccaggaatgaagggtgatccaggt
gagatacttggccatgtgcccgggatgctgttgaaaggtgaaagaggatttcccggactc
ccagggaatccaggcccaccaggactgccagggcttcaaggtcctgttgggcctccagga
tttaccggaccaccgggtcccccaggtcctcctggccctccaggtgaaaagggacaaatg
ggcttaagttttcaaggacccaaaggtgacaagggtgaccaaggggtcagtgggcctccg
ggagtaccaggacaagctcaagttcaagaaaaaggagattttgccaccaagggagaaaag
ggccaaaaaggtgaacctggatttcagggaatgccaggggtcggagagaaaggtgaaccc
ggaagaccaggacccagaggaaaacctggaaaagatggtgacaaaggggaaaaagggagc
cccggttttcctggtgaacccgggtacccaggactcgtaggccgccagggcccgcaggga
gaaaagggtgaagcaggtcctcccggcccacctggaattgttataggcacaggacctttg
ggagaaaaaggagagcggggctaccctggaactccagggccaagaggagagccaggccca
aaaggtttcccaggactaccaggccaacctggacctccaggcccccctctacctgggcag
gctggtgccccaggcttccctggtgaaagaggagaaaaaggtgaccgaggatttccgggt
gcctctctgccaggaccaagtggaagagacgggctcccgggtcctcctggttcccccggg
ccccctgggcagcctggctacacaaatggaattgtggaatgtcagcccggacctccaggt
gaccagggtcctcctggaattccagggcagccaggattgataggcgaaattggagagaaa
ggtcaaaaaggagagagttgcctcatctgtgatacaaccgggtatcgggggcctcccggg
ccacagggacccccaggagaaataggtttcccaggacagccaggggccaagggcgacaga
ggtttgcctggcagagacggtgttgcaggagtgcctgggcctcaaggtacaccagggctg
atcggccagccaggagccaagggggagcctggtgagatttatttcgacctgcggctcaaa
ggtgacaaaggagacccaggctttccaggacagcccggcatgccagggagagcgggttct
cctggaagagatggccatgcgggtcttcctggccccaagggctcgccgggttctgtagga
ttgaaaggagagcgtggtccccctggaggagttggattcccaggcagtcgtggtgacacc
ggcccccctgggcctccaggatatggtcctactggtcccattggtgacaaaggacaagca
ggctttcctggaggccctgggtccccaggcctgccaggtccaaagggtgaaccaggaaaa
gttgttcctttaccaggcccccctggagcagaaggactgccggggtccccaggcttccca
ggtccccaaggagaccgaggctttcctggaaccccaggaaggccaggcctgccaggagag
aagggtgctgtgggccagccaggaattggatttccagggccccccggccccaaaggtgtt
gatggcttacctggagacatggggcctccagggactccaggtcgcccgggatttaatggc
ttacctgggaacccaggtgtgcagggccagaagggagagcctggagttggtctaccggga
ctcaaaggtttgccaggtcttcctggcattcctggcacacccggggagaaggggagcatt
ggggtaccaggtattcctggagaacacggagcgattggaccccctgggcttcaagggatc
agaggtgaaccggggcctcctggattgccaggctccgtggggtctccaggagttccagga
atcggcccccctggagctaggggcccccctggaggacagggaccaccggggttgtcgggc
cctcctggaataaaaggagagaagggtttccccggattccccggactggacatgccaggc
cctaaaggagataaagggtctcaaggacttcctggcgtaacaggacagtccgggctccct
ggccttcctggacagcaggggtctcctgggattcctgggtttccaggttccaagggagaa
atgggcgtcatggggacccccgggcaaccgggctcaccaggaccagtgggtgttccggga
ttaccgggtgaaaaaggggaccatggcttcccgggctcctcaggacccaggggagaccct
ggcctgaaaggtgataagggggatgtcggtctccctggcaagcctggctccatggataag
gtggacatgggcagcatgaagggccagaagggagaccaaggagagaaaggacaaattgga
ccaattggtgagaaaggatcccgaggagaccctgggaccccgggagtacctggaaaggat
gggcaggcaggacagcctgggcagccaggacctaaaggtgatccaggtataagtggaacc
ccaggtgctccaggacttccgggaccaaaaggatctgttggtggaatgggcttgccagga
acacctggagagaaaggtgtgcctggcatccctggcccacaaggttcacctggcttacct
ggagaaaaaggtgcaaaaggagcgaaagggcaggcaggcccacctggcataggcatccca
gggctgcctggtgaaaagggagatcaagggatagcgggtttcccaggaagccctggagag
aagggagaaaaaggaagcattgggatcccaggaatgccagggtccccaggccttaaaggg
tctcctgggagtgttggctatccagggagccctgggctgcccggagaaaaaggtgacaaa
ggcctcccaggattggatggcatccctggtgtcaaaggagaagcaggtcttcctgggacg
cctggcccctcaggcccagctggccagaaaggggagccaggcagtgacggaatcccaggc
tcagcaggagagaagggtgaaccaggtctaccaggaagaggattcccagggtttccaggg
gccaaaggagacaaaggttcaaagggtgaggtgggtttcccaggattagctgggagccca
ggaattcctggatccaaaggagagcaaggattcatgggtcctccagggccccagggacag
ccggggttaccaggatccccaggccatgccacggaggggcccaaaggagaccgtggacct
cagggccagcctggcttgccaggacttccgggacccatggggcctccagggcttcctggg
attgatggagttaaaggtgacaaaggaaatccaggctggccaggagcacccggtgtccca
gggcccaagggagaccctggattccagggcatgcctggcattggtggctctccaggaatc
acaggctctaagggtgatatggggcctccaggagttccaggatttcaaggtccaaaaggt
gttcctggcctccagggaattaaaggtgatcaaggcgatcaaggtgtcccgggagctaaa
ggtctcccgggtcctcctggccccccaggtccttacgacatcatcaaaggggagcccggg
ctccctggtcctgagggccccgcagggctgaaagggcttcagggacctccaggccccaaa
ggccagcaaggtgttacaggattggtgggtatacctggacctccaggtattcctgggttc
gacggtgcccctggccagaaaggagagatgggacctaccgggcctactggtccaagagga
tttccaggtccaccaggccccgatgggttgccagggtccatggggcccccaggcacccca
tctgttgatcacggcttccttgtgaccaggcacagtcaaacaatagatgacccacagtgt
ccttctgggaccaaaattctttaccatgggtactctttgctctacgtgcaaggcaacgaa
cgggcccatggccaggacttgggcacggccggcagctgcctgcgcaagttcagcacaatg
cccttcctgttctgcaacattaacaacgtgtgcaactttgcatcacgaaatgactactca
tactggctgtccacccccgagcccatgcccatgtcaatggcacccatcacgggggacaac
ataagaccctttattagtaggtgtgctgtgtgtgaggcgcccgccatggtgatggccgtg
cacagtcagaccattcagatcccaccgtgccccagcgggtggtcctcgctgtggatcggc
tactcttttgtgatgcacaccagcgctggtgcagaaggctctggccaagccctggcgtcc
cctggctcctgtctggaggagtttagaagtgcgccattcatcgagtgtcacggccgtggg
acctgtaattactacgcaaatgcttacagcttttggctcgccaccatagagaggagcgag
atgttcaagaaacccacgccgtccaccttgaaggcaggggagctgcgcacgcacgtcagc
cgctgccaagtctgtatgagaagaacataa

DBGET integrated database retrieval system