KEGG   Pongo abelii (Sumatran orangutan): 100443710
Entry
100443710         CDS       T01416                                 
Symbol
COL4A4
Name
(RefSeq) collagen alpha-4(IV) chain
  KO
K06237  collagen type IV alpha
Organism
pon  Pongo abelii (Sumatran orangutan)
Pathway
pon04151  PI3K-Akt signaling pathway
pon04510  Focal adhesion
pon04512  ECM-receptor interaction
pon04820  Cytoskeleton in muscle cells
pon04926  Relaxin signaling pathway
pon04933  AGE-RAGE signaling pathway in diabetic complications
pon04974  Protein digestion and absorption
pon05146  Amoebiasis
pon05165  Human papillomavirus infection
pon05200  Pathways in cancer
pon05222  Small cell lung cancer
Brite
KEGG Orthology (KO) [BR:pon00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    100443710 (COL4A4)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    100443710 (COL4A4)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    100443710 (COL4A4)
  09142 Cell motility
   04820 Cytoskeleton in muscle cells
    100443710 (COL4A4)
 09150 Organismal Systems
  09152 Endocrine system
   04926 Relaxin signaling pathway
    100443710 (COL4A4)
  09154 Digestive system
   04974 Protein digestion and absorption
    100443710 (COL4A4)
 09160 Human Diseases
  09161 Cancer: overview
   05200 Pathways in cancer
    100443710 (COL4A4)
  09162 Cancer: specific types
   05222 Small cell lung cancer
    100443710 (COL4A4)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    100443710 (COL4A4)
  09174 Infectious disease: parasitic
   05146 Amoebiasis
    100443710 (COL4A4)
  09167 Endocrine and metabolic disease
   04933 AGE-RAGE signaling pathway in diabetic complications
    100443710 (COL4A4)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   04147 Exosome [BR:pon04147]
    100443710 (COL4A4)
   00536 Glycosaminoglycan binding proteins [BR:pon00536]
    100443710 (COL4A4)
Exosome [BR:pon04147]
 Exosomal proteins
  Exosomal proteins of other cancer cells
   100443710 (COL4A4)
Glycosaminoglycan binding proteins [BR:pon00536]
 Heparan sulfate / Heparin
  Extracellular matrix molecules
   100443710 (COL4A4)
SSDB
Motif
Pfam: Collagen C4
Other DBs
NCBI-GeneID: 100443710
NCBI-ProteinID: XP_024098889
Ensembl: ENSPPYG00000013236
LinkDB
Position
2B:complement(114284855..114436281)
AA seq 1684 aa
MWSLHIVLMRYSFGLTKSLATGPWSLILILFSVQYVYGSGKKYVGPCGGRDCSVCHCVPE
KGSRGPPGPPGPQGPIGPLGAPGPIGLSGEKGMRGDRGPPGAAGDKGDKGPTGVPGFPGL
DGIPGHPGPPGPRGKPGMSGHNGSRGDPGFLGGRGALGPGGPPGHPGEKGEKGNSVFILG
AIKGIQGDRGDPGLPGLPGFWGAGGPAGPTGYPGEPGLVGPPGQPGRPGLKGNPGVGIKG
QMGDSGEVGQQGSPGPTLLVEPPDFCLYKGEKGIKGIPGMIGLPGPPGRKGESGIGAKGE
KGIPGFPGPRGDPGSYGSPGFPGLKGELGLVGDPGLFGLIGPKGDPGNRGHPGPPGVLVT
PPLPLKGPPGDPGFPGRYGETGDVGPPGPPGLLGRPGEACAGMIGPPGPQGFPGLPGLPG
EAGIPGRPDSAPGKPGKPGSPGLPGEPGLQGLPGSSATYCSVGNPGPQGIKGKVGPPGGR
GSKGEKGNEGLCACEPGPMGPPGPPGLPGRQGSKGDLGIPGWLGTEGDPGSPGAEGPPGL
PGKHGASGPPGNKGAKGDMVVSRVKGHKGERGPDGPPGFPGQPGSHGRDGHAGEKGDPGP
PGDHEDATPGGKGFPGPLGPPGKAGPVGPPGLGFPGLPGERGHPGVPGRPGVRGPDGLKG
QKGDTISCNVTYPGRQGPPGFDGPPGPKGFPGPQGAPGLSGSDGHKGRPGTPGTSEIPGP
PGFRGDMGDPGFGGEKGSSPVGPPGPPGSPGVNGQKGIPGDPAFGHLGSPGRRGLSGVPG
IKGPRGDPGCPGAEGPAGIPGFPGLKGPKGREGHAGFPGVPGPPGHSCERGAPGIPGQPG
LPGYPGGKGQPGDVGPPGPAGMKGLPGLPGRPGAHGPPGLPGIPGPFGDDGLPGPPGPKG
PQGLPGFPGFPGERGKPGAEGCPGTKGEPGEKGMSGFPGDRGLRGAKGAIGPPGDEGEMA
IISQKGTPGEPGPPGDDGFPGERGDKGTPGMQGRRGEPGRYGPPGFHRGQPGEKGQPGPP
GPPGPPGSMGLRGFIGFPGLPGDQGEPGSPGPPGFSGIDGARGPKGNKGDPASHFGPPGR
KGEPGSPGCPGHFGASGEQGLPGIQGPRGSPGRPGPPGSSGPPGCPGDQGMPGLRGQPGE
MGDPGPRGLQGDPGIPGPPGIKGPSGSPGLNGLHGLKGQKGTKGASGLHDVGPPGPVGIP
GLKGERGDPGSPGISPPGPYGEKGPPGPPGRSGPPGPAGATGRAPKDIPDPGPPGDQGPP
GPDGPRGAPGPPGLPGSVDLLRGEPGDCGLPGPPGPPGPPGPPGYKGFPGCDGKDGQKGP
MGFPGPQGPHGFPGPPGEKGLPGPPGRKGPTGLPGPRGEPGPPADVDDCPRIPGLPGAPG
MRGPEGAMGLPGMRGPPGPGCKGEPGLDGRRGVDGVPGSPGPPGRKGDTGEDGYPGGPGP
PGPTGDPGPKGFGPGYLGGFLLVLHSQTDQEPTCPLGMPRLWTGYSLLYLEGQEKAHNQD
LGLAGSCLPVFSTLPFAYCNIHQVCHYAQRNDRSYWLASAAPLPMMPLSEEAIRPYVSRC
VVCESPAQAVAVHSQDQSIPPCPQTWRSLWIGYSFLMHTGAGDQGGGQALMSPGSCLEDF
RAAPLLECQGRQGTCHFFANKYSFWLTTVKADLQFSSAPAPDTLKESQAQRQKISRCQVC
VKYS
NT seq 5055 nt   +upstreamnt  +downstreamnt
atgtggtctctgcacatagtactaatgaggtactccttcggattgaccaagtccttggcc
acaggtccctggtcacttatactcattctcttttctgtacaatatgtatatgggagtgga
aagaaatacgttggtccttgcggaggaagagattgctctgtttgccactgtgttcctgaa
aaggggtctcggggtccaccaggaccaccagggccacagggtccaattggacccctggga
gccccaggacccattgggctttcaggagagaaaggaatgagaggggaccgcggccctcct
ggagcagcaggggacaaaggagataagggtccaactggtgttcctggatttccaggttta
gatggcatacctgggcacccagggcctcctggacccagaggcaaacctggcatgagtggc
cacaatggctcaagaggtgacccagggtttctaggaggaagaggagctcttggcccagga
ggccccccaggccatcctggggaaaagggagaaaaaggaaattcagtgttcattttaggt
gccattaaaggtattcagggagacagaggggacccaggactgcctggcttaccaggattt
tggggtgcaggaggaccagcgggccccacaggatatcctggagagccagggttagtggga
cctccgggccaaccagggcgtccaggtttgaagggaaatcccggtgtgggaataaagggg
caaatgggagactcgggtgaggttggtcagcaaggttctcctggacccaccctgttggta
gagccacctgacttttgtctctataaaggagaaaagggtataaaaggaattcccggaatg
attggactgccaggaccaccaggacgcaagggagaatccggtattggggcaaaaggagaa
aaaggtattcccggatttccagggcctcggggggatcctggttcctatggatctccaggt
tttccaggattaaagggagaactaggactggttggagatcctgggctatttggattaatt
ggcccaaagggggatcctggaaatcgagggcacccaggaccaccaggtgttttggtgact
ccacctcttccactcaaaggcccaccaggggacccagggttccctggccgctatggagaa
acaggggatgttggaccacctggtcccccaggtctcttgggcagaccaggggaagcctgt
gcaggcatgataggaccccctgggccacaaggatttcctggtcttcctgggcttccagga
gaagctggtattcctgggagacctgattctgctccaggaaaaccagggaagccaggatca
cctggcttgcctggagaaccaggcctgcagggcctcccaggatcaagtgcgacatactgc
agtgttgggaaccctggaccacaaggaataaaaggcaaagtgggtcccccaggaggaaga
ggctcaaaaggagaaaaaggaaatgaaggactctgtgcctgtgagcctggtcccatgggc
ccccctgggcctccaggacttcctgggaggcaggggagtaagggagacttggggatccct
ggctggcttggaacagaaggtgacccgggatctcctggtgctgaaggacctccagggcta
ccaggaaagcatggtgcctccggaccacctggcaacaaaggggcaaagggtgacatggtt
gtatcaagagttaaagggcacaaaggagaaagaggtcctgatgggcccccaggatttcca
gggcagccaggatcacatggtcgggatggacatgctggagaaaaaggggatccaggaccc
ccaggggatcatgaagatgcgaccccaggtggtaaaggatttcctggacctctgggcccc
ccgggcaaagcaggacctgtggggcccccaggactgggatttcctggtctaccaggagag
cgaggccacccaggagttccaggccgcccaggtgtgaggggccctgatggcttgaagggt
cagaaaggtgacacaatttcttgcaacgtaacctaccctgggaggcaaggccctccaggt
tttgatggacctccaggtccaaagggatttccaggtccccaaggtgcccccgggctgagt
ggttcagatggacataaaggcagacctggcacaccaggaacatcggaaataccaggtcca
cctggttttcgtggtgacatgggagatccgggttttggaggtgaaaaggggtcctcccct
gttgggcccccaggccctcccgggtcaccaggagtgaatggtcagaaaggaatcccggga
gaccctgcatttggtcacctgggatccccaggaaggaggggtctttcaggagtgccaggg
ataaaaggacccagaggtgatccgggatgtccaggggctgaagggccagctggcattcct
ggattcccaggtctcaaaggtcccaaaggcagagagggacatgctgggtttccaggtgtc
ccaggtccgcctggccattcctgtgaaagaggtgctccagggataccagggcaaccggga
ctccctgggtatccaggtgggaaaggacagccgggagatgtggggcctcccgggccagct
ggaatgaaaggtctccccggactcccaggacggcctggggcacatggtcccccaggcctc
ccaggaatcccaggtccttttggggatgatgggctacccggtcctccaggtccaaaggga
ccccaggggctgcctggtttcccaggttttcccggagaaagaggaaagcctggtgcagag
ggatgtcctggcacaaagggagaacctggagagaagggcatgtctggctttcccggagac
cggggactgagaggagccaaaggagccataggacctcccggagatgaaggagaaatggct
atcatttcccaaaagggaacacctggggaacctggacctcctggagatgatggattccca
ggagaaagaggtgataaaggaactcccgggatgcaagggagaagaggagagccgggaaga
tacggaccacctggatttcacagagggcaacctggcgagaaaggtcagccagggcctcct
ggacccccaggccctccaggctcaatgggtctaagagggttcattggttttccaggactt
ccaggtgaccagggtgagccaggttctccaggtccccctggattttcaggaattgatgga
gcaagaggacctaaaggaaacaaaggtgaccctgccagtcactttggtccacctggtcga
aagggtgagccaggtagccctggatgtccagggcattttggagcatccggagagcagggc
ttgcctggcattcaagggcccagaggatcacccggaaggccagggccacctggctcctct
ggaccaccagggtgcccaggtgatcaggggatgcctgggctgaggggacagccaggagaa
atgggagaccctgggccaagaggcctccagggggatccagggataccaggtcctccggga
ataaaaggtccctccggatcacctggtctaaacggcttgcatggattgaagggtcagaaa
ggaaccaaaggtgcttcaggtttgcatgatgtggggccacctggtccagtgggaatacct
gggctaaaaggggagagaggagatcctgggagcccaggaatctctcctccaggtccttat
ggagaaaaaggtcccccaggtcccccagggagatcaggaccacctggtcctgcaggtgcc
acaggaagagctcctaaggacattcctgacccgggtccacctggagatcagggacctcct
ggtcctgatggcccaagaggagcacctgggcctccaggcctccctgggagtgttgacctt
ctgagaggggagccaggtgactgtggtctaccagggccaccaggtccccctggcccacca
ggccctccaggatacaaaggctttccaggatgcgatggaaaagatggccagaaaggacca
atgggattcccggggccgcagggaccacatggatttcctgggccacctggagagaagggt
ttacctggacctccagggagaaaagggcccactggtcttccaggtcccagaggtgaacca
gggccacctgcagatgtggatgactgtccccgaatcccaggccttcctggggcaccaggc
atgagaggaccagaaggagccatggggctccctggaatgagaggccccccaggaccaggg
tgcaaaggagagcctgggctggatggcaggaggggtgtggatggcgtccctgggtctcct
gggcctcctggacgtaaaggtgacacaggagaagacggctaccctggaggaccagggcct
cctggtcccactggggatcctgggcccaaagggtttggccctggatacctcggtggcttc
ctcctggttctccacagtcagacggaccaggagcccacctgccccctgggcatgcccagg
ctctggactgggtatagtctgttatacctggaagggcaagagaaagctcacaatcaagac
cttggtctggcaggttcttgccttcccgtgtttagcacactgccctttgcctactgcaac
atccaccaggtgtgccactatgcccagagaaacgacagatcctactggctggccagtgct
gcgcccctccccatgatgccactctctgaagaggcgatccgcccctatgtcagccgctgt
gtggtatgcgagtccccggcccaggcggtggcggtgcacagccaggaccagtccatcccc
ccatgtccgcagacctggaggagcctctggatcgggtattcattcctgatgcacacagga
gctggggaccaaggaggagggcaggccctcatgtcacctggcagctgcctggaagatttc
agagcagcaccattgcttgaatgccaaggccggcagggaacttgccacttttttgcaaat
aagtatagcttctggctcacaacagtgaaagcagacttgcagttttcctctgctccagca
ccagacaccttaaaagaaagccaggcccaacgccagaaaatcagccggtgccaggtctgt
gtgaagtatagctag

DBGET integrated database retrieval system