Pongo abelii (Sumatran orangutan): 100443710
Entry
100443710 CDS
T01416
Symbol
COL4A4
Name
(RefSeq) collagen alpha-4(IV) chain
KO
K06237
collagen type IV alpha
Organism
pon
Pongo abelii (Sumatran orangutan)
Pathway
pon04151  PI3K-Akt signaling pathway
pon04510  Focal adhesion
pon04512  ECM-receptor interaction
pon04820  Cytoskeleton in muscle cells
pon04926  Relaxin signaling pathway
pon04933  AGE-RAGE signaling pathway in diabetic complications
pon04974  Protein digestion and absorption
pon05146  Amoebiasis
pon05165  Human papillomavirus infection
pon05200  Pathways in cancer
pon05222  Small cell lung cancer
Brite
KEGG Orthology (KO) [BR:pon00001]
  09130 Environmental Information Processing
    09132 Signal transduction
      04151 PI3K-Akt signaling pathway
        100443710 (COL4A4)
    09133 Signaling molecules and interaction
      04512 ECM-receptor interaction
        100443710 (COL4A4)
  09140 Cellular Processes
    09144 Cellular community - eukaryotes
      04510 Focal adhesion
        100443710 (COL4A4)
    09142 Cell motility
      04820 Cytoskeleton in muscle cells
        100443710 (COL4A4)
  09150 Organismal Systems
    09152 Endocrine system
      04926 Relaxin signaling pathway
        100443710 (COL4A4)
    09154 Digestive system
      04974 Protein digestion and absorption
        100443710 (COL4A4)
  09160 Human Diseases
    09161 Cancer: overview
      05200 Pathways in cancer
        100443710 (COL4A4)
    09162 Cancer: specific types
      05222 Small cell lung cancer
        100443710 (COL4A4)
    09172 Infectious disease: viral
      05165 Human papillomavirus infection
        100443710 (COL4A4)
    09174 Infectious disease: parasitic
      05146 Amoebiasis
        100443710 (COL4A4)
    09167 Endocrine and metabolic disease
      04933 AGE-RAGE signaling pathway in diabetic complications
        100443710 (COL4A4)
  09180 Brite Hierarchies
    09183 Protein families: signaling and cellular processes
      04147 Exosome [BR:pon04147]
        100443710 (COL4A4)
      00536 Glycosaminoglycan binding proteins [BR:pon00536]
        100443710 (COL4A4)
Exosome [BR:pon04147]
  Exosomal proteins
    Exosomal proteins of other cancer cells
      100443710 (COL4A4)
Glycosaminoglycan binding proteins [BR:pon00536]
  Heparan sulfate / Heparin
    Extracellular matrix molecules
      100443710 (COL4A4)
Motif
Pfam: Collagen C4
Other DBs
NCBI-GeneID: 100443710
NCBI-ProteinID: XP_024098889
Ensembl: ENSPPYG00000013236
Position
2B:complement(114284855..114436281)
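The position string packs chromosome, strand, and coordinates into one token. As a minimal sketch (the helper `parse_position` is hypothetical, not part of KEGG, and it assumes a single range optionally wrapped in `complement(...)`; `join(...)` locations are not handled), it can be unpacked like this:

```python
import re

# Hypothetical helper, not a KEGG API: handles only "<chrom>:<start>..<end>"
# or "<chrom>:complement(<start>..<end>)". join(...) is not supported.
_POSITION = re.compile(r"([^:]+):(complement\()?(\d+)\.\.(\d+)(\))?")


def parse_position(text):
    """Split a position string into chromosome, strand, and coordinates."""
    match = _POSITION.fullmatch(text.strip())
    if match is None:
        raise ValueError(f"unsupported position string: {text!r}")
    chrom, comp_open, start, end, comp_close = match.groups()
    # The closing parenthesis must appear exactly when complement( does.
    if bool(comp_open) != bool(comp_close):
        raise ValueError(f"unbalanced complement(...) in {text!r}")
    start, end = int(start), int(end)
    if start > end:
        raise ValueError(f"start is greater than end in {text!r}")
    return {
        "chromosome": chrom,
        "strand": "-" if comp_open else "+",
        "start": start,
        "end": end,
        "span": end - start + 1,  # coordinates are treated as inclusive
    }


info = parse_position("2B:complement(114284855..114436281)")
print(info["chromosome"], info["strand"], info["span"])  # 2B - 151427
```

Under these assumptions the gene lies on the reverse strand of chromosome 2B and spans 151,427 bp of genomic sequence, far more than the 5055 nt coding sequence, which is consistent with a multi-exon gene.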
AA seq
1684 aa
MWSLHIVLMRYSFGLTKSLATGPWSLILILFSVQYVYGSGKKYVGPCGGRDCSVCHCVPE
KGSRGPPGPPGPQGPIGPLGAPGPIGLSGEKGMRGDRGPPGAAGDKGDKGPTGVPGFPGL
DGIPGHPGPPGPRGKPGMSGHNGSRGDPGFLGGRGALGPGGPPGHPGEKGEKGNSVFILG
AIKGIQGDRGDPGLPGLPGFWGAGGPAGPTGYPGEPGLVGPPGQPGRPGLKGNPGVGIKG
QMGDSGEVGQQGSPGPTLLVEPPDFCLYKGEKGIKGIPGMIGLPGPPGRKGESGIGAKGE
KGIPGFPGPRGDPGSYGSPGFPGLKGELGLVGDPGLFGLIGPKGDPGNRGHPGPPGVLVT
PPLPLKGPPGDPGFPGRYGETGDVGPPGPPGLLGRPGEACAGMIGPPGPQGFPGLPGLPG
EAGIPGRPDSAPGKPGKPGSPGLPGEPGLQGLPGSSATYCSVGNPGPQGIKGKVGPPGGR
GSKGEKGNEGLCACEPGPMGPPGPPGLPGRQGSKGDLGIPGWLGTEGDPGSPGAEGPPGL
PGKHGASGPPGNKGAKGDMVVSRVKGHKGERGPDGPPGFPGQPGSHGRDGHAGEKGDPGP
PGDHEDATPGGKGFPGPLGPPGKAGPVGPPGLGFPGLPGERGHPGVPGRPGVRGPDGLKG
QKGDTISCNVTYPGRQGPPGFDGPPGPKGFPGPQGAPGLSGSDGHKGRPGTPGTSEIPGP
PGFRGDMGDPGFGGEKGSSPVGPPGPPGSPGVNGQKGIPGDPAFGHLGSPGRRGLSGVPG
IKGPRGDPGCPGAEGPAGIPGFPGLKGPKGREGHAGFPGVPGPPGHSCERGAPGIPGQPG
LPGYPGGKGQPGDVGPPGPAGMKGLPGLPGRPGAHGPPGLPGIPGPFGDDGLPGPPGPKG
PQGLPGFPGFPGERGKPGAEGCPGTKGEPGEKGMSGFPGDRGLRGAKGAIGPPGDEGEMA
IISQKGTPGEPGPPGDDGFPGERGDKGTPGMQGRRGEPGRYGPPGFHRGQPGEKGQPGPP
GPPGPPGSMGLRGFIGFPGLPGDQGEPGSPGPPGFSGIDGARGPKGNKGDPASHFGPPGR
KGEPGSPGCPGHFGASGEQGLPGIQGPRGSPGRPGPPGSSGPPGCPGDQGMPGLRGQPGE
MGDPGPRGLQGDPGIPGPPGIKGPSGSPGLNGLHGLKGQKGTKGASGLHDVGPPGPVGIP
GLKGERGDPGSPGISPPGPYGEKGPPGPPGRSGPPGPAGATGRAPKDIPDPGPPGDQGPP
GPDGPRGAPGPPGLPGSVDLLRGEPGDCGLPGPPGPPGPPGPPGYKGFPGCDGKDGQKGP
MGFPGPQGPHGFPGPPGEKGLPGPPGRKGPTGLPGPRGEPGPPADVDDCPRIPGLPGAPG
MRGPEGAMGLPGMRGPPGPGCKGEPGLDGRRGVDGVPGSPGPPGRKGDTGEDGYPGGPGP
PGPTGDPGPKGFGPGYLGGFLLVLHSQTDQEPTCPLGMPRLWTGYSLLYLEGQEKAHNQD
LGLAGSCLPVFSTLPFAYCNIHQVCHYAQRNDRSYWLASAAPLPMMPLSEEAIRPYVSRC
VVCESPAQAVAVHSQDQSIPPCPQTWRSLWIGYSFLMHTGAGDQGGGQALMSPGSCLEDF
RAAPLLECQGRQGTCHFFANKYSFWLTTVKADLQFSSAPAPDTLKESQAQRQKISRCQVC
VKYS
NT seq
5055 nt
atgtggtctctgcacatagtactaatgaggtactccttcggattgaccaagtccttggcc
acaggtccctggtcacttatactcattctcttttctgtacaatatgtatatgggagtgga
aagaaatacgttggtccttgcggaggaagagattgctctgtttgccactgtgttcctgaa
aaggggtctcggggtccaccaggaccaccagggccacagggtccaattggacccctggga
gccccaggacccattgggctttcaggagagaaaggaatgagaggggaccgcggccctcct
ggagcagcaggggacaaaggagataagggtccaactggtgttcctggatttccaggttta
gatggcatacctgggcacccagggcctcctggacccagaggcaaacctggcatgagtggc
cacaatggctcaagaggtgacccagggtttctaggaggaagaggagctcttggcccagga
ggccccccaggccatcctggggaaaagggagaaaaaggaaattcagtgttcattttaggt
gccattaaaggtattcagggagacagaggggacccaggactgcctggcttaccaggattt
tggggtgcaggaggaccagcgggccccacaggatatcctggagagccagggttagtggga
cctccgggccaaccagggcgtccaggtttgaagggaaatcccggtgtgggaataaagggg
caaatgggagactcgggtgaggttggtcagcaaggttctcctggacccaccctgttggta
gagccacctgacttttgtctctataaaggagaaaagggtataaaaggaattcccggaatg
attggactgccaggaccaccaggacgcaagggagaatccggtattggggcaaaaggagaa
aaaggtattcccggatttccagggcctcggggggatcctggttcctatggatctccaggt
tttccaggattaaagggagaactaggactggttggagatcctgggctatttggattaatt
ggcccaaagggggatcctggaaatcgagggcacccaggaccaccaggtgttttggtgact
ccacctcttccactcaaaggcccaccaggggacccagggttccctggccgctatggagaa
acaggggatgttggaccacctggtcccccaggtctcttgggcagaccaggggaagcctgt
gcaggcatgataggaccccctgggccacaaggatttcctggtcttcctgggcttccagga
gaagctggtattcctgggagacctgattctgctccaggaaaaccagggaagccaggatca
cctggcttgcctggagaaccaggcctgcagggcctcccaggatcaagtgcgacatactgc
agtgttgggaaccctggaccacaaggaataaaaggcaaagtgggtcccccaggaggaaga
ggctcaaaaggagaaaaaggaaatgaaggactctgtgcctgtgagcctggtcccatgggc
ccccctgggcctccaggacttcctgggaggcaggggagtaagggagacttggggatccct
ggctggcttggaacagaaggtgacccgggatctcctggtgctgaaggacctccagggcta
ccaggaaagcatggtgcctccggaccacctggcaacaaaggggcaaagggtgacatggtt
gtatcaagagttaaagggcacaaaggagaaagaggtcctgatgggcccccaggatttcca
gggcagccaggatcacatggtcgggatggacatgctggagaaaaaggggatccaggaccc
ccaggggatcatgaagatgcgaccccaggtggtaaaggatttcctggacctctgggcccc
ccgggcaaagcaggacctgtggggcccccaggactgggatttcctggtctaccaggagag
cgaggccacccaggagttccaggccgcccaggtgtgaggggccctgatggcttgaagggt
cagaaaggtgacacaatttcttgcaacgtaacctaccctgggaggcaaggccctccaggt
tttgatggacctccaggtccaaagggatttccaggtccccaaggtgcccccgggctgagt
ggttcagatggacataaaggcagacctggcacaccaggaacatcggaaataccaggtcca
cctggttttcgtggtgacatgggagatccgggttttggaggtgaaaaggggtcctcccct
gttgggcccccaggccctcccgggtcaccaggagtgaatggtcagaaaggaatcccggga
gaccctgcatttggtcacctgggatccccaggaaggaggggtctttcaggagtgccaggg
ataaaaggacccagaggtgatccgggatgtccaggggctgaagggccagctggcattcct
ggattcccaggtctcaaaggtcccaaaggcagagagggacatgctgggtttccaggtgtc
ccaggtccgcctggccattcctgtgaaagaggtgctccagggataccagggcaaccggga
ctccctgggtatccaggtgggaaaggacagccgggagatgtggggcctcccgggccagct
ggaatgaaaggtctccccggactcccaggacggcctggggcacatggtcccccaggcctc
ccaggaatcccaggtccttttggggatgatgggctacccggtcctccaggtccaaaggga
ccccaggggctgcctggtttcccaggttttcccggagaaagaggaaagcctggtgcagag
ggatgtcctggcacaaagggagaacctggagagaagggcatgtctggctttcccggagac
cggggactgagaggagccaaaggagccataggacctcccggagatgaaggagaaatggct
atcatttcccaaaagggaacacctggggaacctggacctcctggagatgatggattccca
ggagaaagaggtgataaaggaactcccgggatgcaagggagaagaggagagccgggaaga
tacggaccacctggatttcacagagggcaacctggcgagaaaggtcagccagggcctcct
ggacccccaggccctccaggctcaatgggtctaagagggttcattggttttccaggactt
ccaggtgaccagggtgagccaggttctccaggtccccctggattttcaggaattgatgga
gcaagaggacctaaaggaaacaaaggtgaccctgccagtcactttggtccacctggtcga
aagggtgagccaggtagccctggatgtccagggcattttggagcatccggagagcagggc
ttgcctggcattcaagggcccagaggatcacccggaaggccagggccacctggctcctct
ggaccaccagggtgcccaggtgatcaggggatgcctgggctgaggggacagccaggagaa
atgggagaccctgggccaagaggcctccagggggatccagggataccaggtcctccggga
ataaaaggtccctccggatcacctggtctaaacggcttgcatggattgaagggtcagaaa
ggaaccaaaggtgcttcaggtttgcatgatgtggggccacctggtccagtgggaatacct
gggctaaaaggggagagaggagatcctgggagcccaggaatctctcctccaggtccttat
ggagaaaaaggtcccccaggtcccccagggagatcaggaccacctggtcctgcaggtgcc
acaggaagagctcctaaggacattcctgacccgggtccacctggagatcagggacctcct
ggtcctgatggcccaagaggagcacctgggcctccaggcctccctgggagtgttgacctt
ctgagaggggagccaggtgactgtggtctaccagggccaccaggtccccctggcccacca
ggccctccaggatacaaaggctttccaggatgcgatggaaaagatggccagaaaggacca
atgggattcccggggccgcagggaccacatggatttcctgggccacctggagagaagggt
ttacctggacctccagggagaaaagggcccactggtcttccaggtcccagaggtgaacca
gggccacctgcagatgtggatgactgtccccgaatcccaggccttcctggggcaccaggc
atgagaggaccagaaggagccatggggctccctggaatgagaggccccccaggaccaggg
tgcaaaggagagcctgggctggatggcaggaggggtgtggatggcgtccctgggtctcct
gggcctcctggacgtaaaggtgacacaggagaagacggctaccctggaggaccagggcct
cctggtcccactggggatcctgggcccaaagggtttggccctggatacctcggtggcttc
ctcctggttctccacagtcagacggaccaggagcccacctgccccctgggcatgcccagg
ctctggactgggtatagtctgttatacctggaagggcaagagaaagctcacaatcaagac
cttggtctggcaggttcttgccttcccgtgtttagcacactgccctttgcctactgcaac
atccaccaggtgtgccactatgcccagagaaacgacagatcctactggctggccagtgct
gcgcccctccccatgatgccactctctgaagaggcgatccgcccctatgtcagccgctgt
gtggtatgcgagtccccggcccaggcggtggcggtgcacagccaggaccagtccatcccc
ccatgtccgcagacctggaggagcctctggatcgggtattcattcctgatgcacacagga
gctggggaccaaggaggagggcaggccctcatgtcacctggcagctgcctggaagatttc
agagcagcaccattgcttgaatgccaaggccggcagggaacttgccacttttttgcaaat
aagtatagcttctggctcacaacagtgaaagcagacttgcagttttcctctgctccagca
ccagacaccttaaaagaaagccaggcccaacgccagaaaatcagccggtgccaggtctgt
gtgaagtatagctag
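The reported lengths can be cross-checked against each other: a coding sequence that ends in a stop codon should be three nucleotides longer than three times the protein length. A minimal sketch, assuming the standard stop codons and using a hypothetical helper name (`cds_length_consistent`):

```python
# Standard stop codons, lowercase to match the NT seq above (an assumption:
# the standard genetic code applies to this nuclear gene).
STOP_CODONS = {"taa", "tag", "tga"}


def cds_length_consistent(nt_len, aa_len, last_codon):
    """Return True if nt_len equals 3 * (aa_len + 1 if the CDS ends in a stop codon)."""
    has_stop = last_codon.lower() in STOP_CODONS
    expected_codons = aa_len + (1 if has_stop else 0)
    return nt_len == 3 * expected_codons


# Values from this entry: 5055 nt, 1684 aa, final codon "tag".
print(cds_length_consistent(5055, 1684, "tag"))  # True
```

Here 1684 residues plus one stop codon give 1685 codons, and 1685 × 3 = 5055 nt, matching the stated NT seq length.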