Nomascus leucogenys (northern white-cheeked gibbon): 100603791
Entry
100603791 CDS
T03265
Symbol
COL4A1
Name
(RefSeq) collagen alpha-1(IV) chain isoform X1
KO
K06237  collagen type IV alpha
Organism
nle  Nomascus leucogenys (northern white-cheeked gibbon)
Pathway
nle04151  PI3K-Akt signaling pathway
nle04382  Cornified envelope formation
nle04510  Focal adhesion
nle04512  ECM-receptor interaction
nle04518  Integrin signaling
nle04820  Cytoskeleton in muscle cells
nle04926  Relaxin signaling pathway
nle04933  AGE-RAGE signaling pathway in diabetic complications
nle04974  Protein digestion and absorption
nle05146  Amoebiasis
nle05165  Human papillomavirus infection
nle05200  Pathways in cancer
nle05222  Small cell lung cancer
Brite
KEGG Orthology (KO) [BR:nle00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    100603791 (COL4A1)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    100603791 (COL4A1)
   04518 Integrin signaling
    100603791 (COL4A1)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    100603791 (COL4A1)
  09142 Cell motility
   04820 Cytoskeleton in muscle cells
    100603791 (COL4A1)
 09150 Organismal Systems
  09152 Endocrine system
   04926 Relaxin signaling pathway
    100603791 (COL4A1)
  09154 Digestive system
   04974 Protein digestion and absorption
    100603791 (COL4A1)
  09158 Development and regeneration
   04382 Cornified envelope formation
    100603791 (COL4A1)
 09160 Human Diseases
  09161 Cancer: overview
   05200 Pathways in cancer
    100603791 (COL4A1)
  09162 Cancer: specific types
   05222 Small cell lung cancer
    100603791 (COL4A1)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    100603791 (COL4A1)
  09174 Infectious disease: parasitic
   05146 Amoebiasis
    100603791 (COL4A1)
  09167 Endocrine and metabolic disease
   04933 AGE-RAGE signaling pathway in diabetic complications
    100603791 (COL4A1)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   04147 Exosome [BR:nle04147]
    100603791 (COL4A1)
   00536 Glycosaminoglycan binding proteins [BR:nle00536]
    100603791 (COL4A1)
Exosome [BR:nle04147]
 Exosomal proteins
  Exosomal proteins of other cancer cells
   100603791 (COL4A1)
Glycosaminoglycan binding proteins [BR:nle00536]
 Heparan sulfate / Heparin
  Extracellular matrix molecules
   100603791 (COL4A1)
Motif
Pfam: Collagen C4
Other DBs
NCBI-GeneID: 100603791
NCBI-ProteinID: XP_030669544
Ensembl: ENSNLEG00000007914
Position
5:complement(136428524..136586812)
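
The position string above can be read as chromosome, strand, and coordinate range. The following is a minimal sketch, assuming the string follows the GenBank-style location convention (1-based, inclusive coordinates, with `complement(...)` marking the reverse strand). The function name `parse_position` is hypothetical and is not part of any KEGG tool; joined or multi-segment locations are not handled.

```python
import re

# Two forms are assumed here: "chrom:start..end" (forward strand) and
# "chrom:complement(start..end)" (reverse strand).
_POSITION = re.compile(
    r"(?P<chrom>[^:]+):"
    r"(?:complement\((?P<cstart>\d+)\.\.(?P<cend>\d+)\)"
    r"|(?P<start>\d+)\.\.(?P<end>\d+))"
)


def parse_position(position):
    """Return (chromosome, strand, start, end, span) for a position string.

    The span is end - start + 1, which assumes 1-based inclusive coordinates.
    """
    match = _POSITION.fullmatch(position.strip())
    if match is None:
        raise ValueError(f"unrecognized position string: {position!r}")
    on_complement = match.group("cstart") is not None
    start = int(match.group("cstart") if on_complement else match.group("start"))
    end = int(match.group("cend") if on_complement else match.group("end"))
    strand = "-" if on_complement else "+"
    return match.group("chrom"), strand, start, end, end - start + 1


print(parse_position("5:complement(136428524..136586812)"))
# ('5', '-', 136428524, 136586812, 158289)
```

Under that assumption the locus spans 158,289 bp on the reverse strand of chromosome 5.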
AA seq
1669 aa
MGPRLSVWLLLLPAALLLHEEHSRAAAKGGCAGSGCGKCDCHGVKGQKGERGLPGLQGVI
GFPGMQGPEGPQGPPGQKGDTGEPGLPGTKGTRGPPGASGYPGNPGLPGIPGQDGPPGPP
GIPGCNGTKGERGPLGPPGLPGFAGNPGPPGLPGMKGDPGEILGHVPGMLLKGERGFPGI
PGNPGPPGLPGLQGPIGPPGFTGPPGPPGPPGPPGEKGQMGLSFQGPKGDKGDQGVSGPP
GVPGQAQVQEKGDFATKGEKGQKGEPGFQGMPGVGEKGEPGKLGPRGKPGKDGDKGEKGS
PGFPGEPGYPGLVGRQGPQGEKGEAGPPGPPGIVIGTGPLGEKGERGYPGTPGPRGEPGP
KGFPGLPGQPGAPGLPVPGLAGAPGFPGERGEKGDRGFPGVPLPGPSGRDGLPGPPGSPG
PPGQPGYTNGIVECQPGPPGDQGPPGIPGQPGFIGEIGEKGQKGESCLLCDIDGYRGPPG
PQGPPGEIGFPGQPGAKGDRGLPGRDGVAGVPGPQGTPGLIGQPGAKGEPGEIYFDLRLK
GDKGDPGFPGQPGMPGRAGSPGRDGHPGLPGPKGSPGSVGLKGERGPPGGVGFPGSRGDT
GPPGPPGYGPTGPIGDKGQAGFPGGPGSPGLPGPKGEPGKVVPLPGPPGAEGLPGSPGFP
GPQGDRGFPGTPGRPGLPGEKGAVGQPGIGFPGPPGPKGVDGLPGDMGPPGTPGRPGFNG
LPGNPGVQGQKGEPGVGLPGLKGLPGLPGIPGTPGEKGSIGVPGVPGEHGVIGPPGLQGI
RGEPGPPGLPGSVGSPGVPGIGPPGARGPPGGQGPPGLSGPPGIKGEKGFPGFPGLDMPG
PKGDKGAQGLPGITGQSGLPGLPGQQGAPGIPGFPGSKGEMGVMGTPGQPGSPGPVGAPG
LPGEKGDHGFPGSSGPRGDPGLKGDKGDVGLPGKPGSMDKVDMGSMKGQKGDQGEKGQIG
PIGEKGSRGDPGTPGVPGKDGQAGQPGQPGPKGDPGISGTPGAPGLPGPKGSVGGMGLPG
TPGEKGVPGIPGPQGSPGLPGDKGAKGEKGQAGPPGIGIPGLPGEKGDQGIVGFPGSPGE
KGEKGSIGIPGMPGSPGLKGSPGSVGYPGSPGLPGEKGDKGLPGLDGIPGVKGEAGLPGT
PGPTGPAGQKGEPGSDGIPGSAGEKGEPGLPGRGFPGFPGAKGDKGSKGEVGFPGLAGSP
GIPGSKGEQGFMGPPGPQGQPGLPGSPGHATEGPKGDRGPQGQPGLPGLPGPMGPPGLPG
IDGVKGDKGNPGWPGAPGVPGPKGDPGFQGMPGIGGSPGITGSKGDMGPPGVPGFQGPKG
LPGLQGIKGDQGDQGVPGAKGLPGPPGPPGPYDIIKGEPGLPGPEGPPGLKGLQGLPGPK
GQQGVTGLVGIPGPPGIPGFDGAPGQKGEMGPAGPTGPRGFPGPPGPDGLPGSMGPPGTP
SVDHGFLVTRHSQTIDDPQCPSGTKILYHGYSLLYVQGNERAHGQDLGTAGSCLRKFSTM
PFLFCNINNVCNFASRNDYSYWLSTPEPMPMSMAPITGDNIRPFISRCAVCEAPAMVMAV
HSQTIQIPPCPSGWSSLWIGYSFVMHTSAGAEGSGQALASPGSCLEEFRSAPFIECHGRG
TCNYYANAYSFWLATVERSEMFKKPTPSTLKAGELRTHVSRCQVCMRRT
NT seq
5010 nt
atggggccccggctcagcgtctggctgctgctgctgcccgccgcccttctgctccacgag
gagcacagccgggccgctgcgaagggtggctgtgctggctctggctgtggcaaatgtgac
tgccatggagtgaagggacaaaagggtgaaaggggcctcccggggttacaaggtgtcatt
gggtttcctggaatgcaaggacctgaggggccgcagggaccaccaggacaaaagggtgat
actggagaaccaggactacctggaacaaaagggacaagaggacctccgggagcatctggc
taccctggaaacccaggacttcccggcattcctggccaagacggcccaccaggcccccca
ggtattccaggatgcaatggcacaaagggagagagagggccgctcgggcctcctggcttg
cctggtttcgctggaaatcccgggccaccagggttaccaggaatgaagggtgatccaggt
gagatacttggccatgtgcccgggatgctgttgaaaggcgaaagaggatttcccggaatc
ccagggaatccaggcccaccaggactgccagggcttcaaggtcctattgggcctccagga
tttaccggaccaccaggtcccccaggccctcccggccctccaggtgaaaagggacaaatg
ggcttaagttttcaaggaccgaaaggtgacaagggtgaccaaggggtcagtgggcctccg
ggagtaccaggacaagctcaagttcaagaaaaaggagactttgccaccaagggagaaaag
ggccaaaaaggtgaacctggatttcaggggatgccaggggtcggagagaaaggtgaacct
ggaaaactaggacccagaggaaaacccggaaaagatggtgacaaaggggaaaaagggagt
cccggttttcctggtgaacccgggtacccaggactcgtaggccgccagggcccgcaggga
gaaaagggtgaagcaggtcctcccggcccacctggaattgttataggcacaggacctttg
ggagaaaaaggagagaggggctaccctggaactccggggccaagaggagagccaggccca
aaaggtttcccaggactaccaggccaacccggagctccaggcctccctgtacctgggctg
gctggtgcccctggcttccctggtgaaagaggagaaaaaggtgaccgaggatttcctggt
gtacctctgccaggaccaagtggaagagacgggctcccgggtcctcctggttcccccggg
ccccctgggcagcctggctacacaaatggaattgtggaatgtcagcccggacctccaggt
gaccagggtcctcctggaattccagggcagccaggatttataggcgaaattggagagaaa
ggtcaaaaaggagagagttgccttctctgtgatatagatggatatcgggggcctcccggg
ccacaaggacccccaggagaaataggtttcccaggacagccgggggccaagggcgacaga
ggtttgcctggcagagatggtgttgcgggagtgccagggcctcaaggtacaccagggctg
ataggccagccgggagccaagggggagcctggtgagatttatttcgacttgcggctcaaa
ggtgacaaaggagacccaggctttccaggacagcccggcatgccagggagagcgggttct
cctggaagagatggccatccgggtcttcctggccccaagggctcaccgggttctgtagga
ttgaaaggagagcgtggcccccctggaggagttggatttccaggcagtcgtggtgacacc
ggcccccctgggcctccaggatatggtcctactggtcccattggtgacaaaggacaagca
ggctttcctggaggccctggatccccaggcctgccaggtccaaagggtgaaccaggaaaa
gttgttcctttaccaggcccccctggagcagaaggactgccggggtccccaggcttccca
ggtccccaaggagaccgaggctttcccggaaccccaggaaggccaggcctgccaggagag
aagggtgctgtgggccagccgggaattggatttccagggccccccggccccaaaggtgtt
gacggcttacctggagacatggggcctccggggactccaggtcgcccgggatttaatggc
ttacctggaaacccaggtgtgcagggccagaagggagagcctggagttggtctgccggga
ctcaaaggtttgccaggtcttccaggcattcctggcacacctggggagaaggggagcatt
ggggtaccaggcgttcctggagaacacggagtgatcggaccccctgggcttcaggggatc
agaggtgaaccgggacctcctggattgccaggctccgtggggtctccaggagttccagga
ataggcccccctggagctaggggtccccccggaggacaaggaccaccggggttgtcaggc
cctcctggaataaaaggagagaagggtttccccggattccctggactggacatgccgggc
cctaaaggagataaaggggctcaaggacttcctggcataacgggacagtcagggctccct
ggccttcctggacagcagggggctcctgggattcctgggtttccaggttccaagggagaa
atgggcgtcatggggacccccgggcagccgggctcaccaggaccagtgggtgctccggga
ttaccgggtgaaaaaggggaccatggctttccgggctcctcaggacccaggggagaccct
ggcttgaaaggtgataagggggatgtcggtcttcctggcaagcctggctccatggataag
gtggacatgggcagcatgaagggccagaagggagaccaaggagagaaaggacaaattgga
ccaattggtgagaagggatcccgaggagaccctgggaccccaggagtgcctggaaaggac
gggcaggcaggacagcctgggcagccaggacctaaaggtgatccaggtataagtggaacc
ccaggtgctccaggacttccgggaccaaaagggtctgttggtggaatgggcttgccagga
acacctggagagaaaggtgtgcctggcatccctggcccacaaggttcacctggcttacct
ggagacaaaggtgcaaaaggagagaaagggcaggcaggcccacctggcataggcatccca
gggctgcctggtgaaaagggagatcaagggatagtgggtttcccaggaagccctggagag
aagggagaaaaaggaagcattgggatcccaggaatgccagggtctccaggccttaaaggg
tctcccgggagtgttggttatccaggaagccctgggctgcctggagaaaaaggtgacaaa
ggcctcccaggattggatggcatccctggcgtcaaaggagaagcaggtcttcctgggacg
cctggccccacaggcccagctggccagaaaggggagccaggcagtgacggaatcccaggg
tcagcaggagagaagggtgaaccaggtctaccaggaagaggattcccagggtttccaggg
gccaaaggagacaaaggttcaaagggtgaggtgggtttcccaggattagccgggagccca
ggaattcctggatccaaaggagagcaaggattcatgggtcccccggggccccaaggtcag
ccagggttaccgggatccccaggccatgccacggaggggcccaaaggagaccgcggacct
cagggccagcctggcctgccaggacttccgggacccatggggcctccagggcttcctggg
attgatggagttaaaggtgacaaaggaaatccaggctggccaggagcacccggtgtccca
gggcccaagggagaccctggattccagggcatgcctggtatcggtggctctccaggaatc
acaggctctaagggtgatatggggcctccaggagttccaggatttcaaggtccaaaaggt
cttcctggcctccagggaattaaaggtgatcaaggcgatcaaggtgtcccgggagctaaa
ggtctcccgggtcctcctggccccccaggtccttacgacatcatcaaaggggagcccggg
ctccctggtcctgagggccccccagggctgaaagggcttcagggactgccaggcccgaaa
ggccagcaaggtgttacaggattggtgggcatacctggacctccaggtattcctgggttt
gacggtgcccctggccagaaaggagagatgggacctgccgggcctactggtccaagagga
tttccaggtccaccaggccccgatgggttgccaggatccatggggcccccaggcacccca
tctgttgatcacggcttccttgtgaccaggcacagtcaaacaatagatgacccacagtgt
ccttctgggaccaaaattctttaccatgggtactctttgctctacgtgcaaggcaatgag
cgggcccatggccaggacttgggcacggccggcagctgcctgcgcaagttcagcacgatg
cccttcctgttctgcaatattaacaacgtgtgcaactttgcatcacgcaatgactactcg
tactggctgtccacccccgagcccatgcccatgtcaatggcacccatcacgggggacaat
ataagaccatttattagtaggtgtgctgtgtgtgaggcgcctgccatggtaatggccgtg
cacagtcagaccattcagatcccaccgtgccccagtgggtggtcctcgctgtggattggc
tactcttttgtgatgcacaccagcgctggtgcagaaggctctggccaagccctggcgtcc
cccggctcctgtctggaggagtttagaagtgcgccattcatcgagtgtcatggccgtggg
acctgtaattactacgcaaacgcttacagcttttggctcgccaccgtagagaggagcgag
atgttcaagaagcccacgccgtccaccttgaaggcaggggagctacgcacgcacgtcagc
cgctgccaagtctgtatgagaagaacataa
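
The two stated lengths (1669 aa and 5010 nt) can be checked against each other. The sketch below assumes the nucleotide sequence is a complete coding sequence that includes its stop codon and is translated with the standard genetic code. The helper names are hypothetical, and the two short strings are the first and last bases copied from the NT seq above.

```python
STOP_CODONS = {"taa", "tag", "tga"}  # stop codons of the standard genetic code


def expected_cds_length(aa_length, includes_stop=True):
    """Nucleotide length implied by a protein length (3 nt per codon)."""
    return 3 * (aa_length + (1 if includes_stop else 0))


def looks_like_complete_cds(first_bases, last_bases):
    """True if the sequence starts with ATG and ends with a stop codon."""
    starts_with_atg = first_bases.lower().startswith("atg")
    ends_with_stop = last_bases.lower()[-3:] in STOP_CODONS
    return starts_with_atg and ends_with_stop


print(expected_cds_length(1669))  # 5010
print(looks_like_complete_cds("atggggccccggctcagc", "tgtatgagaagaacataa"))  # True
```

With the stop codon counted, 3 × (1669 + 1) = 5010, which matches the stated nucleotide length.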