Capra hircus (goat): 102171079
Entry
102171079  CDS  T02910
Symbol
COL4A1
Name
(RefSeq) LOW QUALITY PROTEIN: collagen alpha-1(IV) chain
KO
K06237  collagen type IV alpha
Organism
chx  Capra hircus (goat)
Pathway
chx04151  PI3K-Akt signaling pathway
chx04382  Cornified envelope formation
chx04510  Focal adhesion
chx04512  ECM-receptor interaction
chx04820  Cytoskeleton in muscle cells
chx04926  Relaxin signaling pathway
chx04933  AGE-RAGE signaling pathway in diabetic complications
chx04974  Protein digestion and absorption
chx05146  Amoebiasis
chx05165  Human papillomavirus infection
chx05200  Pathways in cancer
chx05222  Small cell lung cancer
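Each pathway identifier above combines the organism code listed under Organism (chx) with a five-digit map number, and the same numbers reappear in the Brite listing (for example 04151). A minimal sketch of splitting such an identifier follows; the function name `split_pathway_id` is hypothetical, and the pattern (a three- or four-letter lowercase organism code followed by five digits) is an assumption based on the identifiers shown in this entry.

```python
import re

# Assumed shape: lowercase organism code (3-4 letters) + 5-digit map number.
PATHWAY_ID = re.compile(r"^([a-z]{3,4})(\d{5})$")


def split_pathway_id(pathway_id):
    """Split an organism-specific pathway ID into (organism code, map number)."""
    match = PATHWAY_ID.match(pathway_id)
    if match is None:
        raise ValueError(f"unrecognized pathway ID: {pathway_id!r}")
    return match.group(1), match.group(2)


org, number = split_pathway_id("chx04151")
print(org, number)  # chx 04151
```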
Brite
KEGG Orthology (KO) [BR:chx00001]
  09130 Environmental Information Processing
    09132 Signal transduction
      04151 PI3K-Akt signaling pathway
        102171079 (COL4A1)
    09133 Signaling molecules and interaction
      04512 ECM-receptor interaction
        102171079 (COL4A1)
  09140 Cellular Processes
    09144 Cellular community - eukaryotes
      04510 Focal adhesion
        102171079 (COL4A1)
    09142 Cell motility
      04820 Cytoskeleton in muscle cells
        102171079 (COL4A1)
  09150 Organismal Systems
    09152 Endocrine system
      04926 Relaxin signaling pathway
        102171079 (COL4A1)
    09154 Digestive system
      04974 Protein digestion and absorption
        102171079 (COL4A1)
    09158 Development and regeneration
      04382 Cornified envelope formation
        102171079 (COL4A1)
  09160 Human Diseases
    09161 Cancer: overview
      05200 Pathways in cancer
        102171079 (COL4A1)
    09162 Cancer: specific types
      05222 Small cell lung cancer
        102171079 (COL4A1)
    09172 Infectious disease: viral
      05165 Human papillomavirus infection
        102171079 (COL4A1)
    09174 Infectious disease: parasitic
      05146 Amoebiasis
        102171079 (COL4A1)
    09167 Endocrine and metabolic disease
      04933 AGE-RAGE signaling pathway in diabetic complications
        102171079 (COL4A1)
  09180 Brite Hierarchies
    09183 Protein families: signaling and cellular processes
      04147 Exosome [BR:chx04147]
        102171079 (COL4A1)
      00536 Glycosaminoglycan binding proteins [BR:chx00536]
        102171079 (COL4A1)
Exosome [BR:chx04147]
  Exosomal proteins
    Exosomal proteins of other cancer cells
      102171079 (COL4A1)
Glycosaminoglycan binding proteins [BR:chx00536]
  Heparan sulfate / Heparin
    Extracellular matrix molecules
      102171079 (COL4A1)
Motif
Pfam: Collagen C4
Other DBs
NCBI-GeneID: 102171079
NCBI-ProteinID: XP_017911853
Position
12:1302093..1432028
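The position string above can be unpacked into a chromosome and a coordinate range. The sketch below is illustrative only: `parse_position` is a hypothetical helper, it assumes the coordinates are 1-based and inclusive, and it handles only the plain `chromosome:start..end` form shown in this entry.

```python
def parse_position(position):
    """Parse 'chrom:start..end' into (chrom, start, end, span_length).

    Assumes 1-based inclusive coordinates, so span = end - start + 1.
    """
    chrom, sep, span = position.partition(":")
    if not sep:
        raise ValueError(f"missing chromosome separator in {position!r}")
    start_text, sep, end_text = span.partition("..")
    if not sep:
        raise ValueError(f"missing '..' range separator in {position!r}")
    start, end = int(start_text), int(end_text)
    return chrom, start, end, end - start + 1


chrom, start, end, length = parse_position("12:1302093..1432028")
print(chrom, start, end, length)  # 12 1302093 1432028 129936
```

Under that assumption the gene spans 129936 bp on chromosome 12, far longer than the 5010 nt coding sequence listed in this entry, which is what one would expect if most of the span is intronic.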
AA seq
1669 aa
MGPRLGAWLLLGLAALLLHEESSRAAAKGGCAGSGCGKCDCHGVKGQKGERGLPGLQGVI
GFPGMQGPEGPQGPPGQKGDTGEPGLPGTKGTRGPSGVPGYPGNPGLPGIPGQDGPPGPP
GIPGCNGTKGERGPAGPPGLPGFAGNPGPPGLPGMKGDPGEILGHIPGTLLKGERGYPGQ
PGAPGSPGLPGLQGPVGPPGFTGPPGPPGPPGPPGEKGQMGLSFQGPKGEKGDQGVSGPP
GLPGQAQVITKGDTAMRGEKGQKGEPGFPGLQGFGEKGEPGKPGPRGKPGKDGEKGEKGS
QGFPGDSGYPGQPGREGLKGEKGEAGPPGLPGTVIGTGPLGEKGEPGYPGGPGAKGETGP
KGYPGIPGQPGPPGFPTPGLIGAPGFPGDRGEKGEPGLPGVSLPGPSGRDGLPGPPGPPG
PPGQPGHTNGLVECQPGPPGDQGPPGIPGQPGLTGEVGEKGQKGDSCLVCDTTELRGPPG
PQGPPGEIGFPGQPGAKGDRGLPGRAGVEGTPGPQGVPGLMGQPGAKGEPGEIYFDIRLK
GDKGDPGLPGQPGMPGRAGSPGRDGQPGLPGPRGSPGSVGLKGERGPPGGVGFPGSRGDX
GPPGPPGFGPIGPIGDKGEMGFPGNPGAPGQPGLKGETGKVVPLPGPPGAEGLPGSPGFQ
GPQGDRGFPGSPGRPGLPGEKGAIGQPGIGFPGPPGPKGVDGIPGDAGPPGNPGRQGFNG
LPGNPGPPGQKGEPGVGLPGLKGLPGIPGIPGTPGEKGNVGGPGVPGEHGAIGPPGLQGL
RGDPGPPGLQGPRGAPGVPGIGPPGAMGPPGGQGPPGSSGPPGVKGEKGFPGFPGLDMPG
PKGDKGSQGLPGLTGQSGLPGLPGQQGSPGQPGIPGPKGEMGVMGTPGQPGSPGPAGVPG
LPGAKGEHGFPGSSGPRGDPGFKGDKGDVGLPGKPGSMDKVDMGSMKGEKGDQGEKGQTG
PTGDKGSRGDPGTPGVPGKDGQAGHPGQPGPKGDPGVSGIPGAPGLPGPKGSTGGMGLPG
MPGPKGVAGIPGPQGIPGLPGDKGAKGEKGQAGLPGIGIPGRPGEKGDQGLAGFPGSPGE
KGEKGSTGIPGMPGAPGPKGSPGSVGYPGSPGLPGEKGDKGLPGLDGTPGIKGEAGLPGK
PGPTGPAGQKGEPGSDGIPGSVGEKGEAGLPGRGFPGFPGSKGEKGSKGDVGFPGLAGSP
GIPGSKGEQGFMGPPGPQGQPGLPGTPGHAVEGPKGDRGPQGQPGLPGHPGPMGPPGLPG
LDGLKGDKGNPGWPGTPGAPGPKGDPGFQGMPGIGGSPGITGAKGDMGPPGVPGFHGQKG
APGLQGVKGDQGDQGFPGTKGLPGPPGPPGPFNIIKGEPGLPGPEGPAGLKGLQGPPGPK
GQQGVTGSAGLPGPPGEPGFDGAPGQKGETGPFGPPGPRGFPGPPGPDGLPGSMGPPGTP
SVDHGFLVTRHSQTTDDPQCPPGTKILYHGYSLLYVQGNERAHGQDLGTAGSCLRKFSTM
PFLFCNINNVCNFASRNDYSYWLSTPEPMPMSMAPITGENIRPFISRCAVCEAPAMVMAV
HSQTIQIPQCPAGWSSLWIGYSFVMHTSAGAEGSGQALASPGSCLEEFRSAPFIECHGRG
TCNYYANAYSFWLATIERSEMFKKPTPSTLKAGELRTHVSRCQVCLRRT
NT seq
5010 nt
atggggccccggctcggcgcgtggctgctgctggggctcgccgcgctcctgctccacgag
gagagcagccgggccgccgcgaagggtgggtgtgctggctctggctgcgggaagtgtgac
tgccatggcgtgaaaggacagaagggagaaagaggcctcccggggttacaaggggtcatc
gggtttcccggaatgcaaggacccgaggggccgcagggaccaccaggacagaagggtgac
accggcgagcccggactgccaggcactaaagggacgagaggaccctcaggagtgcctggt
taccctggaaacccaggacttcctggtattcctggccaggacggtcctccgggtccccca
ggtattccaggatgtaacgggacaaagggtgagagagggcccgcgggacctcccggtttg
cctggattcgccggaaatcccggaccaccagggttaccgggaatgaagggagatccaggt
gagattctgggccatataccagggaccctgctgaaaggcgaaagaggatatcctggacag
ccgggagcgcctggttcaccaggcctgccaggactgcaaggccccgtcgggcccccagga
ttcaccgggccaccaggccctccaggccctcctggccctccaggcgaaaaggggcaaatg
ggcttgagctttcaagggccgaaaggtgaaaagggtgatcaaggggtcagcgggcccccg
ggattaccaggacaggctcaagtcatcacgaaaggggacacagccatgcgtggcgagaag
ggtcaaaaaggtgaacccggatttccggggctgcaagggtttggagagaaaggagaacct
ggaaaaccagggccccgtggaaaaccaggaaaagatggtgaaaaaggagaaaaagggagt
caagggtttccgggcgattcagggtacccaggacagccaggccgagaaggtttaaaggga
gagaaaggtgaagcaggtcctcccgggctgcctggaactgttattggcacaggacccttg
ggagagaaaggagagcccgggtacccagggggcccaggggcgaaaggggagacaggtccc
aaaggttacccaggaataccaggccagccaggccctccaggcttcccgactccggggctg
attggtgcccccggcttccccggcgacagaggagagaagggtgaaccgggcttgccgggt
gtgtcgctgccaggacccagcggaagggacgggcttcccggcccccccgggccccccggg
ccccctgggcagccgggccacacaaatggactcgtggaatgccagcctgggccaccaggg
gaccagggtcctcccggaattccagggcagccggggttgacgggcgaagttggagaaaaa
ggtcaaaaaggagacagctgcctcgtctgtgacacaacagagcttcgtgggcccccaggg
ccacagggaccccccggagaaataggtttcccaggacaaccaggggccaagggagacaga
ggcttacccggcagggctggtgtggaaggaacgcctggtcctcaaggtgtgccagggctc
atgggccagccgggagccaagggcgagcccggcgagatctacttcgacatacggctcaag
ggcgacaaaggagaccccggcttaccaggccagcctggcatgccaggcagagcgggctcc
cctggaagagacggccaaccgggccttcccggccccagaggctccccgggttcagtagga
ttgaaaggggagcgtggccccccgggaggcgtcggattccccgggagccgtggcgacnat
ggccctccggggcctccaggcttcggcccgattggccccattggtgacaaaggagaaatg
ggcttcccaggcaaccctggggccccaggccagccaggtctcaagggagagacgggaaaa
gtcgtgcccttgcccggcccccctggagcagaaggacttcccgggtcccccggcttccag
gggccacaaggtgaccgaggttttcctggaagccccggaaggccgggcctccctggagag
aagggtgccatcggccagcctgggattggatttcctgggcctcctggccccaaaggcgtt
gatggtatacctggagacgctggacctcctgggaatccgggtcgtcaaggcttcaacggc
ttacctggcaaccccggtccacctggccagaagggcgagcctggagtcggtctgccggga
ctcaaaggcctgcctgggatacctggcatccctggcacccccggggagaagggaaacgtc
ggaggaccgggcgttcctggagagcacggcgccatcggccccccaggcctccaggggctc
agaggtgacccgggacctcctggattgcaaggccccagaggagctccgggagtccccgga
atcggccctcctggagcaatgggcccccccggaggacagggacccccagggtcatcaggc
ccccccggagtgaaaggagagaaaggcttccccggcttcccaggtctggacatgccaggt
cccaaaggagacaaagggtcccaggggctccccggcctgacggggcagtcggggctgcct
ggccttcctggacagcagggctcccccggccagcctggcattccaggtcccaagggagag
atgggagtcatggggactccggggcagcccggctcgccaggaccagcgggcgtgccagga
ttgccgggtgccaaaggggaacacggcttccccggctcctcaggacccaggggagaccct
ggcttcaagggtgacaaaggcgacgtggggctccccggcaagccaggctccatggataag
gtggacatgggcagcatgaagggcgagaagggggaccaaggcgagaaaggacagactggt
ccgactggcgataaaggatcccgcggagacccgggaaccccaggcgtgccgggaaaggac
ggtcaggcaggacaccccgggcagccaggacctaaaggtgatccaggtgtgagcgggatc
cctggtgctccgggacttcctggtcccaaaggatccactggtggaatgggcctcccagga
atgccgggaccaaaaggtgtggctggcatccccggcccgcagggcattcctggcttacct
ggagacaagggggcaaaaggagagaaagggcaggcgggtctgcctggcattgggattcca
ggacggcctggggagaagggagaccagggccttgcaggatttcccggaagccccggcgag
aagggagagaaaggaagcacggggatcccagggatgcccggggctccgggccccaaaggc
tccccgggcagtgttggctatccgggaagccctgggttgcctggggagaaaggtgacaag
ggcctcccaggactggatggcactcctggcatcaaaggagaagcaggtcttcctgggaag
cctggccccacgggcccagccggccagaaaggggagcccggcagcgatggaatcccaggg
tcggtgggagagaagggcgaggcaggtctacctggaagaggattcccagggtttccaggg
agcaaaggagagaaaggttcaaagggcgatgtgggcttcccaggattagctgggagccca
ggaattcctggatccaaaggagaacaaggattcatgggtcccccgggaccacagggacag
ccgggattgccagggaccccaggccacgcggtagaggggcccaaaggagaccgcggcccg
caaggacaacccggcctgccagggcatccgggacccatggggcctccaggcctccccggg
ctcgatgggctgaaaggtgacaaggggaacccaggctggccgggcactccgggagctcca
gggcccaagggagacccaggattccagggcatgccgggcattggcggctctccaggaatc
acaggagctaagggagatatgggacccccaggagttccagggtttcatggtcaaaaaggc
gcccccggcctgcagggagtcaaaggcgaccaaggagaccaaggcttcccgggaacgaaa
ggtcttcctggccccccgggccccccaggtccgttcaacatcatcaagggggaaccaggg
ctccctggtcccgagggccctgcgggtctgaaagggcttcagggacctccaggcccgaaa
ggacagcaaggtgtgacgggatccgcgggcttgcctgggcccccaggtgagcccggcttt
gacggcgcccccggccagaaaggagagacggggcccttcggccctccaggtccacgaggc
ttcccaggtccgcccggccccgacgggctgccggggtccatgggtcccccgggcaccccg
tcagtcgatcacggcttccttgtgacccggcacagtcagacgacagacgacccccagtgc
cctcctgggaccaaaatcctctaccacggctactctttgctctacgtgcaaggcaacgag
cgggcgcatggccaggacttgggcacggcgggcagctgcctgcggaagttcagcaccatg
cccttcctcttctgcaacatcaacaacgtctgcaacttcgcctcccgcaatgactactcg
tactggctgtccacgccggagcccatgcccatgtccatggcgcccatcaccggggagaac
atccggcccttcatcagcaggtgtgctgtgtgtgaggccccagcgatggtgatggccgtg
cacagccagaccatccagattccgcagtgccctgccggctggtcctcgctctggatcggc
tactcctttgtgatgcacaccagcgccggggctgaaggctctggccaagccctcgcctcc
cccggctcatgtctggaggagttcaggagcgcccccttcatcgagtgccacggccgtgga
acttgcaattactacgcaaacgcttacagcttttggcttgccacgatagagcggagcgag
atgttcaagaagcccacgccgtccacgctgaaggccggggagctgcgcacgcacgtcagc
cggtgccaggtgtgcctgcggaggacatga
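The two sequences above are consistent with each other: 1669 codons plus one stop codon give 3 × 1670 = 5010 nt, and the single ambiguous codon `nat` in the nucleotide sequence lines up with the single `X` in the amino acid sequence. The sketch below checks this on two short excerpts copied from the entry. It uses the standard genetic code, and as a simplifying assumption it maps any codon containing a non-ACGT base to `X`; the name `translate` is hypothetical and not part of any KEGG tool.

```python
# Standard genetic code, bases ordered t, c, a, g at each codon position.
BASES = "tcag"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(nt_seq):
    """Translate a coding sequence; unknown or ambiguous codons become 'X'."""
    nt_seq = nt_seq.lower()
    if len(nt_seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(
        CODON_TABLE.get(nt_seq[i:i + 3], "X") for i in range(0, len(nt_seq), 3)
    )


# Length check: 1669 residues plus one stop codon.
print(3 * (1669 + 1))  # 5010

# Excerpt ending in the ambiguous codon 'nat'.
print(translate("ggattccccgggagccgtggcgacnat"))  # GFPGSRGDX

# Last four codons of the coding sequence, ending in the stop codon 'tga'.
print(translate("cggaggacatga"))  # RRT*
```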