KEGG   Capra hircus (goat): 102171079
Entry
102171079         CDS       T02910                                 
Symbol
COL4A1
Name
(RefSeq) LOW QUALITY PROTEIN: collagen alpha-1(IV) chain
KO
K06237  collagen type IV alpha
Organism
chx  Capra hircus (goat)
Pathway
chx04151  PI3K-Akt signaling pathway
chx04382  Cornified envelope formation
chx04510  Focal adhesion
chx04512  ECM-receptor interaction
chx04820  Cytoskeleton in muscle cells
chx04926  Relaxin signaling pathway
chx04933  AGE-RAGE signaling pathway in diabetic complications
chx04974  Protein digestion and absorption
chx05146  Amoebiasis
chx05165  Human papillomavirus infection
chx05200  Pathways in cancer
chx05222  Small cell lung cancer
Brite
KEGG Orthology (KO) [BR:chx00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    102171079 (COL4A1)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    102171079 (COL4A1)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    102171079 (COL4A1)
  09142 Cell motility
   04820 Cytoskeleton in muscle cells
    102171079 (COL4A1)
 09150 Organismal Systems
  09152 Endocrine system
   04926 Relaxin signaling pathway
    102171079 (COL4A1)
  09154 Digestive system
   04974 Protein digestion and absorption
    102171079 (COL4A1)
  09158 Development and regeneration
   04382 Cornified envelope formation
    102171079 (COL4A1)
 09160 Human Diseases
  09161 Cancer: overview
   05200 Pathways in cancer
    102171079 (COL4A1)
  09162 Cancer: specific types
   05222 Small cell lung cancer
    102171079 (COL4A1)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    102171079 (COL4A1)
  09174 Infectious disease: parasitic
   05146 Amoebiasis
    102171079 (COL4A1)
  09167 Endocrine and metabolic disease
   04933 AGE-RAGE signaling pathway in diabetic complications
    102171079 (COL4A1)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   04147 Exosome [BR:chx04147]
    102171079 (COL4A1)
   00536 Glycosaminoglycan binding proteins [BR:chx00536]
    102171079 (COL4A1)
Exosome [BR:chx04147]
 Exosomal proteins
  Exosomal proteins of other cancer cells
   102171079 (COL4A1)
Glycosaminoglycan binding proteins [BR:chx00536]
 Heparan sulfate / Heparin
  Extracellular matrix molecules
   102171079 (COL4A1)
Motif
Pfam: Collagen C4
Other DBs
NCBI-GeneID: 102171079
NCBI-ProteinID: XP_017911853
Position
12:1302093..1432028
AA seq 1669 aa
MGPRLGAWLLLGLAALLLHEESSRAAAKGGCAGSGCGKCDCHGVKGQKGERGLPGLQGVI
GFPGMQGPEGPQGPPGQKGDTGEPGLPGTKGTRGPSGVPGYPGNPGLPGIPGQDGPPGPP
GIPGCNGTKGERGPAGPPGLPGFAGNPGPPGLPGMKGDPGEILGHIPGTLLKGERGYPGQ
PGAPGSPGLPGLQGPVGPPGFTGPPGPPGPPGPPGEKGQMGLSFQGPKGEKGDQGVSGPP
GLPGQAQVITKGDTAMRGEKGQKGEPGFPGLQGFGEKGEPGKPGPRGKPGKDGEKGEKGS
QGFPGDSGYPGQPGREGLKGEKGEAGPPGLPGTVIGTGPLGEKGEPGYPGGPGAKGETGP
KGYPGIPGQPGPPGFPTPGLIGAPGFPGDRGEKGEPGLPGVSLPGPSGRDGLPGPPGPPG
PPGQPGHTNGLVECQPGPPGDQGPPGIPGQPGLTGEVGEKGQKGDSCLVCDTTELRGPPG
PQGPPGEIGFPGQPGAKGDRGLPGRAGVEGTPGPQGVPGLMGQPGAKGEPGEIYFDIRLK
GDKGDPGLPGQPGMPGRAGSPGRDGQPGLPGPRGSPGSVGLKGERGPPGGVGFPGSRGDX
GPPGPPGFGPIGPIGDKGEMGFPGNPGAPGQPGLKGETGKVVPLPGPPGAEGLPGSPGFQ
GPQGDRGFPGSPGRPGLPGEKGAIGQPGIGFPGPPGPKGVDGIPGDAGPPGNPGRQGFNG
LPGNPGPPGQKGEPGVGLPGLKGLPGIPGIPGTPGEKGNVGGPGVPGEHGAIGPPGLQGL
RGDPGPPGLQGPRGAPGVPGIGPPGAMGPPGGQGPPGSSGPPGVKGEKGFPGFPGLDMPG
PKGDKGSQGLPGLTGQSGLPGLPGQQGSPGQPGIPGPKGEMGVMGTPGQPGSPGPAGVPG
LPGAKGEHGFPGSSGPRGDPGFKGDKGDVGLPGKPGSMDKVDMGSMKGEKGDQGEKGQTG
PTGDKGSRGDPGTPGVPGKDGQAGHPGQPGPKGDPGVSGIPGAPGLPGPKGSTGGMGLPG
MPGPKGVAGIPGPQGIPGLPGDKGAKGEKGQAGLPGIGIPGRPGEKGDQGLAGFPGSPGE
KGEKGSTGIPGMPGAPGPKGSPGSVGYPGSPGLPGEKGDKGLPGLDGTPGIKGEAGLPGK
PGPTGPAGQKGEPGSDGIPGSVGEKGEAGLPGRGFPGFPGSKGEKGSKGDVGFPGLAGSP
GIPGSKGEQGFMGPPGPQGQPGLPGTPGHAVEGPKGDRGPQGQPGLPGHPGPMGPPGLPG
LDGLKGDKGNPGWPGTPGAPGPKGDPGFQGMPGIGGSPGITGAKGDMGPPGVPGFHGQKG
APGLQGVKGDQGDQGFPGTKGLPGPPGPPGPFNIIKGEPGLPGPEGPAGLKGLQGPPGPK
GQQGVTGSAGLPGPPGEPGFDGAPGQKGETGPFGPPGPRGFPGPPGPDGLPGSMGPPGTP
SVDHGFLVTRHSQTTDDPQCPPGTKILYHGYSLLYVQGNERAHGQDLGTAGSCLRKFSTM
PFLFCNINNVCNFASRNDYSYWLSTPEPMPMSMAPITGENIRPFISRCAVCEAPAMVMAV
HSQTIQIPQCPAGWSSLWIGYSFVMHTSAGAEGSGQALASPGSCLEEFRSAPFIECHGRG
TCNYYANAYSFWLATIERSEMFKKPTPSTLKAGELRTHVSRCQVCLRRT
NT seq 5010 nt
atggggccccggctcggcgcgtggctgctgctggggctcgccgcgctcctgctccacgag
gagagcagccgggccgccgcgaagggtgggtgtgctggctctggctgcgggaagtgtgac
tgccatggcgtgaaaggacagaagggagaaagaggcctcccggggttacaaggggtcatc
gggtttcccggaatgcaaggacccgaggggccgcagggaccaccaggacagaagggtgac
accggcgagcccggactgccaggcactaaagggacgagaggaccctcaggagtgcctggt
taccctggaaacccaggacttcctggtattcctggccaggacggtcctccgggtccccca
ggtattccaggatgtaacgggacaaagggtgagagagggcccgcgggacctcccggtttg
cctggattcgccggaaatcccggaccaccagggttaccgggaatgaagggagatccaggt
gagattctgggccatataccagggaccctgctgaaaggcgaaagaggatatcctggacag
ccgggagcgcctggttcaccaggcctgccaggactgcaaggccccgtcgggcccccagga
ttcaccgggccaccaggccctccaggccctcctggccctccaggcgaaaaggggcaaatg
ggcttgagctttcaagggccgaaaggtgaaaagggtgatcaaggggtcagcgggcccccg
ggattaccaggacaggctcaagtcatcacgaaaggggacacagccatgcgtggcgagaag
ggtcaaaaaggtgaacccggatttccggggctgcaagggtttggagagaaaggagaacct
ggaaaaccagggccccgtggaaaaccaggaaaagatggtgaaaaaggagaaaaagggagt
caagggtttccgggcgattcagggtacccaggacagccaggccgagaaggtttaaaggga
gagaaaggtgaagcaggtcctcccgggctgcctggaactgttattggcacaggacccttg
ggagagaaaggagagcccgggtacccagggggcccaggggcgaaaggggagacaggtccc
aaaggttacccaggaataccaggccagccaggccctccaggcttcccgactccggggctg
attggtgcccccggcttccccggcgacagaggagagaagggtgaaccgggcttgccgggt
gtgtcgctgccaggacccagcggaagggacgggcttcccggcccccccgggccccccggg
ccccctgggcagccgggccacacaaatggactcgtggaatgccagcctgggccaccaggg
gaccagggtcctcccggaattccagggcagccggggttgacgggcgaagttggagaaaaa
ggtcaaaaaggagacagctgcctcgtctgtgacacaacagagcttcgtgggcccccaggg
ccacagggaccccccggagaaataggtttcccaggacaaccaggggccaagggagacaga
ggcttacccggcagggctggtgtggaaggaacgcctggtcctcaaggtgtgccagggctc
atgggccagccgggagccaagggcgagcccggcgagatctacttcgacatacggctcaag
ggcgacaaaggagaccccggcttaccaggccagcctggcatgccaggcagagcgggctcc
cctggaagagacggccaaccgggccttcccggccccagaggctccccgggttcagtagga
ttgaaaggggagcgtggccccccgggaggcgtcggattccccgggagccgtggcgacnat
ggccctccggggcctccaggcttcggcccgattggccccattggtgacaaaggagaaatg
ggcttcccaggcaaccctggggccccaggccagccaggtctcaagggagagacgggaaaa
gtcgtgcccttgcccggcccccctggagcagaaggacttcccgggtcccccggcttccag
gggccacaaggtgaccgaggttttcctggaagccccggaaggccgggcctccctggagag
aagggtgccatcggccagcctgggattggatttcctgggcctcctggccccaaaggcgtt
gatggtatacctggagacgctggacctcctgggaatccgggtcgtcaaggcttcaacggc
ttacctggcaaccccggtccacctggccagaagggcgagcctggagtcggtctgccggga
ctcaaaggcctgcctgggatacctggcatccctggcacccccggggagaagggaaacgtc
ggaggaccgggcgttcctggagagcacggcgccatcggccccccaggcctccaggggctc
agaggtgacccgggacctcctggattgcaaggccccagaggagctccgggagtccccgga
atcggccctcctggagcaatgggcccccccggaggacagggacccccagggtcatcaggc
ccccccggagtgaaaggagagaaaggcttccccggcttcccaggtctggacatgccaggt
cccaaaggagacaaagggtcccaggggctccccggcctgacggggcagtcggggctgcct
ggccttcctggacagcagggctcccccggccagcctggcattccaggtcccaagggagag
atgggagtcatggggactccggggcagcccggctcgccaggaccagcgggcgtgccagga
ttgccgggtgccaaaggggaacacggcttccccggctcctcaggacccaggggagaccct
ggcttcaagggtgacaaaggcgacgtggggctccccggcaagccaggctccatggataag
gtggacatgggcagcatgaagggcgagaagggggaccaaggcgagaaaggacagactggt
ccgactggcgataaaggatcccgcggagacccgggaaccccaggcgtgccgggaaaggac
ggtcaggcaggacaccccgggcagccaggacctaaaggtgatccaggtgtgagcgggatc
cctggtgctccgggacttcctggtcccaaaggatccactggtggaatgggcctcccagga
atgccgggaccaaaaggtgtggctggcatccccggcccgcagggcattcctggcttacct
ggagacaagggggcaaaaggagagaaagggcaggcgggtctgcctggcattgggattcca
ggacggcctggggagaagggagaccagggccttgcaggatttcccggaagccccggcgag
aagggagagaaaggaagcacggggatcccagggatgcccggggctccgggccccaaaggc
tccccgggcagtgttggctatccgggaagccctgggttgcctggggagaaaggtgacaag
ggcctcccaggactggatggcactcctggcatcaaaggagaagcaggtcttcctgggaag
cctggccccacgggcccagccggccagaaaggggagcccggcagcgatggaatcccaggg
tcggtgggagagaagggcgaggcaggtctacctggaagaggattcccagggtttccaggg
agcaaaggagagaaaggttcaaagggcgatgtgggcttcccaggattagctgggagccca
ggaattcctggatccaaaggagaacaaggattcatgggtcccccgggaccacagggacag
ccgggattgccagggaccccaggccacgcggtagaggggcccaaaggagaccgcggcccg
caaggacaacccggcctgccagggcatccgggacccatggggcctccaggcctccccggg
ctcgatgggctgaaaggtgacaaggggaacccaggctggccgggcactccgggagctcca
gggcccaagggagacccaggattccagggcatgccgggcattggcggctctccaggaatc
acaggagctaagggagatatgggacccccaggagttccagggtttcatggtcaaaaaggc
gcccccggcctgcagggagtcaaaggcgaccaaggagaccaaggcttcccgggaacgaaa
ggtcttcctggccccccgggccccccaggtccgttcaacatcatcaagggggaaccaggg
ctccctggtcccgagggccctgcgggtctgaaagggcttcagggacctccaggcccgaaa
ggacagcaaggtgtgacgggatccgcgggcttgcctgggcccccaggtgagcccggcttt
gacggcgcccccggccagaaaggagagacggggcccttcggccctccaggtccacgaggc
ttcccaggtccgcccggccccgacgggctgccggggtccatgggtcccccgggcaccccg
tcagtcgatcacggcttccttgtgacccggcacagtcagacgacagacgacccccagtgc
cctcctgggaccaaaatcctctaccacggctactctttgctctacgtgcaaggcaacgag
cgggcgcatggccaggacttgggcacggcgggcagctgcctgcggaagttcagcaccatg
cccttcctcttctgcaacatcaacaacgtctgcaacttcgcctcccgcaatgactactcg
tactggctgtccacgccggagcccatgcccatgtccatggcgcccatcaccggggagaac
atccggcccttcatcagcaggtgtgctgtgtgtgaggccccagcgatggtgatggccgtg
cacagccagaccatccagattccgcagtgccctgccggctggtcctcgctctggatcggc
tactcctttgtgatgcacaccagcgccggggctgaaggctctggccaagccctcgcctcc
cccggctcatgtctggaggagttcaggagcgcccccttcatcgagtgccacggccgtgga
acttgcaattactacgcaaacgcttacagcttttggcttgccacgatagagcggagcgag
atgttcaagaagcccacgccgtccacgctgaaggccggggagctgcgcacgcacgtcagc
cggtgccaggtgtgcctgcggaggacatga
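
The two sequence headers in this entry (1669 aa and 5010 nt) can be checked against each other. The minimal Python sketch below assumes the NT sequence is the complete coding sequence with exactly one terminal stop codon (the listing ends in tga). The helper names parse_length and cds_lengths_agree are hypothetical and are not part of KEGG or DBGET.

```python
import re


def parse_length(header: str) -> int:
    """Extract the integer length from a header such as 'AA seq 1669 aa'."""
    match = re.search(r"seq\s+(\d+)\s+(aa|nt)", header)
    if match is None:
        raise ValueError(f"no length found in header: {header!r}")
    return int(match.group(1))


def cds_lengths_agree(aa_len: int, nt_len: int) -> bool:
    """True if nt_len equals three bases per residue plus one stop codon.

    Assumption: the nucleotide record is a complete CDS with a single
    terminal stop codon and no untranslated flanking sequence.
    """
    return nt_len == 3 * (aa_len + 1)


aa_len = parse_length("AA seq 1669 aa")
nt_len = parse_length("NT seq 5010 nt")
print(aa_len, nt_len, cds_lengths_agree(aa_len, nt_len))
```

Under that assumption the headers agree: 3 x (1669 + 1) = 5010.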
