KEGG   Bos mutus (wild yak): 102282767
Entry
102282767         CDS       T02919                                 
Symbol
COL4A1
Name
(RefSeq) collagen alpha-1(IV) chain
KO
K06237  collagen type IV alpha
Organism
bom  Bos mutus (wild yak)
Pathway
bom04151  PI3K-Akt signaling pathway
bom04510  Focal adhesion
bom04512  ECM-receptor interaction
bom04820  Cytoskeleton in muscle cells
bom04926  Relaxin signaling pathway
bom04933  AGE-RAGE signaling pathway in diabetic complications
bom04974  Protein digestion and absorption
bom05146  Amoebiasis
bom05165  Human papillomavirus infection
bom05200  Pathways in cancer
bom05222  Small cell lung cancer
Brite
KEGG Orthology (KO) [BR:bom00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    102282767 (COL4A1)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    102282767 (COL4A1)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    102282767 (COL4A1)
  09142 Cell motility
   04820 Cytoskeleton in muscle cells
    102282767 (COL4A1)
 09150 Organismal Systems
  09152 Endocrine system
   04926 Relaxin signaling pathway
    102282767 (COL4A1)
  09154 Digestive system
   04974 Protein digestion and absorption
    102282767 (COL4A1)
 09160 Human Diseases
  09161 Cancer: overview
   05200 Pathways in cancer
    102282767 (COL4A1)
  09162 Cancer: specific types
   05222 Small cell lung cancer
    102282767 (COL4A1)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    102282767 (COL4A1)
  09174 Infectious disease: parasitic
   05146 Amoebiasis
    102282767 (COL4A1)
  09167 Endocrine and metabolic disease
   04933 AGE-RAGE signaling pathway in diabetic complications
    102282767 (COL4A1)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   04147 Exosome [BR:bom04147]
    102282767 (COL4A1)
   00536 Glycosaminoglycan binding proteins [BR:bom00536]
    102282767 (COL4A1)
Exosome [BR:bom04147]
 Exosomal proteins
  Exosomal proteins of other cancer cells
   102282767 (COL4A1)
Glycosaminoglycan binding proteins [BR:bom00536]
 Heparan sulfate / Heparin
  Extracellular matrix molecules
   102282767 (COL4A1)
Motif
Pfam: Collagen C4
Other DBs
NCBI-GeneID: 102282767
NCBI-ProteinID: XP_070236843
Position
12:72274569..72408072
AA seq 1669 aa
MGPRLGVWLLLLLAALLLHEESSRAAAKGGCAGSGCGKCDCHGVKGQKGERGLPGLQGVI
GFPGMQGPEGPQGPPGQKGDTGEPGLPGTKGTRGPSGVPGYPGNPGLPGIPGQDGPPGPP
GIPGCNGTKGERGPVGPPGLPGFAGNPGPPGLPGMKGDPGEILGHIPGTLLKGERGYPGQ
PGAPGSPGLPGLQGPVGPPGFTGPPGPPGPPGPPGEKGQMGLSFQGPKGEKGDQGVSGPP
GLPGQAQVITKGDTAMRGEKGQKGEPGFPGLPGFGEKGEPGKPGPRGKPGKDGEKGEKGS
PGFPGDSGYPGQPGQDGLKGEKGEAGPPGLPGTVIGTGPLGEKGEPGYPGGPGAKGETGP
KGFPGIPGQPGPPGFPTPGLIGAPGFPGDRGEKGEPGLPGVSLPGPSGRDGLPGPPGPPG
PPGQPGHTNGIVECQPGPPGDQGPPGIPGQPGLTGEVGEKGQKGDSCLVCDTAELRGPPG
PQGPPGEIGFPGQPGAKGDRGLPGRDGLEGLPGPQGAPGLMGQPGAKGEPGEIYFDIRLK
GDKGDPGFPGQPGMPGRAGSPGRDGQPGLPGPRGSPGSVGLKGERGPPGGVGFPGSRGDI
GPPGPPGFGPIGPIGDKGQIGFPGTPGAPGQPGPKGEAGKVVPLPGPPGAEGLPGSPGFQ
GPQGDRGFPGSPGRPGLPGEKGAIGQPGIGFPGPPGPKGVDGLPGDAGPPGNPGRQGFNG
LPGNPGPPGQKGEPGVGLPGLKGLPGIPGIPGTPGEKGNVGGPGIPGEHGAIGPPGLQGL
RGDPGPPGFQGPKGAPGVPGIGPPGAMGPPGGQGPPGSSGPPGVKGEKGFPGFPGLDMPG
PKGDKGSQGLPGLTGQSGLPGLPGQQGTPGQPGIPGPKGEMGVMGTPGQPGSPGPAGVPG
LPGAKGDHGFPGSSGPRGDPGFKGDKGDVGLPGKPGSMDKVDMGSMKGEKGDQGEKGQTG
PTGDKGSRGDPGTPGVPGKDGQAGHPGQPGPKGDPGVSGVPGAPGLPGPKGPTGGMGLPG
MPGPKGVAGIPGPQGVPGLPGDKGAKGEKGQAGLPGIGIPGRPGDKGDQGLAGFPGSPGE
KGEKGSTGIPGMPGSPGPKGSPGSVGYPGSPGLPGEKGDKGLPGLDGIPGIKGEAGLPGK
PGPTGPAGQKGEPGSDGIPGSVGEKGESGLPGRGFPGFPGSKGDKGSKGDVGFPGLAGSP
GIPGSKGEQGFMGPPGPQGQPGLPGTPGHAVEGPKGDRGPQGQPGLPGRPGPMGPPGLPG
LEGLKGERGNPGWPGTPGAPGPKGDPGFQGMPGIGGSPGITGAKGDVGPPGVPGFHGQKG
APGLQGVKGDQGDQGFPGTKGLPGPPGPPGPFSIIKGEPGLPGPEGPAGLKGLQGPPGPK
GQQGVTGSVGLPGPPGEPGFDGAPGQKGETGPFGPPGPRGFPGPPGPDGLPGSMGPPGTP
SVDHGFLVTRHSQTTDDPQCPPGTKILYHGYSLLYVQGNERAHGQDLGTAGSCLRKFSTM
PFLFCNINNVCNFASRNDYSYWLSTPEPMPMSMAPITGENIRPFISRCAVCEAPAMVMAV
HSQTIQIPQCPTGWSSLWIGYSFVMHTSAGAEGSGQALASPGSCLEEFRSAPFIECHGRG
TCNYYANAYSFWLATIERSEMFKKPTPSTLKAGELRTHVSRCQVCMRRT
NT seq 5010 nt
atggggccccggctcggcgtctggctgctgctgctgctcgccgcgctcctgctccacgag
gagagcagccgggccgccgcgaagggtgggtgtgccggctctggctgcgggaagtgtgac
tgccatggcgtgaagggacaaaagggagaaagaggcctcccggggttacaaggggtcatc
gggtttcccggaatgcaaggacccgaggggccacagggaccaccgggacagaagggtgac
accggcgagcccggactgccaggcactaaaggcacgagaggaccctcaggagtgcctggt
taccctggaaacccaggacttcctggtattcctggccaggatggtcctccgggtccccca
ggtattccaggatgcaacgggacaaagggtgagagagggcctgtggggcctcccggtttg
cctggattcgccggaaatcccggaccaccagggttaccgggaatgaagggagatccaggt
gagattctgggccatataccagggaccctgctgaaaggcgaaagaggatatcctggacag
ccaggagcgcccggctcaccaggtctgccaggactgcaaggccccgtcggccccccagga
ttcactggaccgccaggccctccaggccctcctggccctccaggtgaaaaggggcagatg
ggcttgagcttccaaggacccaaaggtgaaaagggtgatcaaggggtcagcgggccccca
ggattgccaggacaggcgcaagtcatcacgaaaggagacacggccatgcgaggcgagaag
ggtcaaaaaggtgaacctggatttccggggctgccagggtttggagagaaaggagaacct
ggaaaaccagggccccgtggaaaaccaggaaaagatggtgaaaaaggagaaaaagggagt
ccagggtttccaggcgactcagggtacccaggacagcctggccaagacggtttaaaggga
gagaaaggtgaagcaggtcctcccgggcttcctggaactgttattggcacgggaccgttg
ggagagaaaggagagcccgggtacccaggaggtccgggggcgaaaggggagacaggtccc
aaaggtttcccaggaataccaggccagccaggccctccaggcttcccgactccggggctg
attggtgcccccggcttccctggcgacagaggagagaagggtgaaccgggcttgccgggc
gtgtcgctgccaggacccagtggaagggacgggcttcccggcccccctggcccccccggg
ccccctgggcagccgggccacacaaatggaatcgtggaatgccagcctgggccgccaggg
gaccagggtcctcccggaattccagggcagccggggttgacgggcgaagttggagaaaaa
ggtcaaaaaggagacagttgcctcgtctgtgacacagcagagcttcgtgggcccccaggg
ccacagggaccccccggagaaataggtttcccaggacagccaggagccaagggtgataga
ggcttgcccggcagggacggtctggaaggactgcccggtccgcaaggtgcgccggggctc
atgggccagccgggagccaagggcgagcccggtgagatctacttcgacatacggctcaaa
ggcgacaaaggagaccccggcttcccaggccagcccggcatgccaggcagagcgggctcc
cccggaagagacggccaaccgggtctgcccggccccagaggctccccgggttccgtagga
ttgaaaggggagcgtggccccccgggaggcgtcggattccccgggagccgtggcgacatc
ggccctccggggcctccaggctttggcccgattggccccattggtgacaaaggacagata
ggcttcccaggaacccccggggccccaggccagccaggtcccaagggcgaggcggggaaa
gtcgtgcccttgcccggcccccctggagcagaaggacttcccgggtcccccggcttccag
gggccacaaggtgaccgaggttttcctggaagccccggaaggccgggcctccctggagag
aagggcgccatcggccagcctgggattggatttcctgggcctcccggccccaaaggcgtt
gatggtttacctggagacgctggacctcctgggaatccgggtcgtcaaggcttcaacggc
ttacctggcaaccccggtccacctggacagaagggcgagcctggagttggtctgccggga
ctcaaaggcctgcccgggatacctggcatccccggcacccccggggagaagggaaacgtc
ggaggaccgggaattcctggagagcacggcgccatcggccccccaggcctccaggggctc
agaggtgacccaggaccacctggatttcaaggccccaaaggagctccgggagtccccgga
atcggcccccctggagcaatgggcccccccggaggacagggacccccagggtcatcaggc
ccccccggagtgaaaggagagaaaggcttccccggcttcccaggcctggacatgccgggc
cccaaaggagacaaagggtcccaggggctccccggcctgacggggcagtctgggctgcct
ggccttcctggacagcagggcacccccggccagcctggcattccaggtcccaagggagag
atgggagtcatggggactccagggcagcccggctcgccagggccagcgggcgtgccagga
ttgccaggtgccaaaggggaccacggcttccccggctcttcgggacccaggggagaccct
ggcttcaagggcgacaaaggcgacgtggggctccccggcaagccaggctccatggacaag
gtggacatgggcagcatgaagggcgagaagggagaccaaggcgagaaaggacaaactggt
ccgactggtgataaaggatcccggggagacccgggaaccccaggtgtgccgggaaaggat
ggtcaggcaggacaccccgggcagccaggacctaaaggtgacccaggtgtgagcggggtc
ccaggtgctccgggactccctggtcccaaaggacccactggtggaatgggcctaccggga
atgccaggaccaaaaggtgtggctggcatccccggcccgcagggcgttcctggcttacct
ggagacaagggggcaaaaggagagaaagggcaggcggggctgcctggcattgggattcca
ggacggcccggggacaagggagaccagggcttagcaggatttcccggaagccccggcgag
aagggagagaaaggaagcactgggatcccagggatgcccgggtccccgggccccaaaggc
tccccaggaagtgttggctatccgggaagccctgggttgcctggagagaaaggtgacaaa
ggcctcccgggactggatggcattcctggcatcaaaggagaagcaggtcttcctgggaag
cctggccccacgggtccagccggccagaaaggggagcccggcagcgatggaatcccaggg
tcggtgggagagaagggcgagtcaggtctacctggaagaggattcccagggtttccaggg
agcaaaggagacaaaggttcaaagggcgatgtgggcttcccaggattagccgggagccca
ggaattcctggatccaaaggagaacaaggattcatgggtcccccgggaccacagggacag
ccgggattgccagggaccccaggccacgcggtagaggggcccaaaggagaccgcggcccg
caaggacaacccggcctgccagggcgtccggggcccatggggcctccaggcctccccggg
ctcgaggggctgaaaggtgaaagggggaacccaggctggccgggcactccgggagctcca
gggcccaagggagacccaggattccagggcatgccgggcatcggcggctctccaggaatc
acaggagccaagggtgatgtgggacctccaggagttccagggtttcacggtcagaaaggc
gcccccggcctgcagggagtcaaaggtgaccaaggagatcaaggcttcccaggaactaaa
ggtcttcccggccccccgggccccccaggtccattcagcatcatcaagggggaaccaggg
ctccctgggcccgagggccccgcgggtctgaaagggcttcagggacctccaggcccgaaa
ggacagcaaggtgtgacgggatccgtgggcttgcccgggcccccaggtgagcccggcttt
gacggcgccccaggccagaaaggagagacggggcccttcggccctcccggtccacgtggc
ttcccgggtccgcccggccccgacgggctgccgggatccatgggtcccccgggcacccca
tcagtcgatcatggcttccttgtgacccggcacagtcagacaacagacgacccccagtgc
cctcctgggaccaaaatcctctaccacggctactctttgctctacgtgcaaggcaacgag
cgggcgcacggccaggacttgggcacggcgggcagctgtctgcggaagttcagtaccatg
cccttcctcttctgtaacatcaacaatgtctgcaacttcgcgtcccgcaacgactactcg
tactggctgtccaccccggagcccatgcccatgtccatggctcccatcacgggggagaac
atccggcccttcatcagcaggtgtgctgtgtgtgaggccccggccatggtgatggccgtg
cacagccagaccatccagatcccgcagtgccccaccggctggtcctcgctctggatcggc
tactcctttgtgatgcacaccagcgccggggctgaaggctctggccaagccctcgcctcc
cccggctcgtgtctggaggagttcagaagcgcccccttcatcgagtgccacggccgcgga
acttgcaattactacgcgaacgcttacagcttttggcttgccacgatagagcggagcgag
atgttcaagaagcccacgccgtccacgctgaaggccggggagctgcgcacgcacgtcagc
cggtgccaggtgtgcatgcggaggacataa
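
The lengths and coordinates reported in this entry can be cross-checked with a few lines of Python. This is a minimal sketch, not part of the KEGG record: the function names are hypothetical, and it assumes the NT seq is a complete CDS that includes one terminal stop codon (the sequence above ends in taa) and that the Position field uses inclusive coordinates.

```python
def cds_length_consistent(aa_len: int, nt_len: int) -> bool:
    """Return True when nt_len is three bases per residue plus one stop codon.

    Assumption: the nucleotide sequence is a full CDS with a single stop codon.
    """
    return nt_len == 3 * (aa_len + 1)


def span_length(position: str) -> int:
    """Return the length in bp of a 'chrom:start..end' string.

    Assumption: start and end are both inclusive.
    """
    _, coords = position.split(":")
    start, end = (int(x) for x in coords.split(".."))
    return end - start + 1


# Values copied from the entry above.
print(cds_length_consistent(1669, 5010))     # True: 3 * (1669 + 1) == 5010
print(span_length("12:72274569..72408072"))  # 133504
```

Under these assumptions the gene spans about 133.5 kb on the genome while the coding sequence is 5,010 nt, which is consistent with a gene carrying long introns.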
