KEGG   Ovis aries (sheep): 101115552
Entry
101115552         CDS       T03117                                 
Symbol
COL4A1
Name
(RefSeq) collagen alpha-1(IV) chain
  KO
K06237  collagen type IV alpha
Organism
oas  Ovis aries (sheep)
Pathway
oas04151  PI3K-Akt signaling pathway
oas04510  Focal adhesion
oas04512  ECM-receptor interaction
oas04820  Cytoskeleton in muscle cells
oas04926  Relaxin signaling pathway
oas04933  AGE-RAGE signaling pathway in diabetic complications
oas04974  Protein digestion and absorption
oas05146  Amoebiasis
oas05165  Human papillomavirus infection
oas05200  Pathways in cancer
oas05222  Small cell lung cancer
Brite
KEGG Orthology (KO) [BR:oas00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    101115552 (COL4A1)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    101115552 (COL4A1)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    101115552 (COL4A1)
  09142 Cell motility
   04820 Cytoskeleton in muscle cells
    101115552 (COL4A1)
 09150 Organismal Systems
  09152 Endocrine system
   04926 Relaxin signaling pathway
    101115552 (COL4A1)
  09154 Digestive system
   04974 Protein digestion and absorption
    101115552 (COL4A1)
 09160 Human Diseases
  09161 Cancer: overview
   05200 Pathways in cancer
    101115552 (COL4A1)
  09162 Cancer: specific types
   05222 Small cell lung cancer
    101115552 (COL4A1)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    101115552 (COL4A1)
  09174 Infectious disease: parasitic
   05146 Amoebiasis
    101115552 (COL4A1)
  09167 Endocrine and metabolic disease
   04933 AGE-RAGE signaling pathway in diabetic complications
    101115552 (COL4A1)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   04147 Exosome [BR:oas04147]
    101115552 (COL4A1)
   00536 Glycosaminoglycan binding proteins [BR:oas00536]
    101115552 (COL4A1)
Exosome [BR:oas04147]
 Exosomal proteins
  Exosomal proteins of other cancer cells
   101115552 (COL4A1)
Glycosaminoglycan binding proteins [BR:oas00536]
 Heparan sulfate / Heparin
  Extracellular matrix molecules
   101115552 (COL4A1)
SSDB
Motif
Pfam: Collagen C4
Other DBs
NCBI-GeneID: 101115552
NCBI-ProteinID: XP_027829587
EnsemblRapid: ENSOARG00020003497
LinkDB
Position
10:complement(84241592..84369833)
AA seq 1669 aa
MGPRLGVWLLLGLAALLLHEESSRAAAKGGCAGSGCGKCDCHGVKGQKGERGLPGLQGVI
GFPGMQGPEGPQGPPGQKGDTGEPGLPGTKGTRGPSGVPGYPGNPGLPGIPGQDGPPGPP
GIPGCNGTKGERGPAGPPGLPGFAGNPGPPGLPGMKGDPGEILGHIPGTLLKGERGYPGQ
PGAPGSPGLPGLQGPVGPPGFTGPPGPPGPPGPPGEKGQMGLSFQGPKGEKGDQGVSGPP
GLPGQAQVITKGDTAMRGEKGQKGEPGFPGLQGFGEKGEPGKPGPRGKPGKDGEKGEKGS
QGFPGDSGYPGQPGREGLKGEKGEAGPPGLPGTVIGTGPLGEKGEPGYPGGPGAKGETGP
KGYPGIPGQPGPPGFPTPGLIGAPGFPGDRGEKGEPGLPGVSLPGPSGRDGLPGPPGPPG
PPGQPGHTNGLVECQPGPPGDQGPPGIPGQPGLTGEVGEKGQKGDSCLVCDTTELRGPPG
PQGPPGEIGFPGQPGAKGDRGLPGRAGVEGTPGPQGAPGLMGQPGAKGEPGEIYFDLRLK
GDKGDPGLPGQPGMPGRAGSPGRDGQPGLPGPRGSPGSVGLKGERGPPGGVGFPGSRGDI
GPPGPPGFGPIGPIGDKGQMGFPGNPGAPGQPGPKGEAGKVVPLPGPPGAEGLPGSPGFQ
GPQGDRGFPGSPGRPGLPGEKGAIGQPGIGFPGPPGPKGVDGLPGDAGPPGNPGRQGFNG
LPGNPGPPGQKGEPGIGLPGLKGLPGIPGIPGTPGEKGNVGGPGVPGEHGAIGPPGLQGL
RGDPGPPGLQGPKGAPGVPGIGPPGAMGPPGGQGPPGSSGPPGVKGEKGFPGFPGLDMPG
PKGDKGSQGLPGLTGQSGLPGLPGQQGSPGQPGIPGPKGEMGVMGTPGQPGSPGPAGVPG
LPGAKGEHGFPGSSGPRGDPGFKGDKGDVGLPGKPGSMDKVDMGSMKGEKGDQGEKGQTG
PTGDKGSRGDPGTPGVPGKDGQAGHPGQPGPKGDPGVSGIPGAPGLPGPKGSTGGMGLPG
MPGPKGVAGIPGPQGIPGLPGDKGAKGEKGQAGLPGIGIPGRPGEKGDQGLAGFPGSPGE
KGEKGSTGIPGMPGAPGPKGSPGSVGYPGSPGLPGEKGDKGLPGLDGIPGIKGEAGLPGK
PGPTGPAGQKGEPGSDGIPGSVGEKGEAGLPGRGFPGFPGSKGEKGSKGDVGFPGLAGSP
GIPGSKGEQGFMGPPGPQGQPGLPGTPGHAVEGPKGDRGPQGQPGLPGHPGPMGPPGLPG
LDGLKGDKGNPGWPGTPGAPGPKGDPGFQGMPGIGGSPGITGAKGDMGPPGVPGFHGQKG
APGLQGVKGDQGDQGFPGTKGLPGPPGPPGPFNIIKGEPGLPGPEGPAGLKGLQGPPGPK
GQQGVTGSAGLPGPPGEPGFDGAPGQKGETGPFGPPGPRGFPGPPGPDGLPGSMGPPGTP
SVDHGFLVTRHSQTTDDPQCPPGTKILYHGYSLLYVQGNERAHGQDLGTAGSCLRKFSTM
PFLFCNINNVCNFASRNDYSYWLSTPEPMPMSMAPITGENIRPFISRCAVCEAPAMVMAV
HSQTIQIPQCPAGWSSLWIGYSFVMHTSAGAEGSGQALASPGSCLEEFRSAPFIECHGRG
TCNYYANAYSFWLATVERSEMFKKPTPSTLKAGELRTHVSRCQVCMRRT
NT seq 5010 nt   +upstreamnt  +downstreamnt
atggggccccggctcggcgtgtggctgctgctggggctcgccgcgctcctgctccacgag
gagagcagccgcgccgccgcgaagggtgggtgtgccggctctggctgcgggaagtgtgac
tgccatggcgtgaaaggacagaagggagaaagaggccttccggggttacaaggggtcatc
gggtttcccggaatgcaaggacccgaggggccgcagggaccaccgggacagaagggtgac
accggcgagcccggactgccaggcactaaagggactagaggaccctcaggagtgcctggt
taccctggaaacccaggacttcctggtattcctggccaggacggtcctccgggtccccca
ggtattccaggatgtaacgggacaaagggtgagagagggcccgcgggacctcccggtttg
cctggattcgccggaaatcccggaccaccagggttaccgggaatgaagggagatccaggt
gagattctgggccacataccagggaccctgctgaaaggcgaaagaggatatcctggacag
ccgggagcgcctggttcaccaggcctaccaggactgcaaggccccgtcgggcccccagga
ttcaccgggccaccaggccctccaggccctcctggccctccaggcgaaaaggggcaaatg
ggcttgagctttcaagggccgaaaggtgaaaagggtgatcaaggggtcagcgggcccccg
ggattaccaggacaggctcaagtcatcacgaaaggggacacagccatgcgcggtgagaag
ggtcaaaaaggtgaacccggatttccggggctgcaagggtttggagagaaaggagaacct
ggaaaaccagggccccgcggaaaaccaggaaaagatggtgaaaaaggagaaaaagggagt
caagggtttccgggcgattcagggtacccaggacagccaggtcgagaaggcttaaaggga
gagaaaggtgaagcaggtcctcccgggctgcctggaactgttattggcacaggacccttg
ggagagaaaggagagcccgggtacccagggggcccaggggcgaaaggggagacaggtccc
aaaggttacccaggaataccaggccagccaggccctccaggcttcccgactccggggctg
attggtgcccccggcttccccggcgacagaggcgagaagggtgaaccgggcttgccgggc
gtgtcgctgccgggacccagcggaagggacgggcttcccggcccccccgggccccccggg
ccccctgggcagccgggccacacaaatggactcgtggaatgccagcctgggccgccaggg
gaccagggtcctcccggaattccagggcagccggggttgacgggcgaagttggagaaaaa
ggtcaaaaaggagatagctgcctcgtctgtgacacaacagagcttcgcgggcccccaggg
ccacagggaccccccggagaaataggtttcccaggacagccaggggccaagggagacaga
ggcttgcccggcagggctggtgtggaaggaacgcctggtcctcaaggtgcgccagggctc
atgggccagccgggagccaagggcgagcctggcgagatctacttcgacctacggctcaag
ggcgacaaaggagaccccggcttaccaggccagcccggcatgccaggcagagcgggctcc
cctggaagagacggccaaccgggccttcccggccccagaggctccccgggttcagtagga
ttgaaaggggagcgtggccccccgggaggcgtcggattccccgggagccgtggagacatc
ggccctccggggcctccaggcttcggcccgattggccccattggtgacaaaggacaaatg
ggcttcccaggaaaccctggggccccaggccagccaggtcccaagggagaggcgggaaaa
gtcgtgcccttgcccggcccccctggagcagaagggcttcccgggtcccccggcttccag
gggccacaaggtgaccgaggttttcctggaagccccggaaggccgggcctccccggagag
aagggtgccattggccagcctgggattggatttcctgggcctcccggccccaaaggcgtt
gatggtttacctggagacgctggacctcctgggaatccgggtcgtcaaggcttcaacggc
ttacctggcaaccccggtccacctggccagaagggcgagcctggaatcggtctgccggga
ctcaaaggcctgcctgggatacctggcatccctggcacccctggggagaagggaaacgtc
ggaggaccgggcgttcctggagagcatggcgccatcggccccccaggcctccaggggctc
agaggtgacccgggacctcctggattgcaaggccccaaaggagcgccgggagtccctgga
atcggcccccctggagcaatgggcccccccggaggacagggacccccagggtcatcaggc
ccccccggagtgaaaggagagaaaggcttccccggcttcccaggtctggacatgccgggc
cccaaaggagacaaagggtcccaggggctccccggcctgacgggacagtcggggctgcct
ggccttcctggacagcagggctcccctggccagcctggcattccaggtcccaagggagag
atgggagtcatggggactccggggcagcccggctcgccaggaccagcgggcgtgccagga
ttgccgggtgccaaaggggaacacggcttccctggctcctcaggacccaggggagaccct
ggcttcaagggtgacaaaggcgacgtggggctccccggcaagccaggctccatggataag
gtggacatgggcagcatgaagggcgagaagggggaccaaggcgagaaaggacagactggt
ccgactggcgataaaggatcccgcggagacccgggaaccccaggcgtgccgggaaaggac
ggtcaggcaggacaccccgggcagccaggacctaaaggtgatccaggtgtgagtgggatc
cctggtgctccgggacttcctggtcccaaaggatccactggtggaatgggcctcccagga
atgccgggaccaaaaggtgtggctggcatccccggcccgcagggcattcctggcttacct
ggagacaagggggcaaaaggagagaaagggcaggcgggtctgcctggcattgggattcca
ggacggcctggggagaagggagaccagggccttgcaggatttcccggaagccccggcgag
aagggagagaaaggaagcacggggatcccagggatgcccggggctccgggccccaaaggc
tccccgggcagtgttggctatccaggaagccctgggttgcctggggagaaaggtgacaag
ggcctcccgggactggatggcattcctggcatcaaaggagaagcaggtcttcctgggaag
cctggccccacgggcccagccggccagaaaggggagcccggcagcgatggaatcccaggg
tcggtgggagagaagggcgaggcaggtctacctggaagaggattcccagggtttccaggg
agcaaaggagagaaaggttcaaagggcgatgtgggcttcccaggattagctgggagccca
ggaattcctggatccaaaggagaacaaggattcatgggtcccccgggaccacagggacag
ccgggattgccagggaccccaggccacgcggtagaggggcccaaaggagaccgcggcccg
caaggacaacccggcctgccagggcatccgggacccatggggcctccaggcctccccggg
ctcgatgggctgaaaggtgacaaggggaacccaggctggccgggcactccgggagctcca
gggcccaagggagacccaggattccagggcatgccgggcattggcggctctccaggaatc
acaggagctaagggagatatgggacccccaggagttccagggtttcacggtcagaaaggc
gcccccggcctgcagggagtcaaaggcgaccaaggagaccaaggcttcccgggaacgaaa
ggtcttcccggccccccgggccccccaggtccattcaacatcatcaagggggaaccaggg
ctccctggtcccgagggccccgcgggtctgaaagggcttcagggacctccaggcccaaaa
ggacagcaaggtgtgacgggatccgcaggcttgcctggacccccaggtgagcctggcttt
gacggcgcccctggccagaaaggagagaccgggcccttcggccctccaggtccacgaggc
ttcccgggtccgcccggccccgacgggctgccggggtccatgggtcccccgggcaccccg
tcagtcgatcacggcttccttgtgacccggcacagtcagacgacagacgacccccagtgc
cctcctgggaccaaaatcctctaccacggctactctttgctctacgtgcaaggcaacgag
cgggcgcacggccaggacttgggcacggcgggcagctgcctgcggaagttcagcaccatg
cccttcctcttttgcaacattaacaacgtctgcaacttcgcctcccgcaacgactactcg
tactggctgtccacgccggagcccatgcccatgtccatggcgcccatcaccggggagaac
atccggcccttcatcagcaggtgtgctgtgtgtgaggccccagcgatggtgatggccgtg
cacagccagaccatccagatcccgcagtgccccgccggctggtcctcgctctggatcggc
tactcctttgtgatgcacaccagcgccggggctgaaggctctggccaagccctcgcctcc
cccggctcgtgtctggaggagttcagaagcgcccccttcatcgagtgccacggccgcgga
acttgcaattactacgcaaacgcgtacagcttttggcttgccacggtagagcggagcgag
atgttcaagaagcccacgccgtccacgctgaaggccggggagctgcgcacgcacgtcagc
cggtgccaggtgtgcatgcggaggacataa

DBGET integrated database retrieval system