KEGG   Equus przewalskii (Przewalski's horse): 103547140
Entry
103547140         CDS       T04644                                 
Symbol
COL4A4
Name
(RefSeq) collagen alpha-4(IV) chain isoform X1
KO
K06237  collagen type IV alpha
Organism
epz  Equus przewalskii (Przewalski's horse)
Pathway
epz04151  PI3K-Akt signaling pathway
epz04382  Cornified envelope formation
epz04510  Focal adhesion
epz04512  ECM-receptor interaction
epz04820  Cytoskeleton in muscle cells
epz04926  Relaxin signaling pathway
epz04933  AGE-RAGE signaling pathway in diabetic complications
epz04974  Protein digestion and absorption
epz05146  Amoebiasis
epz05165  Human papillomavirus infection
epz05200  Pathways in cancer
epz05222  Small cell lung cancer
Brite
KEGG Orthology (KO) [BR:epz00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    103547140 (COL4A4)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    103547140 (COL4A4)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    103547140 (COL4A4)
  09142 Cell motility
   04820 Cytoskeleton in muscle cells
    103547140 (COL4A4)
 09150 Organismal Systems
  09152 Endocrine system
   04926 Relaxin signaling pathway
    103547140 (COL4A4)
  09154 Digestive system
   04974 Protein digestion and absorption
    103547140 (COL4A4)
  09158 Development and regeneration
   04382 Cornified envelope formation
    103547140 (COL4A4)
 09160 Human Diseases
  09161 Cancer: overview
   05200 Pathways in cancer
    103547140 (COL4A4)
  09162 Cancer: specific types
   05222 Small cell lung cancer
    103547140 (COL4A4)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    103547140 (COL4A4)
  09174 Infectious disease: parasitic
   05146 Amoebiasis
    103547140 (COL4A4)
  09167 Endocrine and metabolic disease
   04933 AGE-RAGE signaling pathway in diabetic complications
    103547140 (COL4A4)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   04147 Exosome [BR:epz04147]
    103547140 (COL4A4)
   00536 Glycosaminoglycan binding proteins [BR:epz00536]
    103547140 (COL4A4)
Exosome [BR:epz04147]
 Exosomal proteins
  Exosomal proteins of other cancer cells
   103547140 (COL4A4)
Glycosaminoglycan binding proteins [BR:epz00536]
 Heparan sulfate / Heparin
  Extracellular matrix molecules
   103547140 (COL4A4)
Motif
Pfam: Collagen C4
Other DBs
NCBI-GeneID: 103547140
NCBI-ProteinID: XP_070474513
Position
5:complement(15196446..15318409)
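
The Position field reads like a GenBank-style location string. The following is a minimal Python sketch of how it could be parsed, assuming the form `chromosome:complement(start..end)` or `chromosome:start..end` with 1-based inclusive coordinates. The function name `parse_position` is hypothetical and not part of any KEGG tool.

```python
import re

# Hypothetical helper: parses a location of the assumed form
# "chrom:complement(start..end)" or "chrom:start..end".
_POSITION_RE = re.compile(
    r"(\w+):(?:complement\((\d+)\.\.(\d+)\)|(\d+)\.\.(\d+))"
)

def parse_position(pos: str) -> dict:
    m = _POSITION_RE.fullmatch(pos.strip())
    if m is None:
        raise ValueError(f"unrecognized position string: {pos!r}")
    chrom, c_start, c_end, p_start, p_end = m.groups()
    on_complement = c_start is not None
    start = int(c_start if on_complement else p_start)
    end = int(c_end if on_complement else p_end)
    return {
        "chromosome": chrom,
        "strand": "-" if on_complement else "+",
        "start": start,
        "end": end,
        # Assumes 1-based inclusive coordinates.
        "span": end - start + 1,
    }

info = parse_position("5:complement(15196446..15318409)")
print(info)
```

Under these assumptions the locus spans 121964 positions on the reverse strand of chromosome 5. That is far longer than the 5052 nt coding sequence listed in this entry, which would be consistent with the coordinates covering the full genomic locus rather than only the coding sequence.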
AA seq 1683 aa
MAFIRCSFRWTKPLAIDPWSLILILFSVQHVYGSGKKFIGPCGGRDCSACRCFPEKGSRG
QPGLPGLQGPIGPLGPPGPIGIPGEKGRRGDSGPPGGAGDKGDKGPTGVPGFPGLDGIPG
HPGPPGPRGTPGINGYNGSRGDPGLPGERGAPGPGGPPGLPGANGEKGNSVFIFGAIKGI
QGDRGDPGPPGLPGSRGSRGPVGPVGYPGAPGLTGPPGHPGSPGLKGNPGVGVKGQMGDP
GEIGQQGSPGPTLLVQPPDFSLYKGEKGIKGMPGMIGPPGPPGPKGVPGTGEKGEKGVPG
FPGPRGNPGSYGSPGFPGLKGEPGLSGEPGPFGFLGPKGDPGYRGDPGPPGVLTTPSLPL
KGPPGDPGFPGRHGEVGAIGPPGPPGHSGSPGEACAGMMGPPGPRGFPGHPGFPGAAGIP
GRADSAPGKSGNPGPPGLPGAPGLQGPPGSDAIYCTAGHPGPQGMKGKVGPPGRRGSKGE
KGSMGLCACQPGPMGPPGPPGLPGRQGSKGDLGLPGWLGEKGYPGPPGAEGSPGPPGKPG
ASGLPGSKGEKGDMVVSRVKGRKGERGPDGPPGFPGQQGQHGRDGRAGEKGDPGPPGDHE
AAAPGDRGFPGPPGPPGRAGPRGPPGLGFPGLPGQRGPPGAPGRPGSRGPEGVKGQKGDT
IPCNVTYPGRPGPPGFDGPPGPKGFPGPRGAPGLRCLDGVKGQRGQPGLSEIPGPPGFRG
DIGDPGLRGEKGSSPVGPPGSPGSPGVNGQKGIPGDPAYGHPGPPGRRGPSGMPGLKGLR
GDPGRPGAAGPAGMPGFPGLKGPKGREGSAGFPGIPGPPGHSCERGAPGIPGQPGLPGAP
GSPGAPGWKGQRGDVGPPGPAGMKGLPGVPGRPGADGPLGPPGVPGPFGDDGLPGLPGPK
GLQGLPGFPGFPGERGKPGPEGQPGRKGEIGEKGWPGFLGDPGLRGAKGSRGPPGDEGEV
VIIPRKGETGDPGPPGDGGFPGEGGDKGNPGMQGSRGEPGRHGPPGFHRGEPGRNGQPGL
PGPPGPPGSPGLRGIIGFPGFPGDQGQPGSPGPPGLSGVDGRRGPKGNKGDLASQFGPPG
PNGEPGSPGCPGHFGAPGEQGLPGVQGSRGPPGRPGLPGSSGPPGCPGNQGVPGLEGHPG
EMGDPGPRGLMGDPGTPGLPGIKGPPGSPGLNGLHGLKGQKGSEGASGLHEVGPRGPVGI
PGLKGETGDPGSPGISPPGLFGEKGPPGPPGRPGPPGPAGATGRAPTGDIPDSGPPGDQG
PPGPDGLRGTPGPPGPPGSVDLLRGEPGDRGLPGPPGPPGPPGPPGHKGFPGCDGKDGQK
GPMGFPGPPGPPGLPGPPGEKGLLGPPGRQGPSGPPGEPGPPADVDSCPRIPGLPGVPGP
RGPEGEMGPPGMRGPPGPGCKGEPGLDGRRGEDGIPGSPGPPGHKGDMGEAGCPGAPGPP
GPTGDPGPKGLGPGYLSGFLLVLHSQTDGEPTCPVGMPKLWTGYSLLYLEGQEKAHNQDL
GLAGSCLPVFSTLPFAYCNIHQVCHYAQRNDRSYWLASAAPLPMMPLSEEEIRPYISRCA
VCEAPAQAVAVHSQDQSIPACPRTWKSLWIGYSFLMHTGAGDQGGGQALMSPGSCLEDFR
AAPFLECQGRQGTCHFFANEYSFWLTTVKPDLEFSSAPSPDTLKESQAQRQKISRCQVCL
KYG
NT seq 5052 nt
atggcatttataagatgctctttcaggtggaccaagcccttggccatagatccctggtca
cttatactcattctcttttctgtacaacatgtgtatgggagcggaaagaagtttattggt
ccttgtggaggaagagattgctctgcttgccggtgttttcctgaaaaggggtctaggggt
caaccgggactcccagggttgcagggccctattggacccttgggacctccaggacccatt
ggaattccaggagagaaagggaggagaggtgacagtggccctcctggaggagcaggtgac
aaaggagataagggtccaactggtgtccccggatttccaggtttagacggcatacctggg
cacccaggccctcctggacccagaggcacacctggcataaatggctacaatggctcaaga
ggtgatccagggcttccaggagaaagaggcgctcctggcccaggaggccccccaggcctt
cctggggcaaatggagaaaaaggaaattcagtcttcatttttggtgccattaaaggtatt
cagggtgacagaggggacccaggacctcctggcttaccaggatcaaggggctccagagga
ccagtgggcccagtgggatatccgggagcgccggggttaacaggacctccgggccatcct
gggagtccaggtttgaagggtaatcctggcgtgggagtaaaggggcaaatgggagacccg
ggtgagattggccagcaaggttctccgggacccaccctgttggtacagccacctgatttt
agtctctataaaggagaaaagggtataaaaggaatgcctggaatgattggacctccggga
cctcctggacccaagggagtccctggtactggagaaaaaggagagaaaggtgttcctggg
tttccaggacctcggggtaaccctggttcctatggatctccaggttttccaggattaaag
ggggaaccaggactgtctggagagcctgggccatttggatttcttggtccaaagggggat
cctggatatcgcggggacccagggccaccaggtgttttgacaactccctctcttccactg
aaaggccctccaggggatccagggtttcctggccgccatggagaagtgggggctattgga
ccacctggtccccctggtcactcaggtagcccaggggaagcctgtgcaggcatgatggga
ccccctgggccacgagggtttcctggtcatccaggatttccaggggcagctggtattcct
gggagagctgattctgctccaggaaagtcagggaacccgggaccacctgggttgcctgga
gcaccagggctgcagggacctccaggatcagacgctatatattgtactgctgggcaccct
ggaccacaaggaatgaaaggcaaagtgggtcctccaggaagaagaggctcaaaaggtgaa
aaaggaagcatggggctctgtgcctgccagcccggtcccatgggcccaccaggccctcca
ggacttcctgggaggcagggtagtaagggagacttgggcctccctgggtggcttggagaa
aaaggttacccaggccctcctggtgcagaaggatctccagggccaccaggaaaacctggt
gcttcaggactacctggcagcaaaggagaaaagggcgacatggttgtatcaagagttaaa
gggcgcaaaggagaaagaggtcctgatgggcccccaggatttccaggacagcaaggacaa
catggtcgggatggtcgcgctggagaaaaaggggatccaggacccccaggggatcatgaa
gctgcagccccaggtgatcgagggtttcctggaccgccgggcccacctggcagagcagga
cctagggggcctccaggactgggatttcctggtctaccgggacagagagggccaccagga
gctccaggccgcccgggcagcaggggccctgagggcgtgaagggtcagaaaggtgataca
attccttgtaacgtaacctacccagggaggccaggccctccaggctttgatggacctcca
ggtccaaagggatttccaggtcctcggggcgctcctgggctgaggtgtttggatggggtt
aaaggtcagcgtggccaaccaggactatcagaaatacctggtccacctggctttcgtggt
gacataggcgatccaggtcttagaggtgaaaaggggtcctcccctgtggggcccccaggc
tctccagggtcacctggggtgaatggtcagaaaggaattcccggagaccctgcttatggc
cacccaggacccccgggaaggaggggtccttcagggatgccagggttgaaaggactcaga
ggtgatccaggacgtccaggagctgcagggccagctggcatgcccggattcccgggtctc
aaaggtcccaaaggcagagagggaagtgctgggtttccaggtatcccaggtccacctggc
cattcctgtgaaagaggcgctccagggatcccagggcaaccgggcctccctggggctcca
ggcagtccaggtgccccaggttggaaaggacagcgaggggatgtggggcctcctggacca
gctggaatgaagggcctccctggagtcccgggacggcctggggcagacggacccctggga
ccaccaggagtcccaggcccctttggggatgatggactacctggccttccaggcccaaag
ggactccaggggctgcctggcttcccagggtttccaggagagagaggaaagcctggtcca
gagggacaacctggccgaaagggagaaattggagagaagggttggcctggcttcctggga
gacccgggactgagaggagccaaaggatcgagaggacccccaggagatgaaggagaagtg
gttataattcccagaaagggggaaaccggggatcctggacctcctggagatggtggattc
ccaggagaaggaggtgataaaggcaatcctgggatgcaagggagcagaggagagccggga
agacacggaccacctggatttcacagaggagagcctggtagaaatgggcagccggggctt
cctggacccccaggccctccaggctccccggggctgagaggaatcattggttttccagga
tttccaggtgaccagggtcagccgggttctccagggccccctggactttcaggagttgat
ggaaggagaggacctaaaggaaacaaaggtgaccttgcaagtcagtttggcccccctggt
ccaaatggtgagccaggtagccctggatgtccaggacattttggagctcctggagagcag
ggcttgcctggtgttcaaggatccaggggaccacctggaagaccaggactgcctggctcc
tctggaccaccagggtgtccaggtaatcaaggggtgcctgggctggaaggacatccagga
gagatgggggatcctgggccaagaggcctcatgggggatccagggacaccaggtcttccg
ggaataaaaggtcccccagggtcacccggcctgaatggcttgcatggtttaaagggtcag
aaaggaagcgaaggcgcttcaggtttgcatgaagtgggtccacgcggcccagtgggaata
cctgggctgaaaggggagacgggagaccccgggagcccaggaatttctcctccagggctt
tttggagaaaaaggtcctccagggcccccagggagacctggaccacctggccctgcaggt
gccacaggaagagctcctacgggtgacattcctgactcgggtccacctggagatcaggga
cctcctggccccgatggtctgagaggaacacctgggcctcccggcccccctgggagtgtc
gaccttctgagaggggaaccaggtgaccgtggtcttccagggccaccaggtcccccaggc
cccccaggccctccaggacacaaaggcttcccagggtgtgatggaaaagacggccagaaa
ggacccatgggattcccggggccgccggggccacctggacttcctgggccacctggagag
aagggtttgctgggacctccagggagacaggggccctctggccctccgggtgaacctggg
ccacctgcagatgtggattcctgcccccgaatcccggggcttcctggggtaccaggccca
agaggcccagaaggagagatggggccccctggaatgaggggccccccaggaccagggtgc
aaaggagagcctgggctggatggcaggaggggcgaggatggcatccctgggtctcccggg
cctccaggacacaaaggtgacatgggagaagctggctgccctggagcaccaggccctcct
gggcccactggggatcctgggcccaaagggcttggccctggatacctcagtggcttcctc
ctggttctccacagtcagacagatggagagcccacctgccccgtgggcatgcccaagctc
tggacgggctatagtctgttatacctggaagggcaggagaaagctcacaatcaagacctc
ggcctggcgggatcttgccttcctgtgtttagtacactgccctttgcctactgcaacatc
caccaagtgtgccactatgcccagagaaatgacagatcctattggctggccagcgctgca
cccctgcccatgatgccactctcggaagaggagatccgcccctacatcagccgctgtgct
gtgtgtgaggccccggctcaggcggtggcggtgcacagccaggaccagtccatccccgcg
tgtccgcggacctggaagagcctttggatcgggtactcattcctgatgcacacaggagct
ggggaccaaggaggagggcaggccctcatgtcacctggcagctgcctggaggacttcaga
gcggcaccattccttgaatgccaaggccggcagggcacttgccacttttttgcaaatgag
tatagcttctggctgacaacagtgaaacccgacttggagttctcctcggcaccatcacca
gacaccttaaaagaaagccaggcccagcgccagaaaatcagcaggtgccaggtctgtctg
aagtatggctag
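
The two stated lengths (1683 aa and 5052 nt) can be cross-checked with a short Python sketch. It assumes the nucleotide sequence is a complete coding sequence that includes the terminal stop codon and uses the standard stop codons. The function name `expected_cds_length` is hypothetical.

```python
STOP_CODONS = {"taa", "tag", "tga"}  # standard genetic code, lowercase DNA

def expected_cds_length(aa_len: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode aa_len residues, optionally plus a stop codon."""
    codons = aa_len + (1 if include_stop else 0)
    return 3 * codons

AA_LEN = 1683            # from the "AA seq" header of this entry
NT_LEN = 5052            # from the "NT seq" header of this entry
NT_TAIL = "aagtatggctag" # last line of the NT sequence in this entry

lengths_agree = expected_cds_length(AA_LEN) == NT_LEN
ends_with_stop = NT_TAIL[-3:] in STOP_CODONS
print(lengths_agree, ends_with_stop)
```

Here 3 x (1683 + 1) = 5052, and the final codon `tag` is a stop codon, so the two sequences are consistent in length under the stated assumption.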
