KEGG   Capricornis sumatraensis (Sumatran serow): 138088977
Entry
138088977         CDS       T10522                                 
Symbol
COL4A1
Name
(RefSeq) collagen alpha-1(IV) chain isoform X1
KO
K06237  collagen type IV alpha
Organism
csum  Capricornis sumatraensis (Sumatran serow)
Pathway
csum04151  PI3K-Akt signaling pathway
csum04382  Cornified envelope formation
csum04510  Focal adhesion
csum04512  ECM-receptor interaction
csum04518  Integrin signaling
csum04820  Cytoskeleton in muscle cells
csum04926  Relaxin signaling pathway
csum04933  AGE-RAGE signaling pathway in diabetic complications
csum04974  Protein digestion and absorption
csum05146  Amoebiasis
csum05165  Human papillomavirus infection
csum05200  Pathways in cancer
csum05222  Small cell lung cancer
Brite
KEGG Orthology (KO) [BR:csum00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    138088977 (COL4A1)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    138088977 (COL4A1)
   04518 Integrin signaling
    138088977 (COL4A1)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    138088977 (COL4A1)
  09142 Cell motility
   04820 Cytoskeleton in muscle cells
    138088977 (COL4A1)
 09150 Organismal Systems
  09152 Endocrine system
   04926 Relaxin signaling pathway
    138088977 (COL4A1)
  09154 Digestive system
   04974 Protein digestion and absorption
    138088977 (COL4A1)
  09158 Development and regeneration
   04382 Cornified envelope formation
    138088977 (COL4A1)
 09160 Human Diseases
  09161 Cancer: overview
   05200 Pathways in cancer
    138088977 (COL4A1)
  09162 Cancer: specific types
   05222 Small cell lung cancer
    138088977 (COL4A1)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    138088977 (COL4A1)
  09174 Infectious disease: parasitic
   05146 Amoebiasis
    138088977 (COL4A1)
  09167 Endocrine and metabolic disease
   04933 AGE-RAGE signaling pathway in diabetic complications
    138088977 (COL4A1)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   04147 Exosome [BR:csum04147]
    138088977 (COL4A1)
   00536 Glycosaminoglycan binding proteins [BR:csum00536]
    138088977 (COL4A1)
Exosome [BR:csum04147]
 Exosomal proteins
  Exosomal proteins of other cancer cells
   138088977 (COL4A1)
Glycosaminoglycan binding proteins [BR:csum00536]
 Heparan sulfate / Heparin
  Extracellular matrix molecules
   138088977 (COL4A1)
Motif
Pfam: Collagen C4
Other DBs
NCBI-GeneID: 138088977
NCBI-ProteinID: XP_068840370
Position
12:complement(92999890..93129961)
AA seq 1669 aa
MGPRLGVWLLLALAALLLHEESSRAAAKGGCAGSGCGKCDCHGVKGQKGERGLPGLQGVI
GFPGMQGPEGPQGPPGQKGDTGEPGLPGTKGTRGPSGVPGYPGNPGLPGIPGQDGPPGPP
GIPGCNGTKGERGPVGPPGLPGFAGNPGPPGLPGMKGDPGEILGHIPGTLLKGERGYPGQ
PGAPGSPGLPGLQGPVGPPGFTGPPGPPGPPGPPGEKGQMGLSFQGPKGEKGDQGVSGPP
GLPGQAQVITKGDTAMRGEKGQKGEPGFPGLQGFGEKGEPGKPGPRGKPGKDGEKGEKGS
QGFPGDSGYPGQPGREGLKGEKGEAGPPGLPGTVIGTGPLGEKGEPGYPGGPGAKGETGP
KGFPGIPGQPGPPGFPTPGLIGAPGFPGDRGEKGEPGLPGVSLPGPSGRDGLPGPPGPPG
PPGQPGHTNGLVECQPGPPGDQGPPGIPGQPGLTGEVGEKGQKGDSCLVCDTAELRGPPG
PQGPPGEIGFPGQPGAKGDRGLPGRDGLEGTPGPQGAPGLMGQPGAKGEPGEIYFDIRLK
GDKGDPGLPGQPGMPGRAGSPGRDGQPGLPGPRGSPGSVGLKGERGPPGGVGFPGSRGDI
GPPGPPGFGPIGPIGDKGQMGFPGNPGAPGQPGPKGEAGKVVPLPGPPGAEGLPGSPGFQ
GPQGDRGFPGSPGRPGLPGEKGAIGQPGIGFPGPPGPKGVDGLPGDAGPPGNPGRQGFNG
LPGNPGPPGQKGEPGVGLPGLKGLPGIPGIPGTPGEKGNVGGPGVPGEHGAIGPPGLQGL
RGDPGPPGLQGPKGAPGVPGIGPPGAMGPPGGQGPPGSSGPPGVKGEKGFPGFPGLDMPG
PKGDKGSQGLPGLTGQSGLPGLPGQQGSPGQPGIPGPKGEMGVMGTPGQPGSPGPAGVPG
LPGAKGDHGFPGSSGPRGDPGFKGDKGDVGLPGKPGSMDKVDMGSMKGEKGDQGEKGQTG
PTGDKGSRGDPGTPGVPGKDGQAGHPGQPGPKGDPGVSGIPGAPGLPGPKGSTGGMGLPG
MPGPKGVAGIPGPQGIPGLPGDKGAKGEKGQAGLPGIGIPGRPGEKGDQGLAGFPGSPGE
KGEKGSTGIPGMPGAPGPKGSPGSVGYPGSPGLPGEKGDKGLPGLDGIPGIKGEAGLPGK
PGPTGPAGQKGEPGSDGIPGSVGEKGEAGLPGRGFPGFPGGKGEKGSKGDVGFPGLAGSP
GIPGSKGEQGFMGPPGPQGQPGLPGTPGHAVEGPKGDRGPQGQPGLPGHPGPMGPPGLPG
LDGLKGDKGNPGWPGTPGAPGPKGDPGFQGMPGIGGSPGITGAKGDTGPPGVPGFHGQKG
APGLQGVKGDQGDQGFPGTKGLPGPPGPPGPFNIIKGEPGLPGPEGPAGLKGLQGPPGPK
GQQGVTGSAGLPGPPGEPGFDGAPGQKGETGPFGPPGPRGFPGPPGPDGLPGSMGPPGTP
SVDHGFLVTRHSQTTDDPQCPPGTKILYHGYSLLYVQGNERAHGQDLGTAGSCLRKFSTM
PFLFCNINNVCNFASRNDYSYWLSTPEPMPMSMAPITGENIRPFISRCAVCEAPAMVMAV
HSQTIQIPQCPAGWSSLWIGYSFVMHTSAGAEGSGQALASPGSCLEEFRSAPFIECHGRG
TCNYYANAYSFWLATIERSEMFKKPTPSTLKAGELRTHVSRCQVCMRRT
NT seq 5010 nt
atggggccccggctcggcgtgtggctgctgctggcgctcgccgcgctcctgctccacgag
gagagcagccgggccgccgcgaagggtgggtgtgctggctctggctgcgggaagtgtgac
tgccatggcgtgaaaggacagaagggagaaagaggcctcccggggttacagggggtcatc
gggtttcccggaatgcaaggacccgaggggccgcagggaccaccgggacagaagggtgac
accggcgagcccggactgccaggcactaaagggacaagaggaccctcaggagtgcctggt
taccctggaaacccaggacttcctggtattcctggccaggacggtcctccgggtccccca
ggtattccaggatgtaacgggacaaagggtgagagagggcccgtgggacctcccggtttg
cctggattcgctggaaatcctggaccaccagggttaccgggaatgaagggagatccaggt
gagattctgggccatataccagggaccctgctgaaaggcgaaagaggatatcctggacag
ccgggagcgcctggttcaccaggcctgccaggactgcaaggccccgtcgggcccccagga
ttcaccgggccaccaggccctccaggccctcctggccctccaggcgaaaaggggcaaatg
ggcttgagctttcaagggccgaaaggtgaaaagggtgatcaaggggtcagcgggcccccg
ggattaccaggacaggctcaagtcatcacgaaaggggacacagccatgcgcggcgagaag
ggtcaaaaaggtgaacccggatttccggggctgcaagggtttggagagaaaggagaacct
ggaaaaccagggccccgcggaaaaccaggaaaagacggtgaaaaaggagaaaaaggcagt
caagggtttccgggcgattcggggtacccaggacagccaggccgagaaggtttaaaggga
gagaaaggtgaagcaggtcctcccgggctgcctggcactgttattggcacaggacccttg
ggagagaaaggagagcccgggtacccagggggcccaggggcgaaaggggagacaggtccc
aaaggtttcccaggaataccaggccagccaggccctccaggcttcccgactccagggctg
attggtgcccccggcttccccggcgacagaggagagaagggtgaaccgggcttaccaggc
gtgtcgctgccaggacccagcggaagggacgggcttcccggcccccccgggccccccggg
ccccctgggcagccgggccacacaaatggactcgtggaatgccagcctgggccgccaggg
gaccagggtcctcccggaattccagggcagccggggttgacgggcgaagttggagaaaaa
ggtcaaaaaggagacagctgcctcgtctgtgacacagcagagcttcgtgggcccccaggg
ccacagggaccccccggagaaataggtttcccaggacaaccaggggccaagggagacaga
ggcttacccggcagggatggtctggaaggaacgcctggtcctcaaggtgcgccagggctc
atgggccagccgggagccaagggcgagcccggcgagatctacttcgacatacggctcaag
ggcgacaaaggagaccccggcttaccaggccagcccggcatgccaggcagagcgggctcc
cctggaagagacggccaaccgggccttcccggccccagaggctccccgggttcagtagga
ttgaaaggggagcgtggccccccgggaggcgttggattccctgggagccgtggcgacatc
ggccctccggggcctccaggcttcggcccgattggccccattggtgacaaaggacaaatg
ggcttcccaggaaaccctggggcccccggccagccaggtcccaagggagaggcgggaaaa
gttgtgcccttgcctggcccccctggagcagaaggacttcccgggtcccccggcttccag
gggccacaaggtgaccgaggttttcctggaagccctggaaggccgggcctccccggagag
aagggcgccatcggccagcctgggattggatttcctgggcctcccggccccaaaggcgtt
gatggtttacctggcgatgctggacctcctgggaatccgggtcgtcaaggcttcaacggc
ttacctggcaaccccggtccacctggccagaagggcgagcctggagtcggtctgccggga
ctcaaaggcctgcctgggatacctggcatccctggcacccctggggagaagggaaacgtc
ggaggaccaggcgttcctggagagcacggtgccatcggtcccccaggcctccaggggctc
agaggtgacccgggacctcctggattgcaaggccccaaaggagctccgggagtccccgga
atcggcccccctggagcaatgggcccccccggaggacagggacccccagggtcatcaggc
ccccccggagtgaaaggagagaaaggcttccctggcttcccaggtctggacatgccgggg
cccaaaggagacaaagggtcccaggggctccctggcctgacggggcagtcggggctgcct
ggccttcctggacagcagggctcccccggccagcctggcattccaggtcccaagggagag
atgggagtcatggggactccggggcagcccggctcgccaggaccagcgggtgtgccagga
ttgccgggtgccaaaggggaccatggcttccccggctcttcaggacccaggggagaccct
ggcttcaagggtgacaaaggcgacgtggggctccccggaaagccaggctccatggacaag
gtggacatgggcagcatgaagggcgagaagggggaccaaggcgagaaaggacagactggt
ccgactggcgataaaggatcccgcggagacccgggaacgccaggtgtgccgggaaaggac
ggccaggcaggacaccccgggcagccaggacctaaaggtgatccaggtgtgagcgggatc
cctggtgctccaggacttcctggtcccaaaggatccactggtggaatgggcctcccagga
atgccaggaccaaaaggtgtggctggcatccccggcccgcagggcattcctggcttacct
ggagacaagggggcaaaaggagagaaagggcaggcgggtctgcctggcattgggattcca
ggacggcctggggagaagggagaccagggccttgcaggatttcccggaagccccggcgag
aagggagagaaaggaagcacggggatcccagggatgcccggggctccgggccccaaaggc
tccccgggcagtgttggctatccgggaagccctgggttgcctggggagaaaggtgacaag
ggcctcccgggactggatggcattcctggcatcaaaggagaagcaggtcttcctgggaag
cctggccccacgggcccagccggccagaaaggggagcccggcagcgatggaatcccaggg
tcggtgggagagaagggcgaggcaggtctacctggaagaggattcccagggtttccaggg
ggcaaaggagagaaaggttcaaagggcgatgtgggcttcccaggattagctgggagccca
ggaattcctggatccaaaggagaacaaggattcatgggtcccccgggaccacagggacag
ccgggattgccagggaccccaggccacgcggtagaggggcccaaaggagaccgcggcccg
caaggacaacccggcctgccagggcatccgggacccatggggcctccaggcctccccggg
ctcgatgggctgaaaggtgacaaggggaacccaggctggccgggcactccgggagctcca
gggcccaagggagacccaggattccagggcatgccgggcattggcggctctccaggaatc
acaggagctaagggagatacgggacccccaggagttccagggtttcacggtcagaaaggc
gcccccggcctgcagggagtcaaaggtgaccaaggagaccaaggcttcccgggaacgaaa
ggtcttcccggccccccgggccccccaggtccgttcaacatcatcaagggggaaccaggg
ctccctggtcccgagggccccgcgggtctgaaagggcttcagggacctccaggcccgaaa
ggacagcaaggtgtgacgggatccgcgggcttgcctgggcccccaggtgagcccggcttt
gacggcgcccccggccagaaaggagagactgggcccttcggccctcccggtccacgaggc
ttcccgggtccgcccggccccgacgggctgccggggtccatgggtcccccgggcaccccg
tcagtcgatcatggtttccttgtgacccggcacagtcagacgacagacgacccccagtgc
cctcctgggaccaaaatcctctaccacggctactctttgctctacgtgcaaggcaacgag
cgggcgcacggccaggacttgggcacggcgggcagctgcctgcggaagttcagcaccatg
cccttcctcttctgcaacatcaacaacgtctgcaacttcgcctcccgcaacgactactcg
tactggctgtccacgccggagcccatgcccatgtccatggcccccatcaccggggagaac
atccggcccttcatcagcaggtgtgctgtgtgtgaggccccagcgatggtgatggccgtg
cacagccagaccatccagattccgcagtgccccgccggctggtcctcgctctggatcggc
tactccttcgtgatgcacaccagcgccggggctgaaggctctggccaagccctcgcctcc
cccggctcgtgtctggaggagttcagaagcgcccccttcatcgagtgccacggccgcggg
acttgcaattactacgcaaacgcttacagcttttggcttgccacgatagagcggagcgag
atgttcaagaagcccacgccgtccacgctgaaggccggggagctgcgcacgcacgtcagc
cggtgccaggtgtgcatgcggaggacatga
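
Note: the two sequence headers in this entry (1669 aa and 5010 nt) are consistent with a complete coding sequence, since 3 x (1669 + 1) = 5010 when one stop codon is counted. The snippet below is a minimal sketch of that sanity check, not part of the KEGG record. The function names are hypothetical, and it assumes a single terminal stop codon from the standard genetic code and an ATG start codon.

```python
# Hypothetical helpers for sanity-checking a CDS record; not a KEGG API.
STOP_CODONS = {"taa", "tag", "tga"}  # standard genetic code (assumption)


def expected_cds_length(aa_count: int) -> int:
    """Nucleotides needed to encode aa_count residues plus one stop codon."""
    return 3 * (aa_count + 1)


def cds_looks_consistent(nt_seq: str, aa_count: int) -> bool:
    """Check length, ATG start, and terminal stop codon of a CDS string.

    Whitespace and line breaks in nt_seq are ignored, so the wrapped
    sequence block of a flat record can be passed in directly.
    """
    nt = "".join(nt_seq.split()).lower()
    return (
        len(nt) == expected_cds_length(aa_count)
        and nt.startswith("atg")
        and nt[-3:] in STOP_CODONS
    )


# Header values from this entry: 1669 aa and 5010 nt.
print(expected_cds_length(1669))  # 5010
```

Passing the full NT seq block above together with 1669 to `cds_looks_consistent` would apply the same check to the actual sequence. The sequence begins with `atg` and ends with `tga`, which matches the assumptions stated here.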
