KEGG   Phocoena sinus (vaquita): 116759245
Entry
116759245         CDS       T07759                                 
Symbol
COL1A2
Name
(RefSeq) collagen alpha-2(I) chain isoform X1
  KO
K06236  collagen type I alpha
Organism
psiu  Phocoena sinus (vaquita)
Pathway
psiu04151  PI3K-Akt signaling pathway
psiu04510  Focal adhesion
psiu04512  ECM-receptor interaction
psiu04611  Platelet activation
psiu04926  Relaxin signaling pathway
psiu04933  AGE-RAGE signaling pathway in diabetic complications
psiu04974  Protein digestion and absorption
psiu05146  Amoebiasis
psiu05165  Human papillomavirus infection
psiu05205  Proteoglycans in cancer
psiu05415  Diabetic cardiomyopathy
Brite
KEGG Orthology (KO) [BR:psiu00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    116759245 (COL1A2)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    116759245 (COL1A2)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    116759245 (COL1A2)
 09150 Organismal Systems
  09151 Immune system
   04611 Platelet activation
    116759245 (COL1A2)
  09152 Endocrine system
   04926 Relaxin signaling pathway
    116759245 (COL1A2)
  09154 Digestive system
   04974 Protein digestion and absorption
    116759245 (COL1A2)
 09160 Human Diseases
  09161 Cancer: overview
   05205 Proteoglycans in cancer
    116759245 (COL1A2)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    116759245 (COL1A2)
  09174 Infectious disease: parasitic
   05146 Amoebiasis
    116759245 (COL1A2)
  09166 Cardiovascular disease
   05415 Diabetic cardiomyopathy
    116759245 (COL1A2)
  09167 Endocrine and metabolic disease
   04933 AGE-RAGE signaling pathway in diabetic complications
    116759245 (COL1A2)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   00536 Glycosaminoglycan binding proteins [BR:psiu00536]
    116759245 (COL1A2)
Glycosaminoglycan binding proteins [BR:psiu00536]
 Heparan sulfate / Heparin
  Extracellular matrix molecules
   116759245 (COL1A2)
SSDB
Motif
Pfam: Collagen COLFI
Other DBs
NCBI-GeneID: 116759245
NCBI-ProteinID: XP_032498638
Ensembl: ENSPSNG00000005091
LinkDB
Position
9:58326232..58361775
AA seq 1366 aa
MLSFVDTRTLLLLAVTSCLTTCQSLQEATARKGPTGDRGPRGERGPPGPPGRDGDDGIPG
PPGPPGPPGPPGLGGNFAAQYDGKGVGIGPGPMGLMGPRGPPGASGAPGPQGFQGPPGEP
GEPGQTGPAGARGPPGPPGKAGEDGHPGKPGRPGERGVVGPQGARGFPGTPGLPGFKGIR
GHNGLDGLKGQPGTPGVKGEPGAPGENGIPGQIGARGLPGERGRVGAPGPAGARGSDGSV
GPVGPAGPLGSAGPPGFPGAPGPKGELGPVGNPGPAGPAGSRGEVGLPGVSGPVGPPGNP
GANGLHGAKGAAGLPGVAGAPGLPGPRGIPGPVGAAGATGARGLVGEPGPAGSKGESGNK
GEPGAAGPTGPPGPSGEEGKRGTTGEIGSAGPPGPPGLRGNPGSRGLPGADGRAGAMGPH
GSRGGTGPAGVRGPSGDSGRPGEPGLMGPRGFPGSPGNVGPAGKEGPMGLPGIDGRPGPI
GPAGTRGEPGNIGFPGPKGPTGDPGKNGEKGHAGLAGPRGAPGPDGNNGAQGPPGLQGVS
GGKGEQGPAGPPGFQGLPGPAGTAGEAGKAGERGIPGEFGLPGPAGPRGERGPPGESGAA
GPAGPIGSRGPSGPAGPDGNKGEPGVVGAPGTAGPSGPNGLPGERGAAGIPGGKGEKGET
GLRGDAGSHGRDGARGAPGAVGAPGPAGANGDRGEAGPAGPAGPAGPRGSPGERGEVGPA
GPNGFAGPAGAAGQPGAKGERGTKGPKGENGPTGPTGPVGAAGPAGPNGPPGPAGSRGDG
GPPGATGFPGAAGRTGPPGPSGITGPPGPPGPAGKEGLRGPRGDQGPVGRTGETGASGPP
GFVGEKGPSGEPGTAGSPGTPGPQGLLGAPGFLGLPGSRGERGLPGVAGSVGEPGPLGIA
GPTGARGPPGAVGNPGVNGAPGEAGRDGNPGNDGPPGRDGQAGHKGDRGYPGNAGPTGTV
GAPGPQGPVGPTGKHGNRGEPGPSGPVGLAGAVGPRGPSGPQGIRGDKGEPGDKGPRGLP
GLKGHNGLQGLPGLAGHHGDQGAPGTVGPAGPRGPSGPSGPSGKDGRTGHPGAVGPAGIR
GSQGSQGPSGPPGPPGPPGPPGPSGGGYDFGFEGDFYRADQPRSPPSLRPKDYEVDATLK
SLNNQIETLLTPEGSRKNPARTCRDLRLSHPEWSSGYYWVDPNQGCTMDAIKVYCDFSTG
ETCIRAQPENIPVKNWYRSSKAKKHVWVGETINGGTQFEYNVEGVTTKEMATQLAFMRLL
ANHASQNITYHCKNSIAYMDEETGNLKKAVILQGSNDVELVAEGNSRFTYTVLVDGCSKK
INEWRKTIIEYKTNKPSRLPILDIAPLDIGGADQEIRLNIGPVCFK
NT seq 4101 nt   +upstreamnt  +downstreamnt
atgctcagctttgtggatacgcggactctgttgctgcttgcagtaacttcgtgcctaaca
acatgccaatctttacaagaggcaactgcaagaaagggcccaactggagatagaggacca
cgtggagaaaggggtccgccaggcccaccaggcagagatggtgatgatggtatcccaggc
ccacctggtccacctggtcctcctggtccccctggtcttggtgggaactttgctgctcag
tatgatggaaaaggagttggaattggccctggaccaatgggtttgatgggacctagaggc
cctcctggtgcatctggagcccctggccctcaaggtttccaaggacctcctggtgagcct
ggtgagcctggtcaaactggccctgcaggtgctcgtggtccacctggccctcctggcaag
gctggtgaggatggtcaccctggaaaacctggacggcctggtgagagaggagttgttggg
ccacagggtgctcgtggtttccctggaactcctggactccctggcttcaagggcattagg
ggtcacaatggtctggacgggttgaagggacagcctggtactccaggtgtgaagggtgaa
cctggtgcccctggtgaaaatggaattccaggtcaaataggagctcgtgggcttcctggt
gagagaggacgtgtcggtgcccctggcccagctggtgcccgtggaagtgatggaagtgtg
ggtcctgtgggtcctgctggtcctcttgggtctgctggccctccaggcttcccaggtgct
cctggccccaagggtgaacttggacctgtcggtaaccctggtcctgctggtcccgcgggt
tcccgtggtgaagtgggtcttccaggtgtttctggccctgttggacctcctggcaaccct
ggagccaatggccttcacggtgctaagggtgctgccggccttccaggcgttgctggggct
cctggcctccctggaccccggggtattcctggccctgttggtgctgctggtgctactggt
gccagaggacttgttggtgagcctggtccagctggttccaaaggagagagtggcaacaag
ggcgagcctggtgctgctgggcccacaggtcctcctggtcccagtggcgaagaaggaaag
agaggcaccactggtgaaattggatccgctggccccccaggacctcctggtctgagggga
aatcctggttctcgtggtcttcctggagccgacggcagagctggtgccatgggccctcat
ggtagtcgtggtggaactggccctgctggtgtgcgaggtcccagtggagattctggtcgc
cctggagagcctggcctcatgggaccccgaggttttcctggttcccctggaaatgttggc
ccagctggtaaagaaggtccgatgggcctccctggcattgatggcaggcctggaccaatt
ggcccagctggaacaagaggagagcctggcaacatcggattccctggacccaaaggcccc
actggtgatcctggcaaaaatggtgaaaaaggtcatgctggtcttgccggtcctcggggt
gctccaggtcctgatggaaacaatggtgctcagggacctcctggactacagggtgtctca
ggtggaaaaggtgaacagggtcccgctggtcctccaggcttccagggtctgcctggccct
gcaggtacagctggtgaagctggcaaagcaggagaaaggggtatccctggtgaatttggt
ctccctggtcctgctggtccaagaggagagcgtggtcccccaggtgaaagtggtgctgct
ggtcctgctggtcctattggaagccgaggtccttctggacctgcagggcctgatgggaac
aagggcgaacctggtgtggttggtgctccaggcactgctggtccatctggtcctaatgga
ctcccaggagaaaggggtgctgctggcatacctggaggcaagggagaaaagggtgaaact
ggtctcagaggtgatgctggtagccacggcagagacggtgctcgtggtgctcctggtgct
gtaggtgcccctggtcctgctggagccaacggggaccggggtgaagctggtcctgctggt
cccgctggtcccgctggtcctcgtggtagccctggtgaacgtggtgaggtcggtcccgct
ggccccaatggatttgctggtcctgctggcgctgctggtcaacctggtgctaaaggagag
agaggaaccaaaggacccaagggtgaaaacggccctactggtcccacaggccccgttgga
gctgctggcccagctggtccaaatggtccccctggtcctgctggaagtcgtggtgatgga
ggcccccctggtgctacgggtttccctggtgctgctggaaggactggtcctcctggaccc
tctggtatcactggcccccctggtccccctggtcctgctggtaaagaaggacttcgtggg
cctcgtggtgaccaaggtccagttggtcgaactggagaaacaggtgcttctggcccccct
ggctttgttggtgagaagggtccctctggagagcctggtactgctggatctcctggcacc
ccaggtcctcaaggtcttcttggtgctcctggttttctgggtctcccaggctctagaggt
gaacgtggtctaccaggtgttgctggatctgtgggtgaacctggccctctcggcattgca
ggcccaactggggcccgtggtccccctggtgctgtgggtaatcctggcgtcaatggtgct
cctggtgaagctggtcgtgatggcaaccctgggaatgatggtcccccaggccgcgatggt
caagctggacacaagggggatcgtggttaccctggtaacgctggtcccactggtactgtg
ggtgcacctggtcctcaaggccctgtgggtccaactggcaaacatggaaaccgcggtgaa
cctggtccttctggtcccgttggtctggccggtgctgttggtccaagaggtcctagtggc
ccacaaggtattcgaggtgataagggagagcctggtgataaggggcccagaggtcttcct
ggcttaaagggacacaatggattgcagggtcttcctggtcttgctggtcatcatggcgat
caaggtgctcctggcactgtgggtcctgctggtcctaggggcccttctggtccttctggc
ccttctggcaaggacggtcgcactggacatcctggtgcagtcggacctgctggcattcgt
ggctctcagggtagccaaggtccttctggccctcctggtcctcctggccctcctggccct
cctggccccagtggtggtggttatgacttcggttttgaaggagacttctacagggctgac
cagcctcgctcaccaccatctctcagacccaaggattatgaagttgatgctactctgaaa
tctctcaacaaccagattgagactcttcttactccagaaggctctaggaagaacccagcg
cgcacatgccgtgacttgagactcagccacccagaatggagcagtggttactactgggtt
gaccctaaccaaggatgtactatggatgctatcaaagtatactgtgatttctctactggc
gaaacctgcatccgggctcagcctgaaaacatcccagtcaagaactggtacagaagttcc
aaggccaagaagcacgtctgggtaggagaaactatcaatggtggtacccagtttgaatat
aatgttgaaggagtaactaccaaggaaatggctacccaacttgccttcatgcgcctgctg
gccaaccatgcctctcaaaacatcacctaccattgcaagaacagcattgcatacatggat
gaggagactggcaacctgaaaaaggctgtcattctgcaaggatccaatgatgttgaactt
gttgctgagggcaacagcagattcacttacactgttcttgtagatggctgctctaaaaag
ataaatgaatggagaaagacaatcattgaatataaaacaaataagccatctcgcctgccc
atccttgatattgcacctttggacatcggtggcgctgaccaagaaatcagattgaacatt
ggcccagtctgtttcaaataa

DBGET integrated database retrieval system