KEGG   Pan troglodytes (chimpanzee): 452897
Entry
452897            CDS       T01005                                 

Gene name
SOS2
Definition
(RefSeq) son of sevenless homolog 2 isoform X1
  KO
K03099  son of sevenless
Organism
ptr  Pan troglodytes (chimpanzee)
Pathway
ptr01521  EGFR tyrosine kinase inhibitor resistance
ptr01522  Endocrine resistance
ptr04010  MAPK signaling pathway
ptr04012  ErbB signaling pathway
ptr04014  Ras signaling pathway
ptr04062  Chemokine signaling pathway
ptr04068  FoxO signaling pathway
ptr04072  Phospholipase D signaling pathway
ptr04150  mTOR signaling pathway
ptr04151  PI3K-Akt signaling pathway
ptr04510  Focal adhesion
ptr04540  Gap junction
ptr04630  JAK-STAT signaling pathway
ptr04650  Natural killer cell mediated cytotoxicity
ptr04660  T cell receptor signaling pathway
ptr04662  B cell receptor signaling pathway
ptr04664  Fc epsilon RI signaling pathway
ptr04714  Thermogenesis
ptr04722  Neurotrophin signaling pathway
ptr04810  Regulation of actin cytoskeleton
ptr04910  Insulin signaling pathway
ptr04912  GnRH signaling pathway
ptr04915  Estrogen signaling pathway
ptr04917  Prolactin signaling pathway
ptr04926  Relaxin signaling pathway
ptr04935  Growth hormone synthesis, secretion and action
ptr05034  Alcoholism
ptr05160  Hepatitis C
ptr05161  Hepatitis B
ptr05163  Human cytomegalovirus infection
ptr05165  Human papillomavirus infection
ptr05200  Pathways in cancer
ptr05205  Proteoglycans in cancer
ptr05206  MicroRNAs in cancer
ptr05210  Colorectal cancer
ptr05211  Renal cell carcinoma
ptr05213  Endometrial cancer
ptr05214  Glioma
ptr05215  Prostate cancer
ptr05220  Chronic myeloid leukemia
ptr05221  Acute myeloid leukemia
ptr05223  Non-small cell lung cancer
ptr05224  Breast cancer
ptr05225  Hepatocellular carcinoma
ptr05226  Gastric cancer
ptr05231  Choline metabolism in cancer
Brite
KEGG Orthology (KO) [BR:ptr00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04010 MAPK signaling pathway
    452897 (SOS2)
   04012 ErbB signaling pathway
    452897 (SOS2)
   04014 Ras signaling pathway
    452897 (SOS2)
   04630 JAK-STAT signaling pathway
    452897 (SOS2)
   04068 FoxO signaling pathway
    452897 (SOS2)
   04072 Phospholipase D signaling pathway
    452897 (SOS2)
   04151 PI3K-Akt signaling pathway
    452897 (SOS2)
   04150 mTOR signaling pathway
    452897 (SOS2)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    452897 (SOS2)
   04540 Gap junction
    452897 (SOS2)
  09142 Cell motility
   04810 Regulation of actin cytoskeleton
    452897 (SOS2)
 09150 Organismal Systems
  09151 Immune system
   04650 Natural killer cell mediated cytotoxicity
    452897 (SOS2)
   04660 T cell receptor signaling pathway
    452897 (SOS2)
   04662 B cell receptor signaling pathway
    452897 (SOS2)
   04664 Fc epsilon RI signaling pathway
    452897 (SOS2)
   04062 Chemokine signaling pathway
    452897 (SOS2)
  09152 Endocrine system
   04910 Insulin signaling pathway
    452897 (SOS2)
   04912 GnRH signaling pathway
    452897 (SOS2)
   04915 Estrogen signaling pathway
    452897 (SOS2)
   04917 Prolactin signaling pathway
    452897 (SOS2)
   04926 Relaxin signaling pathway
    452897 (SOS2)
   04935 Growth hormone synthesis, secretion and action
    452897 (SOS2)
  09156 Nervous system
   04722 Neurotrophin signaling pathway
    452897 (SOS2)
  09159 Environmental adaptation
   04714 Thermogenesis
    452897 (SOS2)
 09160 Human Diseases
  09161 Cancer: overview
   05200 Pathways in cancer
    452897 (SOS2)
   05206 MicroRNAs in cancer
    452897 (SOS2)
   05205 Proteoglycans in cancer
    452897 (SOS2)
   05231 Choline metabolism in cancer
    452897 (SOS2)
  09162 Cancer: specific types
   05210 Colorectal cancer
    452897 (SOS2)
   05225 Hepatocellular carcinoma
    452897 (SOS2)
   05226 Gastric cancer
    452897 (SOS2)
   05214 Glioma
    452897 (SOS2)
   05221 Acute myeloid leukemia
    452897 (SOS2)
   05220 Chronic myeloid leukemia
    452897 (SOS2)
   05211 Renal cell carcinoma
    452897 (SOS2)
   05215 Prostate cancer
    452897 (SOS2)
   05213 Endometrial cancer
    452897 (SOS2)
   05224 Breast cancer
    452897 (SOS2)
   05223 Non-small cell lung cancer
    452897 (SOS2)
  09165 Substance dependence
   05034 Alcoholism
    452897 (SOS2)
  09172 Infectious disease: viral
   05161 Hepatitis B
    452897 (SOS2)
   05160 Hepatitis C
    452897 (SOS2)
   05163 Human cytomegalovirus infection
    452897 (SOS2)
   05165 Human papillomavirus infection
    452897 (SOS2)
  09176 Drug resistance: antineoplastic
   01521 EGFR tyrosine kinase inhibitor resistance
    452897 (SOS2)
   01522 Endocrine resistance
    452897 (SOS2)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   04990 Domain-containing proteins not elsewhere classified [BR:ptr04990]
    452897 (SOS2)
Domain-containing proteins not elsewhere classified [BR:ptr04990]
 Pleckstrin homology (PH) domain-containing proteins
  Dbl-Like RhoGEF family proteins
   452897 (SOS2)
SSDB
Motif
Pfam: RasGEF RasGEF_N RhoGEF PH Histone IQ_SEC7_PH PH_19 PH_13
Other DBs
NCBI-GeneID: 452897
NCBI-ProteinID: XP_016781538
Ensembl: ENSPTRG00000006328
VGNC: 8216
LinkDB
Position
14
AA seq 1332 aa
MQQAPQPYEFFSEENSPKWRGLLVSALRKVQEQVHPTLSANEESLYYIEELIFQLLNKLC
MAQPRTVQDVEERVQKTFPHPIDKWAIADAQSAIEKRKRRNPLLLPVDKIHPSLKEVLGY
KVDYHVSLYIVAVLEYISADILKLAGNYVFNIRHYEISQQDIKVSMCADKVLMDMFDQDD
IGLVSLCEDEPSSSGELNYYDLVRTEIAEERQYLRELNMIIKVFREAFLSDRKLFKPSDI
EKIFSNISDIHELTVKLLGLIEDTVEMTDESSPHPLAGSCFEDLAEEQAFDPYETLSQDI
LSPEFHEHFNKLMARPAVALHFQSIADGFKEAVRYVLPRLMLVPVYHCWHYFELLKQLKA
CSEEQEDRECLNQAITALMNLQGSMDRIYKQYSPRRRPGDPVCPFYSHQLRSKHLAIKKM
NEIQKNIDGWEGKDIGQCCNEFIMEGPLTRIGAKHERHIFLFDGLMISCKPNHGQTRLPG
YSSAEYRLKEKFVMRKIQICDKEDTCEYKHAFELVSKDENSIIFAAKSAEEKNNWMAALI
SLHYRSTLDRMLDSVLLKEENEQPLRLPSPEVYRFVVKDSEENIVFEDNLQSRSGIPIIK
GGTVVKLIERLTYHMYADPNFVRTFLTTYRSFCKPQELLSLLIERFEIPEPEPTDADKLA
IEKGEQPISADLKRFRKEYVQPVQLRILNVFRHWVEHHFYDFERDLELLERLESFISSVR
GKAMKKWVESIAKIIRRKKQAQANGISHNITFESPPPPIEWHISKPGQFETFDLMTLHPI
EIARQLTLLESDLYRKVQPSELVGSVWTKEDKEINSPNLLKMIRHTTNLTLWFEKCIVEA
ENFEERVAVLSRIIEILQVFQDLNNFNGVLEIVSAVNSVSVYRLDHTFEALQERKRKILD
EAVELSQDHFKKYLVKLKSINPPCVPFFGIYLTNILKTEEGNNDFLKKKGKDLINFSKRR
KVAEITGEIQQYQNQPYCLRIEPDMRRFFENLNPMGSASEKEFTDYLFNKSLEIEPRNCK
QPPRFPRKSTFSLKSPGIRPNTGRHGSTSGTLRGHPTPLEREPCKISFSRIAETELESTV
SAPTSPNTPSTPPVSASSDLSVFLDVDLNSSCGSNSIFAPVLLPHSKSFFSSCGSLHKLS
EEPLIPPPLPPRKKFDHDASNSKGNMKSDDDPPAIPPRQPPPPKVKPRVPVPTGAFDGPL
HSPPPPPPRDPLPDTPPPVPLRPPEHFINCPFNLQPPPLGHLHRDSDWLRDISTCPNSPS
TPPSTPSPRVPRRCYVLSSSQNNLAHPPAPPVPPRQNSSPHLPKLPPKTYKRELSHPPLY
RLPLLENAETPQ
NT seq 3999 nt   +upstreamnt  +downstreamnt
atgcagcaggcgccgcagccttacgagttcttcagcgaggagaacagtccgaaatggcgg
ggactgttggtctcggccctgcggaaggttcaggaacaagtgcatcccactctctcagct
aatgaagagtctctctattatattgaagagctgatttttcagctgcttaataaattatgc
atggcccagccaaggactgttcaagatgtagaggagcgagttcagaagacctttcctcac
ccaattgataaatgggccattgctgatgcacaatctgctatagaaaaacgaaaacgaaga
aatcctcttttactgcctgtggacaaaatccatccttcgttgaaggaagtattagggtac
aaagtggactaccatgtgtccctatatattgtggctgtactagagtatatctcagctgat
attttaaaattggctggtaattatgtttttaatatccggcattatgaaatatctcagcag
gacattaaagtgtcaatgtgtgcggataaggttttgatggacatgtttgatcaggatgac
ataggtttggtttctctctgtgaagatgaacctagttcttctggtgaattaaactactat
gatcttgtcagaactgaaatcgcagaagaaagacagtatctacgggaattaaatatgatc
ataaaagtgtttcgagaagcctttctttctgatagaaagctgtttaaaccttctgatatc
gaaaagatttttagtaacatttcagatatacatgaattgactgtgaaacttttaggtttg
attgaagacacagttgaaatgactgatgaaagcagtcctcatcccttagctggcagctgt
tttgaagatttggcagaagagcaagcatttgatccttatgaaacattatcacaggacatt
ctttcaccagagtttcatgaacatttcaataaattaatggccagacctgcagttgctcta
cactttcagtccattgctgatggttttaaagaggcagttcgttatgtccttccacgtctt
atgctggtgccagtgtatcactgttggcactactttgagttactaaagcaattgaaagca
tgtagtgaagaacaagaagacagagaatgtttgaaccaagctattactgctctcatgaat
ctccaaggtagcatggaccgaatttacaagcagtattcacctagacgtcgacctggagat
cctgtttgccctttttatagtcaccaattaagaagcaaacacctggctatcaaaaaaatg
aatgaaattcagaaaaatattgatggatgggaaggcaaagatattggacagtgttgtaat
gaattcattatggagggaccattgacaagaatcggtgccaaacatgaacggcatattttt
ctgtttgatggcttaatgatcagttgtaaacctaatcatggccagactcggcttccaggt
tacagtagtgcagaatacaggttaaaagaaaaatttgtcatgaggaaaatacaaatttgt
gataaagaagatacttgtgagtacaagcatgcatttgaattagtatccaaagatgagaac
agcataatatttgctgctaagtctgctgaagaaaaaaacaactggatggcagcccttatt
tctcttcattatcgtagtactctagatcgaatgttagattcagtattattgaaagaagaa
aatgagcaaccactgagattaccaagtcctgaagtatatcgttttgtagtaaaagactct
gaggaaaacattgtttttgaagacaacttgcaaagtagaagtggcatccccattattaaa
ggaggaactgtagtgaaattaattgaaaggttaacatatcatatgtatgcagatcccaat
tttgttcgtacttttcttaccacatatcgttcattttgtaaaccacaggaattgctgagc
ttactgattgaacggtttgaaattccagagccagaacctactgatgcagacaaattggca
atagagaaaggcgagcagccaatcagtgcagaccttaaaagatttcgcaaggaatatgtc
caaccagtacaacttaggatcttaaatgtatttcggcattgggttgaacatcatttttat
gactttgaaagagacttggaattgcttgaaagactagaatccttcatttcaagtgtaaga
gggaaagctatgaaaaaatgggtagagtcaatcgctaagatcatcaggaggaagaagcaa
gctcaggcaaatggaataagccataatattacctttgaaagtccacctccaccaattgaa
tggcatatcagcaaaccaggacagtttgaaacatttgatctcatgacacttcatccaata
gaaattgcacgtcagctgacacttttggagtctgatctctacaggaaagttcaaccgtct
gaacttgtagggagtgtgtggaccaaagaagataaagaaataaattctccaaatttatta
aaaatgattcgccataccacaaatctcaccctctggtttgaaaaatgcattgtggaagca
gaaaattttgaagaacgggtggcagtactaagtagaattatagaaattctgcaagttttt
caagatttgaataatttcaatggcgtattggagatagtcagtgcagtaaattcagtatca
gtatacagactagaccatacctttgaggcattgcaggaaagaaaaaggaaaattttggac
gaagctgtggaattaagtcaagatcactttaaaaaatacctagtaaaacttaagtcaatc
aatccaccttgtgtgcctttttttggaatatatttaacaaatattctgaagaccgaagaa
gggaataatgattttttaaaaaagaaagggaaagatttaatcaatttcagtaagaggagg
aaagtagctgaaattactggagaaattcagcagtatcagaatcagccttactgtttacgg
atagaaccagatatgaggagattctttgaaaaccttaacccaatgggaagtgcatctgaa
aaagagtttacagattatttgttcaacaagtcactagaaattgaacctcgaaactgcaaa
cagccacctcgatttcctaggaaatcaactttttccttaaaatctcctggaataaggcct
aacacaggccgacatggctctacctcaggtactttacgaggtcacccaacaccattagaa
agagaaccatgtaaaataagctttagtcggattgctgaaactgagctggaatcaacagtg
tcagcaccaacctctccaaatacaccatctactccaccagtatctgcttcttcagacctt
agtgtatttttagatgtggatctcaacagctcctgtggcagcaatagcatcttcgctcca
gtgcttttgccacattcaaagtctttctttagttcatgtggtagtttacataaactaagt
gaagagcccctgattcctcctcctcttcctcctcgaaaaaagtttgatcatgatgcttca
aattccaagggaaatatgaaatctgatgatgatcctcctgctattccaccgagacagcct
cctcctccaaaggtaaaacccagagttcctgttcctactggtgcatttgatgggcctctg
catagtccacctccgccaccaccaagagatcctcttcctgatacccctccaccagttccc
cttcggcctccagaacactttataaactgtccatttaatcttcagccacctccactgggg
catcttcacagagattcagactggctcagagacattagtacgtgtccaaattcgccaagc
actcctcctagcacaccctctccaagggtaccgcgtcgatgctatgtgctcagttctagt
cagaataatcttgctcatcctccagctccccctgttccaccaaggcagaattcaagccct
catctgccaaaactgccaccaaagacttacaaacgggagctttcgcaccccccattgtac
agactgcctttgctagaaaatgcagaaactccccaatga

KEGG   Pan troglodytes (chimpanzee): 459171
Entry
459171            CDS       T01005                                 

Gene name
SOS1
Definition
(RefSeq) son of sevenless homolog 1 isoform X2
  KO
K03099  son of sevenless
Organism
ptr  Pan troglodytes (chimpanzee)
Pathway
ptr01521  EGFR tyrosine kinase inhibitor resistance
ptr01522  Endocrine resistance
ptr04010  MAPK signaling pathway
ptr04012  ErbB signaling pathway
ptr04014  Ras signaling pathway
ptr04062  Chemokine signaling pathway
ptr04068  FoxO signaling pathway
ptr04072  Phospholipase D signaling pathway
ptr04150  mTOR signaling pathway
ptr04151  PI3K-Akt signaling pathway
ptr04510  Focal adhesion
ptr04540  Gap junction
ptr04630  JAK-STAT signaling pathway
ptr04650  Natural killer cell mediated cytotoxicity
ptr04660  T cell receptor signaling pathway
ptr04662  B cell receptor signaling pathway
ptr04664  Fc epsilon RI signaling pathway
ptr04714  Thermogenesis
ptr04722  Neurotrophin signaling pathway
ptr04810  Regulation of actin cytoskeleton
ptr04910  Insulin signaling pathway
ptr04912  GnRH signaling pathway
ptr04915  Estrogen signaling pathway
ptr04917  Prolactin signaling pathway
ptr04926  Relaxin signaling pathway
ptr04935  Growth hormone synthesis, secretion and action
ptr05034  Alcoholism
ptr05160  Hepatitis C
ptr05161  Hepatitis B
ptr05163  Human cytomegalovirus infection
ptr05165  Human papillomavirus infection
ptr05200  Pathways in cancer
ptr05205  Proteoglycans in cancer
ptr05206  MicroRNAs in cancer
ptr05210  Colorectal cancer
ptr05211  Renal cell carcinoma
ptr05213  Endometrial cancer
ptr05214  Glioma
ptr05215  Prostate cancer
ptr05220  Chronic myeloid leukemia
ptr05221  Acute myeloid leukemia
ptr05223  Non-small cell lung cancer
ptr05224  Breast cancer
ptr05225  Hepatocellular carcinoma
ptr05226  Gastric cancer
ptr05231  Choline metabolism in cancer
Brite
KEGG Orthology (KO) [BR:ptr00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04010 MAPK signaling pathway
    459171 (SOS1)
   04012 ErbB signaling pathway
    459171 (SOS1)
   04014 Ras signaling pathway
    459171 (SOS1)
   04630 JAK-STAT signaling pathway
    459171 (SOS1)
   04068 FoxO signaling pathway
    459171 (SOS1)
   04072 Phospholipase D signaling pathway
    459171 (SOS1)
   04151 PI3K-Akt signaling pathway
    459171 (SOS1)
   04150 mTOR signaling pathway
    459171 (SOS1)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    459171 (SOS1)
   04540 Gap junction
    459171 (SOS1)
  09142 Cell motility
   04810 Regulation of actin cytoskeleton
    459171 (SOS1)
 09150 Organismal Systems
  09151 Immune system
   04650 Natural killer cell mediated cytotoxicity
    459171 (SOS1)
   04660 T cell receptor signaling pathway
    459171 (SOS1)
   04662 B cell receptor signaling pathway
    459171 (SOS1)
   04664 Fc epsilon RI signaling pathway
    459171 (SOS1)
   04062 Chemokine signaling pathway
    459171 (SOS1)
  09152 Endocrine system
   04910 Insulin signaling pathway
    459171 (SOS1)
   04912 GnRH signaling pathway
    459171 (SOS1)
   04915 Estrogen signaling pathway
    459171 (SOS1)
   04917 Prolactin signaling pathway
    459171 (SOS1)
   04926 Relaxin signaling pathway
    459171 (SOS1)
   04935 Growth hormone synthesis, secretion and action
    459171 (SOS1)
  09156 Nervous system
   04722 Neurotrophin signaling pathway
    459171 (SOS1)
  09159 Environmental adaptation
   04714 Thermogenesis
    459171 (SOS1)
 09160 Human Diseases
  09161 Cancer: overview
   05200 Pathways in cancer
    459171 (SOS1)
   05206 MicroRNAs in cancer
    459171 (SOS1)
   05205 Proteoglycans in cancer
    459171 (SOS1)
   05231 Choline metabolism in cancer
    459171 (SOS1)
  09162 Cancer: specific types
   05210 Colorectal cancer
    459171 (SOS1)
   05225 Hepatocellular carcinoma
    459171 (SOS1)
   05226 Gastric cancer
    459171 (SOS1)
   05214 Glioma
    459171 (SOS1)
   05221 Acute myeloid leukemia
    459171 (SOS1)
   05220 Chronic myeloid leukemia
    459171 (SOS1)
   05211 Renal cell carcinoma
    459171 (SOS1)
   05215 Prostate cancer
    459171 (SOS1)
   05213 Endometrial cancer
    459171 (SOS1)
   05224 Breast cancer
    459171 (SOS1)
   05223 Non-small cell lung cancer
    459171 (SOS1)
  09165 Substance dependence
   05034 Alcoholism
    459171 (SOS1)
  09172 Infectious disease: viral
   05161 Hepatitis B
    459171 (SOS1)
   05160 Hepatitis C
    459171 (SOS1)
   05163 Human cytomegalovirus infection
    459171 (SOS1)
   05165 Human papillomavirus infection
    459171 (SOS1)
  09176 Drug resistance: antineoplastic
   01521 EGFR tyrosine kinase inhibitor resistance
    459171 (SOS1)
   01522 Endocrine resistance
    459171 (SOS1)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   04990 Domain-containing proteins not elsewhere classified [BR:ptr04990]
    459171 (SOS1)
Domain-containing proteins not elsewhere classified [BR:ptr04990]
 Pleckstrin homology (PH) domain-containing proteins
  Dbl-Like RhoGEF family proteins
   459171 (SOS1)
SSDB
Motif
Pfam: RasGEF RasGEF_N RhoGEF Histone PH PH_19 PH_10 IQ_SEC7_PH PH_13
Other DBs
NCBI-GeneID: 459171
NCBI-ProteinID: XP_009440628
Ensembl: ENSPTRG00000011857
VGNC: 6658
UniProt: A0A6D2YAU4 A0A2I3T5C9
LinkDB
Position
2A
AA seq 1318 aa
MQAQQLPYEFFSEENAPKWRGLLVPALKKVQGQVHPTLESNDDALQYVEELILQLLNMLC
QAQPRSASDVEERVQKSFPHPIDKWAIADAQSAIEKRKRRNPLSLPVEKIHPLLKEVLGY
KIDHQVSVYIVAVLEYISADILKLVGNYVRNIRHYEITKQDIKVAMCADKVLMDMFHQDV
EDINILSLTDEEPSTSGEQTYYDLVKAFMAEIRQYIRELNLIIKVFREPFVSNSKLFSAN
DVENIFSRIVDIHELSVKLLGHIEDTVEMTDEGSPHPLVGSCFEDLAEELAFDPYESYAR
DILRPGFHDRFLSQLSKPGAALYLQSIGEGFKEAVQYVLPRLLLAPVYHCLHYFELLKQL
EEKSEDQEDKECLKQAITALLNVQSGMEKICSKSLAKRRLSESACRFYSQQMKGKQLAIK
KMNEIQKNIDGWEGKDIGQCCNEFIMEGTLTRVGAKHERHIFLFDGLMICCKSNHGQPRL
PGASNAEYRLKEKFFMRKVQINDKDDTNEYKHAFEIILKDENSVIFSAKSAEEKNNWMAA
LISLQYRSTLERMLDVTMLQEEKEEQMRLPSADVYRFAEPDSEENIIFEENMQPKAGIPI
IKAGTVIKLIERLTYHMYADPNFVRTFLTTYRSFCKPQELLSLIIERFEIPEPEPTEADR
IAIENGDQPLSAELKRFRKEYIQPVQLRVLNVCRHWVEHHFYDFERDAYLLQRMEEFIGT
VRGKAMKKWVESITKIIQRKKIARDNGPGHNITFQSSPPTVEWHISRPGHIETFDLLTLH
PIEIARQLTLLESDLYRAVQPSELVGSVWTKEDKEINSPNLLKMIRHTTNLTLWFEKCIV
ETENLEERVAVVSRIIEILQVFQELNNFNGVLEVVSAMNSSPVYRLDHTFEQIPSRQKKI
LEEAHELSEDHYKKYLAKLRSINPPCVPFFGIYLTNILKTEEGNPEVLKRHGKELINFSK
RRKVAEITGEIQQYQNQPYCLRVESDIKRFFENLNPMGNSMEKEFTDYLFNKSLEIEPRN
PKPLPRFPKKYSYPLKSPGVRPSNPRPGTMRHPTPLQQEPRKISYSRIPESETESTASAP
NSPRTPLTPPPASGASSTTDVCSVFDSDHSSPFHSRSASVSSISLTKGTDEVPVPPPVPP
RRRPESAPAESSPSKIMSKHLDSPPAIPPRQPTSKAYSPRYSLSDRTSISDPPESPPLLP
PREPVRTPDVFSSSPLHLQPPPLGKKSDHGNTFFPNSPSPFTPPPPQTPSPHGTRRHLPS
PPLTQEVDLHSIAGPPVPPRQSTSQHIPKLPPKTYKREHTHPSMHRDGPPLLENAHSS
NT seq 3957 nt   +upstreamnt  +downstreamnt
atgcaggcgcagcagctgccctacgagtttttcagcgaagagaacgcgcccaagtggcgg
ggactgctggtgcctgcgctgaaaaaggtccaggggcaagttcatcctactctcgagtct
aatgatgatgctcttcagtatgttgaagaattaattttgcaattattaaatatgctatgc
caagctcagccccgaagtgcttcagatgtagaggaacgtgttcaaaaaagtttccctcat
ccaattgataaatgggcaatagctgatgcccaatcagctattgaaaagaggaagcgaaga
aaccctttatctctcccagtagaaaaaattcatcctttattaaaggaggtcctaggttat
aaaattgaccaccaggtttctgtttacatagtagcagtcttagaatacatttctgcagac
attttaaagctggttgggaattatgtaagaaatatacggcattatgaaattacaaaacaa
gatattaaagtggcaatgtgtgctgacaaggtattgatggatatgtttcatcaagatgta
gaagatattaatatattatctttaactgatgaagagccttccacctcaggagaacaaact
tactatgatttggtaaaagcatttatggcagaaattcgacaatatataagggaactaaat
ctaattataaaagtttttagagagccctttgtctccaattcaaaattgttttcagctaat
gatgtagaaaatatatttagtcgcatagtagatatacatgaacttagtgtaaagttactg
ggccatatagaagatacggtagaaatgacagatgaaggcagtccccatccactagtagga
agctgctttgaagacttagcagaggaactggcatttgatccatatgaatcgtatgctcga
gatattttgcgacctggttttcatgatcgtttccttagtcagttatcaaagcctggggca
gcactttatttgcagtcaataggcgaaggtttcaaagaagctgttcaatatgttttaccc
aggctgcttctggcccctgtttaccactgtctccattactttgaacttttgaagcagtta
gaagaaaaaagtgaagatcaagaagacaaggaatgtttaaaacaagcaataacagctttg
cttaatgttcagagtggtatggaaaaaatatgttctaaaagtcttgcaaaacgaagactg
agtgaatctgcatgtcggttttatagtcagcaaatgaaggggaaacaactagcaatcaag
aaaatgaacgagattcagaagaatattgatggttgggagggaaaagacattggacagtgt
tgtaatgaatttataatggaaggaactcttacacgtgtaggagccaaacacgagagacac
atatttctctttgatggcttaatgatttgctgtaaatcaaatcacgggcagccaagactt
cctggtgctagcaatgcagaatatcgtcttaaagaaaagttttttatgcgaaaggtacaa
attaatgataaagatgacaccaatgaatacaagcatgcttttgaaataattttaaaagat
gaaaatagtgttatattttctgccaagtcagctgaagagaaaaacaattggatggcagca
ttgatatctttacagtaccggagtacactggaaaggatgcttgatgtaacaatgctacag
gaagagaaagaggagcagatgaggctgcctagtgctgatgtttatagatttgcagagcct
gactctgaagagaatattatatttgaagagaacatgcagcccaaggctggaattccaatt
atcaaagcaggaactgttattaaacttatagagaggcttacgtaccatatgtacgcagat
cccaattttgttcggacatttcttacaacatacagatccttttgcaaacctcaagaacta
ctgagtcttataatagaaaggtttgaaattccagagcctgagccaacagaagctgatcgc
atagctatagagaatggagatcaacccttgagtgcagaactgaaaagatttagaaaagaa
tatatacagcctgtgcaactgcgagtattaaatgtatgtcggcactgggtagagcaccac
ttctatgattttgaaagagatgcatatcttttgcaacgaatggaagaatttattggaaca
gtaagaggtaaagcaatgaaaaaatgggttgaatccatcactaaaataatccaaaggaaa
aaaattgcaagagacaatggaccaggtcataatattacatttcagagttcacctcccaca
gttgagtggcatataagcagacctgggcacatagagacttttgacctgctcaccttacac
ccaatagaaattgctcgacaactcactttacttgaatcagatctataccgagctgtacag
ccatcagaattagttggaagtgtgtggacaaaagaagacaaagaaattaactctcctaat
cttctgaaaatgattcgacataccaccaacctcactctgtggtttgagaaatgtattgta
gaaactgaaaatttagaagaaagagtagctgtggtgagtcgaattattgagattctacaa
gtctttcaagagttgaacaactttaatggtgtccttgaggttgtcagtgctatgaattca
tcacctgtttacagactagaccacacatttgagcaaataccaagtcgccagaagaaaatt
ttagaagaagctcatgaattgagtgaagatcactataagaaatatttggcaaaactcagg
tctattaatccaccatgtgtgcctttctttggaatttatctcactaatatcttgaaaaca
gaagaaggcaaccctgaggtcctaaaaagacatggaaaagagcttataaactttagcaaa
aggaggaaagtagcagaaataacaggagagatccagcagtaccaaaatcagccttactgt
ttacgagtagaatcagatatcaaaaggttctttgaaaacttgaatccgatgggaaatagc
atggagaaggaatttacagattatcttttcaacaaatccctagaaatagaaccacgaaac
cctaagcctctcccaagatttccaaaaaaatatagctatcccctaaaatctcctggtgtt
cgtccatcaaacccaagaccaggtaccatgaggcatcccacacctctgcagcaggagcca
aggaaaattagttatagtaggatccctgaaagtgaaacagaaagtacagcatctgcacca
aattctccaagaacaccgttaacacctccgcctgcttctggtgcttccagtaccacagat
gtttgcagtgtatttgattccgatcattcgagcccttttcactcaagatctgcttctgta
tcatctataagtttaaccaaaggcactgatgaagtgcctgtccctcctcctgttcctcca
cgaagacgaccagaatctgccccagcagaatcttcaccatctaagattatgtctaagcat
ttggacagtcccccagccattcctcctaggcaacccacatcaaaagcctattcaccacga
tattcactatcagaccggacctctatctcagaccctcctgaaagccctcccttattacca
ccacgagaacctgtgaggacacctgatgttttctcaagctcaccactacatctccaacct
ccccctttgggcaaaaaaagtgaccatggcaataccttcttcccaaacagcccttccccc
tttacaccacctcctcctcaaacaccttctcctcacggcacaagaaggcatctgccatca
ccaccattgacacaagaagtggaccttcattccattgctgggccgcctgttcctccacga
caaagcacttctcaacatatccctaaactccctccaaaaacttacaaaagagagcacaca
cacccatccatgcacagagatggaccaccactgttggagaatgcccattcttcctga

DBGET integrated database retrieval system