KEGG   Sus scrofa (pig): 100156602
Entry
100156602         CDS       T01009                                 

Gene name
SOS2
Definition
(RefSeq) son of sevenless homolog 2
  KO
K03099  son of sevenless
Organism
ssc  Sus scrofa (pig)
Pathway
ssc01521  EGFR tyrosine kinase inhibitor resistance
ssc01522  Endocrine resistance
ssc04010  MAPK signaling pathway
ssc04012  ErbB signaling pathway
ssc04014  Ras signaling pathway
ssc04062  Chemokine signaling pathway
ssc04068  FoxO signaling pathway
ssc04072  Phospholipase D signaling pathway
ssc04150  mTOR signaling pathway
ssc04151  PI3K-Akt signaling pathway
ssc04510  Focal adhesion
ssc04540  Gap junction
ssc04630  JAK-STAT signaling pathway
ssc04650  Natural killer cell mediated cytotoxicity
ssc04660  T cell receptor signaling pathway
ssc04662  B cell receptor signaling pathway
ssc04664  Fc epsilon RI signaling pathway
ssc04714  Thermogenesis
ssc04722  Neurotrophin signaling pathway
ssc04810  Regulation of actin cytoskeleton
ssc04910  Insulin signaling pathway
ssc04912  GnRH signaling pathway
ssc04915  Estrogen signaling pathway
ssc04917  Prolactin signaling pathway
ssc04926  Relaxin signaling pathway
ssc04935  Growth hormone synthesis, secretion and action
ssc05034  Alcoholism
ssc05160  Hepatitis C
ssc05161  Hepatitis B
ssc05163  Human cytomegalovirus infection
ssc05165  Human papillomavirus infection
ssc05200  Pathways in cancer
ssc05205  Proteoglycans in cancer
ssc05206  MicroRNAs in cancer
ssc05210  Colorectal cancer
ssc05211  Renal cell carcinoma
ssc05213  Endometrial cancer
ssc05214  Glioma
ssc05215  Prostate cancer
ssc05220  Chronic myeloid leukemia
ssc05221  Acute myeloid leukemia
ssc05223  Non-small cell lung cancer
ssc05224  Breast cancer
ssc05225  Hepatocellular carcinoma
ssc05226  Gastric cancer
ssc05231  Choline metabolism in cancer
Brite
KEGG Orthology (KO) [BR:ssc00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04010 MAPK signaling pathway
    100156602 (SOS2)
   04012 ErbB signaling pathway
    100156602 (SOS2)
   04014 Ras signaling pathway
    100156602 (SOS2)
   04630 JAK-STAT signaling pathway
    100156602 (SOS2)
   04068 FoxO signaling pathway
    100156602 (SOS2)
   04072 Phospholipase D signaling pathway
    100156602 (SOS2)
   04151 PI3K-Akt signaling pathway
    100156602 (SOS2)
   04150 mTOR signaling pathway
    100156602 (SOS2)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    100156602 (SOS2)
   04540 Gap junction
    100156602 (SOS2)
  09142 Cell motility
   04810 Regulation of actin cytoskeleton
    100156602 (SOS2)
 09150 Organismal Systems
  09151 Immune system
   04650 Natural killer cell mediated cytotoxicity
    100156602 (SOS2)
   04660 T cell receptor signaling pathway
    100156602 (SOS2)
   04662 B cell receptor signaling pathway
    100156602 (SOS2)
   04664 Fc epsilon RI signaling pathway
    100156602 (SOS2)
   04062 Chemokine signaling pathway
    100156602 (SOS2)
  09152 Endocrine system
   04910 Insulin signaling pathway
    100156602 (SOS2)
   04912 GnRH signaling pathway
    100156602 (SOS2)
   04915 Estrogen signaling pathway
    100156602 (SOS2)
   04917 Prolactin signaling pathway
    100156602 (SOS2)
   04926 Relaxin signaling pathway
    100156602 (SOS2)
   04935 Growth hormone synthesis, secretion and action
    100156602 (SOS2)
  09156 Nervous system
   04722 Neurotrophin signaling pathway
    100156602 (SOS2)
  09159 Environmental adaptation
   04714 Thermogenesis
    100156602 (SOS2)
 09160 Human Diseases
  09161 Cancer: overview
   05200 Pathways in cancer
    100156602 (SOS2)
   05206 MicroRNAs in cancer
    100156602 (SOS2)
   05205 Proteoglycans in cancer
    100156602 (SOS2)
   05231 Choline metabolism in cancer
    100156602 (SOS2)
  09162 Cancer: specific types
   05210 Colorectal cancer
    100156602 (SOS2)
   05225 Hepatocellular carcinoma
    100156602 (SOS2)
   05226 Gastric cancer
    100156602 (SOS2)
   05214 Glioma
    100156602 (SOS2)
   05221 Acute myeloid leukemia
    100156602 (SOS2)
   05220 Chronic myeloid leukemia
    100156602 (SOS2)
   05211 Renal cell carcinoma
    100156602 (SOS2)
   05215 Prostate cancer
    100156602 (SOS2)
   05213 Endometrial cancer
    100156602 (SOS2)
   05224 Breast cancer
    100156602 (SOS2)
   05223 Non-small cell lung cancer
    100156602 (SOS2)
  09165 Substance dependence
   05034 Alcoholism
    100156602 (SOS2)
  09172 Infectious disease: viral
   05161 Hepatitis B
    100156602 (SOS2)
   05160 Hepatitis C
    100156602 (SOS2)
   05163 Human cytomegalovirus infection
    100156602 (SOS2)
   05165 Human papillomavirus infection
    100156602 (SOS2)
  09176 Drug resistance: antineoplastic
   01521 EGFR tyrosine kinase inhibitor resistance
    100156602 (SOS2)
   01522 Endocrine resistance
    100156602 (SOS2)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   04990 Domain-containing proteins not elsewhere classified [BR:ssc04990]
    100156602 (SOS2)
Domain-containing proteins not elsewhere classified [BR:ssc04990]
 Pleckstrin homology (PH) domain-containing proteins
  Dbl-Like RhoGEF family proteins
   100156602 (SOS2)
SSDB
Motif
Pfam: RasGEF RasGEF_N RhoGEF PH Histone IQ_SEC7_PH PH_19 PH_13
Other DBs
NCBI-GeneID: 100156602
NCBI-ProteinID: XP_020957098
Ensembl: ENSSSCG00000005015
LinkDB
Position
1
AA seq 1332 aa
MQQAPQPYEFFSEENSPKWRGLLVSALRKVQEQVHPNLSANEESLYYIEELIFQLLNKLC
LAQPRTVQDVEERVQKTFPHPIDKWAIADAQSAIEKRKRRNPLLLPVDKIHPSLKEVLGY
KVDYHVSLYIVAVLEYISADILKLAGNYVFNIRHYEISQQDIKVSMCADKVLMDMFDQDD
IGLVSLCEDEPSSSGELNYYDLVRTEIAEERQYLRELNMIIKVFREAFLSDRKLFKPSDI
EKIFSNILDIHELTVKLLGLVEDTVEMTDESSPHPLAGSCFEDLAEEQAFDPYETLSQDI
LSPEFNEHFSKLMARPAVALHFQSIADGFKEAVRYVLPRLMLVPVYHCWHYFELLKQLKA
CSEEQEDRECLNQAITALLNLQGSMDRIYKQHSPRRRPGDPVCPFYNRQLRSKHLAIKKM
NEIQKNIDGWEGKDIGQCCNEFIMEGPLTRVGAKHERHIFLFDGLMISCKPNHSQSRLPG
YSNAEYRLKEKFVMRKIQICDKEDTCEYKHAFELVSKDENSIIFAAKSAEEKSNWMAALI
SLHYRSTLDRMLDSVLLKEENEQPLRLPSPEVYRFVVKDSEENIVFEDNLQSRSGIPIIK
GGTVVKLIERLTYHMYADPNFVRTFLTTYRSFCKPQELLSLLIERFEIPEPEPTEADKLA
VEKGEQPISADLKRFRKEYVQPVQLRILNVFRHWVEHHFYDFERDLELLERLESFISSVR
GKAMKKWVESIAKIIKRKKQAQANGISHNITFESPPPPIEWHISRPGQFETFDLMTLHPI
EIARQLTLLESDLYRKVQPSELVGSVWTKEDKEINSPNLLKMIRHTTNLTLWFEKCIVEA
ENFEERVAVLSRIIEILQVFQDLNNFNGVLEIVSAVNSVSVYRLDHTFEALQERKRKILD
EAVELSQDHFKKYLVKLKSINPPCVPFFGIYLTNILKTEEGNNDFLKKKGKDLINFSKRR
KVAEITGEIQQYQNQPYCLRIEPEMRRFFENLNPMGNSSEKEFTDYLFNKSLEIEPRNCK
QPPRFPRKSTFSLKSPGIRPNTGRHGSTSGTLRGHPTPLEREPCKISFSRIAETDLESTV
SAPTSPNTPSTPPVSASSDLSVFLDVDLNSSCGSNSIFAPVLLPHSKSFFSSCGSLHKLS
EEPLIPPPLPPRKKFDHDASNSKGNMKSDDDPPAIPPRQPPPPKVKPRVPAPTGVFDGPL
PSPPLPPPRDPLPDTPPPVPLRPPEHFINCPFTLQPPPLGHLHRDPDWFRDVSMCPNSPN
TPPSTPSPRVPRRCYVLSSSQNNLAHPQAPPIPPRQNSSPHLPKLPPKTYKRELSHPPLY
RLPLLENAETPQ
NT seq 3999 nt   +upstreamnt  +downstreamnt
atgcagcaggcgccgcagccctacgagttcttcagcgaagagaacagtccgaaatggcgg
ggactcttagtctcggccctgcggaaggttcaggaacaagtacatcccaatctctcagca
aatgaagagtctctctattatattgaagagctgatttttcagctgcttaataaattatgc
ctggcccaaccaaggactgttcaagatgtggaggaacgagttcagaagacttttcctcat
ccaattgataaatgggctattgctgatgcacagtctgccatagagaaacgaaaacgaaga
aatcctctcttgctgcctgtggataaaatccatccttcattgaaggaagttttagggtac
aaggtggactaccatgtatccctgtatattgtggctgtactagagtatatctcagctgat
attttgaaattggctggtaattatgtttttaatatccgacattatgaaatatcccaacaa
gacattaaagtgtcaatgtgtgcagataaggtattgatggacatgtttgatcaggatgac
ataggcttggtttctctctgtgaagatgaacctagttcttcaggtgaattaaactactac
gaccttgtcagaactgaaattgcagaagaaaggcagtatctacgggaactaaatatgatc
ataaaagtgtttcgagaagcttttctttctgacagaaagctttttaaaccttctgatatt
gaaaagattttcagtaacattttagatatacatgaattgacggtgaaacttttaggtttg
gttgaagacacagttgaaatgactgatgaaagcagccctcatcccttagctggcagctgt
tttgaagatttggcagaagagcaagcatttgatccttatgaaacgttatcacaggatatt
ctttcaccagaatttaatgaacatttcagtaagttgatggccagacctgcagtggctcta
cactttcagtccattgctgatggctttaaagaggcagttcgctatgttcttccccgccta
atgttggtgccagtgtatcattgttggcactattttgaattattaaagcaattgaaagca
tgtagtgaagagcaggaagacagagagtgtttgaaccaagctataacagctctcttgaac
ctccaaggtagtatggaccgaatttacaagcagcattcacctagacgccgacctggggat
cctgtttgccctttttataatcgtcaattaagaagcaaacacctggctattaaaaaaatg
aatgaaattcagaaaaacatagatggctgggaaggcaaagatattggacagtgttgtaat
gaatttattatggaaggcccattgacaagagttggtgctaaacatgaacgacatattttt
ctttttgatggcttaatgattagttgcaaacccaatcatagccagtcacgccttccagga
tacagtaatgcagaatacagattaaaagaaaaatttgtcatgaggaaaatacaaatatgt
gataaagaagatacttgtgagtacaaacatgcttttgaattagtatccaaagatgaaaac
agcataatattcgctgctaagtctgctgaagagaaaagtaattggatggcagcacttatt
tctcttcattatcgtagtactctggatcgaatgctagattcagtattgttgaaggaagaa
aatgaacaaccactgagattaccaagtcctgaagtgtatcgttttgtggtaaaggactct
gaagaaaacattgtttttgaagacaacttgcaaagtagaagtggaatccccattattaaa
ggaggaactgtagtgaaattaattgaaaggttaacttatcatatgtatgcagatcccaat
tttgttcgtacttttcttactacgtatcgttcattttgtaaaccacaggaattgctaagc
ttactgattgaacgatttgaaattccagagccagaacctactgaagcagataaattggca
gtagagaaaggcgagcagcctatcagtgcagaccttaaaagatttcgcaaggaatacgtc
caaccagtacaacttaggatcttaaatgtgtttcggcactgggttgaacatcatttttat
gactttgaaagagatttggagttgcttgaaagactagaatccttcatttcaagtgtaaga
gggaaagctatgaagaaatgggtagagtcaattgctaagattatcaagaggaagaaacaa
gctcaggcaaatggaataagccataatattacctttgaaagcccacctcccccaattgaa
tggcatatcagtagaccaggacagtttgaaacatttgatctcatgacacttcatccaata
gaaattgcacgtcagctaacacttttggaatctgatctctacaggaaagtccaaccttct
gaacttgtagggagtgtatggaccaaagaagataaagaaataaattctccaaatttatta
aaaatgattcgtcacaccacaaatctcaccctctggtttgaaaagtgcattgtggaagca
gaaaattttgaggaacgggtggcagtactgagtagaattatagaaattctgcaagttttt
caagatttgaataatttcaatggtgtattggagatcgtcagtgcagtaaattcagtgtca
gtttatagactagatcatacctttgaggcattgcaggaaagaaaaaggaaaattttggat
gaagctgtggaattaagtcaagatcattttaaaaaatatctagtgaaacttaagtcaatc
aatccaccttgtgtgcctttttttggaatatatttaacaaatattctgaagactgaagaa
gggaataatgattttttaaaaaagaaagggaaagatttaatcaatttcagtaagaggagg
aaagtagctgaaattactggagaaattcagcagtatcagaatcaaccttactgtttacgg
atagaaccagaaatgaggaggttctttgaaaaccttaaccccatgggaaattcttctgaa
aaagagtttacagattatttgttcaacaagtcactagaaattgaaccccgaaattgcaaa
cagccacctcgatttcctaggaaatcaactttctctttaaaatctcctggaataaggcct
aatacaggccgacatggctctacctcaggtactttacgaggccatccaacaccattagaa
agagaaccatgtaaaataagctttagtcggattgctgaaacagaccttgaatcaacagtg
tcagcaccaacctctccaaacacaccatctactccaccagtatctgcttcttcagacctt
agtgtgtttttagatgtggatctcaacagttcctgtggaagcaatagcatctttgctcca
gtcctcttgccacattcaaagtctttcttcagttcatgtggtagtttacataaactaagt
gaagagccactgattcctcctccgcttcctcctcgaaaaaagtttgatcatgatgcttca
aattccaagggaaatatgaaatccgatgatgacccccctgctattccgccaagacaacct
cctcctccaaaggtaaaacccagagttcctgctcctactggtgtgtttgacggacctctg
cctagtccacctctaccaccgccaagagatcctcttcctgatacccctccaccggttccc
cttcggcctccagaacactttataaactgtccattcactctccagccacctccactggga
catcttcacagagatccagactggttcagagatgttagtatgtgtccaaattctccaaac
actcctccaagcacaccatctccaagagtaccacgtcgatgctatgtgctcagttctagt
caaaataatctcgctcatcctcaagccccccccattccaccaaggcagaattcaagccct
cacctaccaaaactgccaccaaagacttataaacgggagctttcgcaccccccattgtat
agactgcctttgctagaaaatgcagaaactcctcaatga

KEGG   Sus scrofa (pig): 100520187
Entry
100520187         CDS       T01009                                 

Gene name
SOS1
Definition
(RefSeq) son of sevenless homolog 1 isoform X1
  KO
K03099  son of sevenless
Organism
ssc  Sus scrofa (pig)
Pathway
ssc01521  EGFR tyrosine kinase inhibitor resistance
ssc01522  Endocrine resistance
ssc04010  MAPK signaling pathway
ssc04012  ErbB signaling pathway
ssc04014  Ras signaling pathway
ssc04062  Chemokine signaling pathway
ssc04068  FoxO signaling pathway
ssc04072  Phospholipase D signaling pathway
ssc04150  mTOR signaling pathway
ssc04151  PI3K-Akt signaling pathway
ssc04510  Focal adhesion
ssc04540  Gap junction
ssc04630  JAK-STAT signaling pathway
ssc04650  Natural killer cell mediated cytotoxicity
ssc04660  T cell receptor signaling pathway
ssc04662  B cell receptor signaling pathway
ssc04664  Fc epsilon RI signaling pathway
ssc04714  Thermogenesis
ssc04722  Neurotrophin signaling pathway
ssc04810  Regulation of actin cytoskeleton
ssc04910  Insulin signaling pathway
ssc04912  GnRH signaling pathway
ssc04915  Estrogen signaling pathway
ssc04917  Prolactin signaling pathway
ssc04926  Relaxin signaling pathway
ssc04935  Growth hormone synthesis, secretion and action
ssc05034  Alcoholism
ssc05160  Hepatitis C
ssc05161  Hepatitis B
ssc05163  Human cytomegalovirus infection
ssc05165  Human papillomavirus infection
ssc05200  Pathways in cancer
ssc05205  Proteoglycans in cancer
ssc05206  MicroRNAs in cancer
ssc05210  Colorectal cancer
ssc05211  Renal cell carcinoma
ssc05213  Endometrial cancer
ssc05214  Glioma
ssc05215  Prostate cancer
ssc05220  Chronic myeloid leukemia
ssc05221  Acute myeloid leukemia
ssc05223  Non-small cell lung cancer
ssc05224  Breast cancer
ssc05225  Hepatocellular carcinoma
ssc05226  Gastric cancer
ssc05231  Choline metabolism in cancer
Brite
KEGG Orthology (KO) [BR:ssc00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04010 MAPK signaling pathway
    100520187 (SOS1)
   04012 ErbB signaling pathway
    100520187 (SOS1)
   04014 Ras signaling pathway
    100520187 (SOS1)
   04630 JAK-STAT signaling pathway
    100520187 (SOS1)
   04068 FoxO signaling pathway
    100520187 (SOS1)
   04072 Phospholipase D signaling pathway
    100520187 (SOS1)
   04151 PI3K-Akt signaling pathway
    100520187 (SOS1)
   04150 mTOR signaling pathway
    100520187 (SOS1)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    100520187 (SOS1)
   04540 Gap junction
    100520187 (SOS1)
  09142 Cell motility
   04810 Regulation of actin cytoskeleton
    100520187 (SOS1)
 09150 Organismal Systems
  09151 Immune system
   04650 Natural killer cell mediated cytotoxicity
    100520187 (SOS1)
   04660 T cell receptor signaling pathway
    100520187 (SOS1)
   04662 B cell receptor signaling pathway
    100520187 (SOS1)
   04664 Fc epsilon RI signaling pathway
    100520187 (SOS1)
   04062 Chemokine signaling pathway
    100520187 (SOS1)
  09152 Endocrine system
   04910 Insulin signaling pathway
    100520187 (SOS1)
   04912 GnRH signaling pathway
    100520187 (SOS1)
   04915 Estrogen signaling pathway
    100520187 (SOS1)
   04917 Prolactin signaling pathway
    100520187 (SOS1)
   04926 Relaxin signaling pathway
    100520187 (SOS1)
   04935 Growth hormone synthesis, secretion and action
    100520187 (SOS1)
  09156 Nervous system
   04722 Neurotrophin signaling pathway
    100520187 (SOS1)
  09159 Environmental adaptation
   04714 Thermogenesis
    100520187 (SOS1)
 09160 Human Diseases
  09161 Cancer: overview
   05200 Pathways in cancer
    100520187 (SOS1)
   05206 MicroRNAs in cancer
    100520187 (SOS1)
   05205 Proteoglycans in cancer
    100520187 (SOS1)
   05231 Choline metabolism in cancer
    100520187 (SOS1)
  09162 Cancer: specific types
   05210 Colorectal cancer
    100520187 (SOS1)
   05225 Hepatocellular carcinoma
    100520187 (SOS1)
   05226 Gastric cancer
    100520187 (SOS1)
   05214 Glioma
    100520187 (SOS1)
   05221 Acute myeloid leukemia
    100520187 (SOS1)
   05220 Chronic myeloid leukemia
    100520187 (SOS1)
   05211 Renal cell carcinoma
    100520187 (SOS1)
   05215 Prostate cancer
    100520187 (SOS1)
   05213 Endometrial cancer
    100520187 (SOS1)
   05224 Breast cancer
    100520187 (SOS1)
   05223 Non-small cell lung cancer
    100520187 (SOS1)
  09165 Substance dependence
   05034 Alcoholism
    100520187 (SOS1)
  09172 Infectious disease: viral
   05161 Hepatitis B
    100520187 (SOS1)
   05160 Hepatitis C
    100520187 (SOS1)
   05163 Human cytomegalovirus infection
    100520187 (SOS1)
   05165 Human papillomavirus infection
    100520187 (SOS1)
  09176 Drug resistance: antineoplastic
   01521 EGFR tyrosine kinase inhibitor resistance
    100520187 (SOS1)
   01522 Endocrine resistance
    100520187 (SOS1)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   04990 Domain-containing proteins not elsewhere classified [BR:ssc04990]
    100520187 (SOS1)
Domain-containing proteins not elsewhere classified [BR:ssc04990]
 Pleckstrin homology (PH) domain-containing proteins
  Dbl-Like RhoGEF family proteins
   100520187 (SOS1)
SSDB
Motif
Pfam: RasGEF RasGEF_N RhoGEF Histone PH PH_19 PH_10 IQ_SEC7_PH PH_13
Other DBs
NCBI-GeneID: 100520187
NCBI-ProteinID: XP_020943245
Ensembl: ENSSSCG00000030405
LinkDB
Position
3
AA seq 1333 aa
MQAQQLPYEFFSEENAPKWRGLLVPALKKVQGQVHPTLESSDDALQYVEELILQLLNMLC
QAQPRSASDVEERVQKSFPHPIDKWAIADAQSAIEKRKRRNPLSLPVEKIHPLLKEVLGY
KIDHQVSVYIVAVLEYISADILKLVGNYVRNIRHYEITKQDIKVAMCADKVLMDMFHQDV
EDINILSLTDEEPSTSGEQTYYDLVKAFMAEIRQYIRELNLIIKVFREPFVSNSKLFSAN
DVENIFSRIVDIHELSVKLLGHIEDTVEMTDEGSPHPLVGSCFEDLAEELAFDPYESYAR
DILRPGFHDHFLSQLSKPGAALYLQSIGEGFKEAVQYVLPRLLLAPVYHCLHYFELLKQL
EEKSEDQEDKECLKQAITALLNVQSGMEKICSKSLAKRRLSESACRFYSQQMKGKQLAIK
KMNEIQKNIDGWEGKDIGQCCNEFIMEGTLTRVGAKHERHIFLFDGLMICCKSNHGQPRL
PGASNAEYRLKEKFFMRKVQINDKDDTSEYKHAFEIILKDENSVIFSAKSAEEKNNWMAA
LISLQYRSTLERMLDVTMLQEEKEEQMRLPSADVYRFAEPDSEENIIFEENMQPKAGIPI
IKAGTVIKLIERLTYHMYADPNFVRTFLTTYRSFCKPQELLSLIIERFEIPEPEPTEADR
IAIENGDQPLSAELKRFRKEYIQPVQLRVLNVCRHWVEHHFYDFERDADLLQRMEEFIGT
VRGKAMKKWVESITKIIQRKKIARDNGPGHNITFQSSPPTVEWHISRPGHIESFDLLTLH
PIEIARQLTLLESDLYRAVQPSELVGSVWTKEDKEINSPNLLKMIRHTTNLTLWFEKCIV
ETENLEERVAVVSRIIEILQVFQELNNFNGVLEVVSAMNSSPVYRLDHTFEQIPSRQKKI
LEEAHELSEDHYKKYLAKLRSINPPCVPFFGIYLTNILKTEEGNPEVLKRHGKELINFSK
RRKVAEITGEIQQYQNQPYCLRVESDIKRFFENLNPMGSSMEKEFTDYLFNKSLEIEPRN
PKPLPRFPKKYNYPLKSPGVRPSNPRPGTMRHPTPLQQEPRKISYSRIPESETESTASAP
NSPRTPLTPPPASGASSTTDVCSVFDSDHSSPFHSSSDTVFIQVTLPHGPRSASVSSISL
TKSTDEVPVPPPVPPRRRPESAPAESSPSKIMSKHLDSPPAIPPRQPTSKAYSPRYSISD
RTSISDPPESPPLLPPREPVRTPDVFSSSPLHLQPPPLGKKSDHSNAFFPNSPSPFTPPP
PQTPSPHGTRRHLPSPPLTQEVDLHSIAGPPVPPRQSTSQHIPKLPPKTYKREHTHPSMH
RDGPPLLENAHSS
NT seq 4002 nt   +upstreamnt  +downstreamnt
atgcaggcgcagcagctgccgtacgagtttttcagcgaggagaacgcgcccaagtggcgg
gggctgctggttcctgcgctgaaaaaggttcaggggcaagttcatcctactcttgagtct
agtgatgatgctcttcagtatgttgaagaattaattttgcaattattaaatatgctgtgc
caagctcagccccgaagtgcttcagatgtagaggaacgagttcaaaaaagtttccctcat
ccaattgataagtgggcaatagctgatgcccagtcggctattgaaaagaggaagcgaaga
aaccctttatctctcccagtagaaaaaattcatcctttgttaaaggaggtcctaggttat
aaaattgaccaccaggtttctgtttacatagtagcagtattagaatacatttctgcagac
attttaaagctggtggggaattatgtgcgaaatatacggcactatgaaattacaaaacaa
gatattaaagtggcaatgtgtgctgataaggtattgatggatatgtttcatcaagatgta
gaagatataaatatattgtctttaactgatgaagagccttccacctcaggagagcaaact
tattatgatttggtaaaagcatttatggcagaaattcgacaatacataagggaactaaat
ttaattataaaagtttttagagaaccctttgtctccaactcaaaattgttttcagctaat
gatgtagaaaatatatttagtcgtatagtagatatacatgaacttagtgtaaagttactg
ggccatatagaagatactgtagaaatgacagatgaaggtagtccccacccattagtagga
agctgctttgaagacttagcagaggaactggcatttgacccatatgaatcatatgctcga
gatattttacgacctggttttcatgatcatttccttagtcagttatcaaagcctggagcg
gcactttatttacagtcaataggcgaaggtttcaaagaagctgttcagtatgttttaccc
cggctgcttctagctcctgtttaccactgtctacattacttcgaacttttgaagcagtta
gaagagaagagtgaagatcaagaagacaaggaatgtttgaaacaagcaataacagctttg
cttaatgttcagagtggtatggaaaaaatatgttctaaaagtcttgcaaaacgaagactg
agtgaatctgcatgtcggttttatagtcaacaaatgaaggggaaacaactagcaatcaag
aaaatgaacgagattcagaagaatattgatggttgggagggaaaagacattggacagtgt
tgcaatgagtttataatggaaggaactcttacacgtgtaggagccaaacatgagagacac
atatttctctttgatggcttaatgatttgctgtaaatcaaatcatgggcagccaagactt
cctggtgctagcaatgcagaatatcgtcttaaagaaaagttttttatgagaaaggtacaa
attaatgacaaagatgacaccagtgagtacaagcatgcttttgaaataattttaaaagat
gaaaatagtgtaatattttctgccaagtcagctgaagagaaaaacaattggatggcagcg
ttgatatctttacagtaccggagtacactggaaaggatgcttgatgtgacaatgctacag
gaagaaaaggaggagcagatgaggcttcctagtgctgatgtttatagatttgcagagcct
gactctgaagaaaatatcatatttgaagaaaacatgcagcccaaagctggaattccaatt
atcaaagcaggaactgttattaaacttatagagaggctcacataccatatgtacgcagat
cccaattttgttcggacatttcttacaacatacagatccttttgtaaacctcaagaacta
ctgagtcttataatagaaaggtttgaaattccagagcctgagccaacagaagctgatcgc
atagctatagaaaatggagatcagcccttgagtgcagaactaaaaaggtttagaaaagaa
tatatacagcccgtacaactgcgagtattaaatgtatgtcggcactgggtagagcaccac
ttctatgattttgaaagagatgcagatcttttgcagcgaatggaggaatttattggaaca
gtaaggggtaaagcaatgaagaaatgggttgaatcaatcactaaaataattcaaaggaaa
aaaattgcaagagacaatggaccaggtcataatattacatttcagagttcacctcccaca
gttgagtggcatataagcagacctgggcacatagagtcttttgacctgctcaccttacac
ccaatagaaattgctcgacaactcactttacttgaatcagatctgtatcgagctgtgcag
ccatcagaattagttggaagtgtgtggacaaaagaagacaaagaaattaattctcctaat
cttctgaaaatgatccggcataccactaatctcactctgtggtttgagaaatgtattgta
gaaactgaaaacttagaagaaagggtagctgtggtgagtcgaataattgagattctgcaa
gtctttcaagagctgaacaatttcaatggtgtacttgaggttgtcagtgctatgaactca
tcacctgtttacagactagaccacaccttcgagcaaataccaagtcgccaaaagaaaatt
ttagaagaagctcatgaactaagtgaagatcactataagaaatatttggcaaaactcagg
tctattaatccaccatgtgtgcctttctttggaatttatctaactaatatcttgaaaaca
gaagaaggcaaccctgaggtcctaaaaaggcatggaaaagagcttataaactttagcaaa
aggaggaaagtggcagaaataacaggcgagatccagcagtaccaaaatcagccttattgt
ttacgagtagaatcagatatcaaaaggttctttgaaaacttgaatccaatgggaagtagc
atggaaaaagaatttacagattatcttttcaacaaatccctagaaatagaaccacgaaac
cctaaacctctcccaagatttccaaaaaaatacaactatcccctaaaatctcctggcgtt
cgtccatcaaacccaagaccaggtaccatgagacatcccacacctctgcagcaggagcca
aggaaaattagttatagtaggatccctgaaagtgaaacagaaagtacagcatctgcacca
aattctccaagaacaccgttaacacctcctcctgcttctggtgcttctagtaccacagat
gtttgcagcgtatttgattctgatcattcaagcccttttcactcaagcagcgataccgtc
tttatccaagttacactgccccatggcccaagatctgcttcagtatcatctataagctta
accaagagcactgatgaagtgcctgtcccccctcctgttcctccacgaagacgaccagaa
tctgccccagcggaatcttcgccatctaagattatgtctaagcatttggacagcccccca
gcaattcctcctaggcaacccacatcaaaagcctattcaccacgatattccatatcagac
cggacctctatatcagaccctcctgaaagccctcccttactaccaccacgagaacctgtg
aggacacctgatgttttctcaagctcaccactacatctccaacctccccctttgggcaaa
aaaagtgaccatagtaatgccttcttcccaaatagcccttccccctttacaccacctcct
cctcaaacaccttctcctcacggcaccagaaggcatctaccatcaccaccattgacacag
gaagtagaccttcattccattgctgggccacctgttcctccacgacaaagcacttctcaa
catatccccaaactccctccaaaaacttacaaaagggagcacacacacccatccatgcac
agagatggaccaccactgttggagaacgcccattcttcctga

DBGET integrated database retrieval system