KEGG   Lipotes vexillifer (Yangtze River dolphin): 103079242
Entry
103079242         CDS       T03090                                 

Gene name
SOS1
Definition
(RefSeq) SOS Ras/Rac guanine nucleotide exchange factor 1
  KO
K03099  son of sevenless
Organism
lve  Lipotes vexillifer (Yangtze River dolphin)
Pathway
lve01521  EGFR tyrosine kinase inhibitor resistance
lve01522  Endocrine resistance
lve04010  MAPK signaling pathway
lve04012  ErbB signaling pathway
lve04014  Ras signaling pathway
lve04062  Chemokine signaling pathway
lve04068  FoxO signaling pathway
lve04072  Phospholipase D signaling pathway
lve04150  mTOR signaling pathway
lve04151  PI3K-Akt signaling pathway
lve04510  Focal adhesion
lve04540  Gap junction
lve04630  JAK-STAT signaling pathway
lve04650  Natural killer cell mediated cytotoxicity
lve04660  T cell receptor signaling pathway
lve04662  B cell receptor signaling pathway
lve04664  Fc epsilon RI signaling pathway
lve04714  Thermogenesis
lve04722  Neurotrophin signaling pathway
lve04810  Regulation of actin cytoskeleton
lve04910  Insulin signaling pathway
lve04912  GnRH signaling pathway
lve04915  Estrogen signaling pathway
lve04917  Prolactin signaling pathway
lve04926  Relaxin signaling pathway
lve04935  Growth hormone synthesis, secretion and action
lve05034  Alcoholism
lve05160  Hepatitis C
lve05161  Hepatitis B
lve05163  Human cytomegalovirus infection
lve05165  Human papillomavirus infection
lve05200  Pathways in cancer
lve05205  Proteoglycans in cancer
lve05206  MicroRNAs in cancer
lve05210  Colorectal cancer
lve05211  Renal cell carcinoma
lve05213  Endometrial cancer
lve05214  Glioma
lve05215  Prostate cancer
lve05220  Chronic myeloid leukemia
lve05221  Acute myeloid leukemia
lve05223  Non-small cell lung cancer
lve05224  Breast cancer
lve05225  Hepatocellular carcinoma
lve05226  Gastric cancer
lve05231  Choline metabolism in cancer
Brite
KEGG Orthology (KO) [BR:lve00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04010 MAPK signaling pathway
    103079242 (SOS1)
   04012 ErbB signaling pathway
    103079242 (SOS1)
   04014 Ras signaling pathway
    103079242 (SOS1)
   04630 JAK-STAT signaling pathway
    103079242 (SOS1)
   04068 FoxO signaling pathway
    103079242 (SOS1)
   04072 Phospholipase D signaling pathway
    103079242 (SOS1)
   04151 PI3K-Akt signaling pathway
    103079242 (SOS1)
   04150 mTOR signaling pathway
    103079242 (SOS1)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    103079242 (SOS1)
   04540 Gap junction
    103079242 (SOS1)
  09142 Cell motility
   04810 Regulation of actin cytoskeleton
    103079242 (SOS1)
 09150 Organismal Systems
  09151 Immune system
   04650 Natural killer cell mediated cytotoxicity
    103079242 (SOS1)
   04660 T cell receptor signaling pathway
    103079242 (SOS1)
   04662 B cell receptor signaling pathway
    103079242 (SOS1)
   04664 Fc epsilon RI signaling pathway
    103079242 (SOS1)
   04062 Chemokine signaling pathway
    103079242 (SOS1)
  09152 Endocrine system
   04910 Insulin signaling pathway
    103079242 (SOS1)
   04912 GnRH signaling pathway
    103079242 (SOS1)
   04915 Estrogen signaling pathway
    103079242 (SOS1)
   04917 Prolactin signaling pathway
    103079242 (SOS1)
   04926 Relaxin signaling pathway
    103079242 (SOS1)
   04935 Growth hormone synthesis, secretion and action
    103079242 (SOS1)
  09156 Nervous system
   04722 Neurotrophin signaling pathway
    103079242 (SOS1)
  09159 Environmental adaptation
   04714 Thermogenesis
    103079242 (SOS1)
 09160 Human Diseases
  09161 Cancer: overview
   05200 Pathways in cancer
    103079242 (SOS1)
   05206 MicroRNAs in cancer
    103079242 (SOS1)
   05205 Proteoglycans in cancer
    103079242 (SOS1)
   05231 Choline metabolism in cancer
    103079242 (SOS1)
  09162 Cancer: specific types
   05210 Colorectal cancer
    103079242 (SOS1)
   05225 Hepatocellular carcinoma
    103079242 (SOS1)
   05226 Gastric cancer
    103079242 (SOS1)
   05214 Glioma
    103079242 (SOS1)
   05221 Acute myeloid leukemia
    103079242 (SOS1)
   05220 Chronic myeloid leukemia
    103079242 (SOS1)
   05211 Renal cell carcinoma
    103079242 (SOS1)
   05215 Prostate cancer
    103079242 (SOS1)
   05213 Endometrial cancer
    103079242 (SOS1)
   05224 Breast cancer
    103079242 (SOS1)
   05223 Non-small cell lung cancer
    103079242 (SOS1)
  09165 Substance dependence
   05034 Alcoholism
    103079242 (SOS1)
  09172 Infectious disease: viral
   05161 Hepatitis B
    103079242 (SOS1)
   05160 Hepatitis C
    103079242 (SOS1)
   05163 Human cytomegalovirus infection
    103079242 (SOS1)
   05165 Human papillomavirus infection
    103079242 (SOS1)
  09176 Drug resistance: antineoplastic
   01521 EGFR tyrosine kinase inhibitor resistance
    103079242 (SOS1)
   01522 Endocrine resistance
    103079242 (SOS1)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   04990 Domain-containing proteins not elsewhere classified [BR:lve04990]
    103079242 (SOS1)
Domain-containing proteins not elsewhere classified [BR:lve04990]
 Pleckstrin homology (PH) domain-containing proteins
  Dbl-Like RhoGEF family proteins
   103079242 (SOS1)
SSDB
Motif
Pfam: RasGEF RasGEF_N RhoGEF Histone PH PH_19 PH_10 IQ_SEC7_PH PH_13
Other DBs
NCBI-GeneID: 103079242
NCBI-ProteinID: XP_007445829
UniProt: A0A340WF86
LinkDB
Position
Un
AA seq 1333 aa
MQAQQLPYEFFSEENAPKWRGLLVPALKKVQGQVHPTLESSDDALQYVEELILQLLNMLC
QAQPRSASDVEERVQKSFPHPIDKWAIADAQSAIEKRKRRNPLSLPVEKIHPLLKEVLGY
KIDHQVSVYIVAVLEYISADILKLVGNYVRNIRHYEITKQDIKVAMCADKVLMDMFHQDV
EDINILSLTDEEPSTSGEQTYYDLVKAFMAEIRQYIRELNLIIKVFREPFVSNSKLFSAN
DIENIFSRIVDIHELSVKLLGHIEDTVEMTDEGSPHPLVGSCFEDLAEELAFDPYESYAR
DILRPGFHDRFLSQLSKPGAALYLQSIGEGFKEAVQYVLPRLLLAPVYHCLHYFELLKQL
EEKSEDQEDKECLKQAITALLNVQSGMEKICSKSLAKRRLSESACRFYSQQMKGKQLAIK
KMNEIQKNIDGWEGKDIGQCCNEFIMEGTLTRVGAKHERHIFLFDGLMICCKSNHGQPRL
PGASNAEYRLKEKFFMRKVQINDKDDTNEYKHAFEIILKDENSVIFSAKSAEEKNNWMAA
LISLQYRSTLERMLDVTMLQEEKEEQMRLPSADVYRFAEPDSEENIIFEENVQPKAGIPI
IKAGTVIKLIERLTYHMYADPNFVRTFLTTYRSFCKPQELLSLIIERFEIPEPEPTEADR
IAIENGDQPLSAELKRFRKEYIQPVQLRVLNVCRHWVEHHFYDFERDADLLQRMEEFIGT
VRGKAMKKWVESITKIIQRKKIARDNGPGHNITFQSSPPTVEWHISRPGHIETFDLLTLH
PIEIARQLTLLESDLYRAVQPSELVGSVWTKEDKEINSPNLLKMIRHTTNLTLWFEKCIV
ETENLEERVAVVSRIIEILQVFQELNNFNGVLEVVSAMNSSPVYRLDHTFEQIPSRQKKI
LEEAHELSEDHYKKYLAKLRSINPPCVPFFGIYLTNILKTEEGNPEVLKRHGKELINFSK
RRKVAEITGEIQQYQNQPYCLRVESDIKRFFENLNPMGNSMEKEFTDYLFNKSLEIEPRN
PKPLPRFPKKYSYPLKSPGVRPSNPRPGTMRHPTPLQQEPRKISYSRIPESETESTASAP
NSPRTPLTPPPASGASSTTDVCSVFDSDHSSPFHSSSDTVFIQVTLPHGPRSASVSSISL
NKSTDEVPVPPPVPPRRRPESAPAESSPSKIMSKHLDSPPAIPPRQPTSKAYSPRYSVSD
RTSISDPPESPPLLPPREPVRTPDVFSSSPLHLQPPPLGKKSDHSNAFFPNSPSPFTPPP
PQTPSPHGTRRHLPSPPLTQEVDLHSIAGPPVPPRQSTSQHIPKLPPKTYKREHTHPSMH
RDGPPLLENAHSS
NT seq 4002 nt   +upstreamnt  +downstreamnt
atgcaggcgcagcagctgccgtacgagtttttcagcgaggagaacgcgcccaagtggcgg
gggctgctggtgcctgcgctgaaaaaggttcaggggcaagttcatcctactcttgagtct
agtgatgatgctcttcagtatgttgaagaattaattttgcagttattaaatatgctatgc
caagctcagccccgaagtgcttcagatgtagaggaacgtgttcaaaaaagtttccctcat
ccaattgataagtgggcaatagctgatgcccagtcggctattgaaaagaggaagcgaaga
aaccctttatctctcccagtagaaaaaattcatcctttgttaaaggaggtcctaggttat
aaaattgaccaccaggtttctgtttacatagtagcagtattagaatacatttctgcagac
attttaaagctggtggggaattatgtgcgaaatatacggcattatgaaattacaaaacaa
gatattaaagtggcaatgtgtgctgataaggtattgatggatatgtttcatcaagatgta
gaagatataaatatattatctttaactgatgaagagccttccacctcaggagagcaaact
tattatgatttggtaaaagcatttatggcagaaattcgacaatacataagggaacttaat
ttaattataaaagtttttagagagccctttgtctccaattcaaaattgttttcagctaat
gatatagaaaatatatttagtcgtatagtggatatacatgaactcagtgtaaagttactg
ggccatatagaagatactgtagaaatgacagatgaaggcagtccccatccattagtagga
agttgctttgaagacttagcagaggaactggcatttgatccatatgaatcatatgctcga
gatattttacgacctggttttcatgatcgtttccttagtcagttatcaaagcctggagcg
gcactctatttgcagtcaataggcgaaggtttcaaagaagctgttcagtatgttttaccc
aggctacttctagcccctgtttaccactgtctacattactttgaacttctgaagcagtta
gaagaaaagagtgaagatcaagaagacaaggaatgtttgaaacaagcaataacagctttg
cttaatgttcagagtggtatggaaaaaatatgctctaaaagtcttgcaaaacgaaggctg
agtgaatctgcatgtcggttttatagtcagcaaatgaaggggaaacaactagcaatcaag
aaaatgaacgagattcagaagaatattgatggttgggagggaaaagacattggacagtgt
tgcaatgagtttataatggaaggaactcttacacgtgtaggagccaaacatgagagacac
atatttctctttgatggcttaatgatttgctgtaaatcaaatcatgggcagccaagactt
cctggtgctagcaatgcagaatatcgtcttaaagaaaagttttttatgagaaaggtacaa
attaatgacaaagatgacaccaatgagtacaagcatgcttttgaaataattttaaaagat
gaaaatagcgttatattttctgccaagtcagctgaagagaaaaacaactggatggcagca
ttgatatctttacagtaccggagtacactggaaaggatgcttgatgtgacaatgctacag
gaagaaaaggaggagcagatgaggctccctagtgctgatgtttatagatttgcagagcct
gactctgaagaaaatatcatatttgaagaaaacgtgcagcccaaagctggaattccaatt
atcaaagcaggaactgttattaaacttatagagaggctcacataccatatgtacgcagat
cccaattttgttcggacatttcttacaacatatagatccttctgtaagcctcaagaacta
ctgagtcttataatagaaaggtttgaaattccagagcctgagccaacagaagctgatcgc
atagctatagagaatggagatcagcccttgagtgcagaactaaaaaggtttagaaaagaa
tatatacagcctgtacaactgcgagtattaaacgtatgtcggcactgggtagagcaccac
ttctacgattttgaaagagatgcagatctgttgcagcgaatggaggaatttattggaaca
gtaagaggtaaagcaatgaaaaagtgggttgaatccatcactaaaataatccaaaggaaa
aaaattgcaagagacaatggaccaggtcataatattacatttcagagttcaccccctaca
gttgagtggcatataagcagacctgggcacatagagacttttgacctgctcaccttacac
ccaatagaaattgctcgacaactcactttacttgaatcagatctatatcgagctgtacag
ccatcagaattagttggaagtgtgtggacaaaggaagacaaagaaattaattctcctaat
cttctgaaaatgatccggcataccactaatctcactctgtggtttgagaaatgtattgta
gaaactgaaaacttagaagaaagagtagctgtggtgagtcgaataattgagattctgcaa
gtctttcaagagctgaacaacttcaatggtgtccttgaggttgtcagtgctatgaactca
tcacctgtatacagactagaccacacgtttgagcaaataccaagtcgccaaaagaaaatt
ttagaagaagctcatgaactaagtgaagatcactataagaaatatttggcaaaactcagg
tctattaatccaccatgtgtgcctttctttggaatttatctaactaatatcttgaaaaca
gaagaaggcaaccctgaggtcctaaaaaggcatggaaaagagcttataaactttagcaaa
aggaggaaagtagcagaaataacaggcgagatccagcagtaccaaaatcagccttattgt
ttacgagtagaatcagatatcaaaaggttctttgaaaacttgaatccaatgggaaatagt
atggaaaaagaatttacagattatcttttcaacaaatccctagaaatagaaccacgaaac
cctaagcctctcccaagatttccaaaaaaatatagctatcccctaaaatctcctggtgtt
cgtccatcaaacccaagaccaggtactatgagacatcccacacctctgcagcaagagcca
aggaaaattagttatagtaggatccctgaaagtgaaacagaaagtacagcatctgcgcca
aattctccaagaacaccgttaacacctcctcctgcttctggcgcttctagtaccacagat
gtttgcagcgtatttgattctgatcattcaagcccttttcactcaagcagcgataccgtc
tttatccaagttacactgccccatggcccaagatctgcttcagtatcatctataagttta
aacaagagcactgatgaagtgcctgtcccccctcctgttcctccacgaagacgaccagaa
tctgccccagcggaatcttcgccatctaagatcatgtctaagcatttggacagcccccca
gcaattcctcctaggcaacccacatcaaaagcctattcaccacgatattcagtatcagac
cggacctctatatcagatcctcctgaaagccctcccttattacctccacgagaacccgtg
aggacacctgatgttttctcaagctcaccactacatctccaacctccccctttgggcaaa
aaaagtgaccatagtaatgccttcttcccaaacagcccttccccctttacaccacctcct
cctcaaacaccttctcctcacggcacaagaaggcatctgccatcaccaccattgacacaa
gaagtggaccttcattccattgctgggccgcctgttcctccacgacagagcacttctcaa
catatccctaaactccctccaaaaacttacaaaagggagcatacacacccatccatgcac
agagatggaccaccactgttggagaatgctcattcttcctga

KEGG   Lipotes vexillifer (Yangtze River dolphin): 103081084
Entry
103081084         CDS       T03090                                 

Gene name
SOS2
Definition
(RefSeq) son of sevenless homolog 2 (Drosophila)
  KO
K03099  son of sevenless
Organism
lve  Lipotes vexillifer (Yangtze River dolphin)
Pathway
lve01521  EGFR tyrosine kinase inhibitor resistance
lve01522  Endocrine resistance
lve04010  MAPK signaling pathway
lve04012  ErbB signaling pathway
lve04014  Ras signaling pathway
lve04062  Chemokine signaling pathway
lve04068  FoxO signaling pathway
lve04072  Phospholipase D signaling pathway
lve04150  mTOR signaling pathway
lve04151  PI3K-Akt signaling pathway
lve04510  Focal adhesion
lve04540  Gap junction
lve04630  JAK-STAT signaling pathway
lve04650  Natural killer cell mediated cytotoxicity
lve04660  T cell receptor signaling pathway
lve04662  B cell receptor signaling pathway
lve04664  Fc epsilon RI signaling pathway
lve04714  Thermogenesis
lve04722  Neurotrophin signaling pathway
lve04810  Regulation of actin cytoskeleton
lve04910  Insulin signaling pathway
lve04912  GnRH signaling pathway
lve04915  Estrogen signaling pathway
lve04917  Prolactin signaling pathway
lve04926  Relaxin signaling pathway
lve04935  Growth hormone synthesis, secretion and action
lve05034  Alcoholism
lve05160  Hepatitis C
lve05161  Hepatitis B
lve05163  Human cytomegalovirus infection
lve05165  Human papillomavirus infection
lve05200  Pathways in cancer
lve05205  Proteoglycans in cancer
lve05206  MicroRNAs in cancer
lve05210  Colorectal cancer
lve05211  Renal cell carcinoma
lve05213  Endometrial cancer
lve05214  Glioma
lve05215  Prostate cancer
lve05220  Chronic myeloid leukemia
lve05221  Acute myeloid leukemia
lve05223  Non-small cell lung cancer
lve05224  Breast cancer
lve05225  Hepatocellular carcinoma
lve05226  Gastric cancer
lve05231  Choline metabolism in cancer
Brite
KEGG Orthology (KO) [BR:lve00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04010 MAPK signaling pathway
    103081084 (SOS2)
   04012 ErbB signaling pathway
    103081084 (SOS2)
   04014 Ras signaling pathway
    103081084 (SOS2)
   04630 JAK-STAT signaling pathway
    103081084 (SOS2)
   04068 FoxO signaling pathway
    103081084 (SOS2)
   04072 Phospholipase D signaling pathway
    103081084 (SOS2)
   04151 PI3K-Akt signaling pathway
    103081084 (SOS2)
   04150 mTOR signaling pathway
    103081084 (SOS2)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    103081084 (SOS2)
   04540 Gap junction
    103081084 (SOS2)
  09142 Cell motility
   04810 Regulation of actin cytoskeleton
    103081084 (SOS2)
 09150 Organismal Systems
  09151 Immune system
   04650 Natural killer cell mediated cytotoxicity
    103081084 (SOS2)
   04660 T cell receptor signaling pathway
    103081084 (SOS2)
   04662 B cell receptor signaling pathway
    103081084 (SOS2)
   04664 Fc epsilon RI signaling pathway
    103081084 (SOS2)
   04062 Chemokine signaling pathway
    103081084 (SOS2)
  09152 Endocrine system
   04910 Insulin signaling pathway
    103081084 (SOS2)
   04912 GnRH signaling pathway
    103081084 (SOS2)
   04915 Estrogen signaling pathway
    103081084 (SOS2)
   04917 Prolactin signaling pathway
    103081084 (SOS2)
   04926 Relaxin signaling pathway
    103081084 (SOS2)
   04935 Growth hormone synthesis, secretion and action
    103081084 (SOS2)
  09156 Nervous system
   04722 Neurotrophin signaling pathway
    103081084 (SOS2)
  09159 Environmental adaptation
   04714 Thermogenesis
    103081084 (SOS2)
 09160 Human Diseases
  09161 Cancer: overview
   05200 Pathways in cancer
    103081084 (SOS2)
   05206 MicroRNAs in cancer
    103081084 (SOS2)
   05205 Proteoglycans in cancer
    103081084 (SOS2)
   05231 Choline metabolism in cancer
    103081084 (SOS2)
  09162 Cancer: specific types
   05210 Colorectal cancer
    103081084 (SOS2)
   05225 Hepatocellular carcinoma
    103081084 (SOS2)
   05226 Gastric cancer
    103081084 (SOS2)
   05214 Glioma
    103081084 (SOS2)
   05221 Acute myeloid leukemia
    103081084 (SOS2)
   05220 Chronic myeloid leukemia
    103081084 (SOS2)
   05211 Renal cell carcinoma
    103081084 (SOS2)
   05215 Prostate cancer
    103081084 (SOS2)
   05213 Endometrial cancer
    103081084 (SOS2)
   05224 Breast cancer
    103081084 (SOS2)
   05223 Non-small cell lung cancer
    103081084 (SOS2)
  09165 Substance dependence
   05034 Alcoholism
    103081084 (SOS2)
  09172 Infectious disease: viral
   05161 Hepatitis B
    103081084 (SOS2)
   05160 Hepatitis C
    103081084 (SOS2)
   05163 Human cytomegalovirus infection
    103081084 (SOS2)
   05165 Human papillomavirus infection
    103081084 (SOS2)
  09176 Drug resistance: antineoplastic
   01521 EGFR tyrosine kinase inhibitor resistance
    103081084 (SOS2)
   01522 Endocrine resistance
    103081084 (SOS2)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   04990 Domain-containing proteins not elsewhere classified [BR:lve04990]
    103081084 (SOS2)
Domain-containing proteins not elsewhere classified [BR:lve04990]
 Pleckstrin homology (PH) domain-containing proteins
  Dbl-Like RhoGEF family proteins
   103081084 (SOS2)
SSDB
Motif
Pfam: RasGEF RasGEF_N RhoGEF PH Histone IQ_SEC7_PH PH_19 PH_13
Other DBs
NCBI-GeneID: 103081084
NCBI-ProteinID: XP_007446558
UniProt: A0A340WAQ5
LinkDB
Position
Un
AA seq 1272 aa
MAQPRTVQDVEERVQKTFPHPIDKWAIADAQSAIEKRKRRNPLLLPVDKIHPSLKEVLGY
KVDYHVSLYIVAVLEYISADILKLAGNYVFNIRHYEISQQDIKVSMCADKVLMDMFDQDD
IGLVSLCEDEPSSSGELNYYDLVRTEIAEERQYLRELNMIIKVFREAFLSDRDLFKPSDI
EKIFSNILDIHELTVKLLGLIEDTVEMTDESSPHPLAGSCFEDLAEEQAFDPYETLSQDI
LSPKFNEHFSKLMARPAVALHFQSIADGFKEAVRYVLPRLMLVPVYHCWHYFELLKQLKA
CSEEHEDRECLNQAITALMNLQGSMDRIYKQYSPRRRPGDPVCPFYNHQLRSKHLAIKKM
NEIQKNIDGWEGKDIGQCCNEFIMEGPLTRIGAKHERHIFLFDGLMISCKPNHSQSRLPG
YSSAEYRLKEKFVMRKIQICDKEDTCECKHAFELVSKDENSIIFAAKSAEEKNNWMAALI
SLHYRSTLDRMLDSVLLKEENEQPLRLPSPEVYRFVVKDSEENIVFEDNLQSRSGIPIIK
GGTVVKLIERLTYHMYADPNFVRTFLTTYRSFCKPQELLSLLIERFEIPEPEPTEADKLA
VEKGEQPISADLKRFRKEYVQPVQLRILNVFRHWVEHHFYDFERDLELLERLESFISSVR
GKAMKKWVESIAKIIKRKKQAQANGISHNITFESPPPPTEWHISRPGQFETFDLMTLHPI
EIARQLTLLESDLYRKVQPSELVGSVWTKEDKEINSPNLLKMIRHTTNLTLWFEKCIVEA
ENFEERVAVLSRIIEILQVFQDLNNFNGVLEIVSAVNSVSVYRLDHTFEALQERKRKILD
EAVELSQDHFKKYLLKLKSINPPCVPFFGIYLTNILKTEEGNNDFLKKKGKDLINFSKRR
KVAEITGEIQQYQNQPYCLRIEPEMRRFFENLNPMGNASEKEFTDYLFNKSLEIEPRNCK
QPPRFPRKSTFSLKSPGIRPNTGRHGSTSGTLRGHPTPLEREPCKISFSRIAETELESTV
SAPTSPNTPSTPPVSASSDLSVFLDVDLNSSCGSNSIFAPVLLPHSKSFFSSCGSLHKLS
EEPLIPPPLPPRKKFDHDASNSKGTMKSDDDPPAIPPRQPPPPKVKPRVPAPSGAFDGPL
HSPPPPPPRDPLPDTPPPVPLRPPEHFINCPFTLQPPPLGHLHRDPDWFRDINTCPNSPN
TPPSTPSPRVPRRCYVLSSSQNNLAHPQAPPVPPRQNSSPHLPKLPPKTYKRELSHPPLY
RLPLLENAETPQ
NT seq 3819 nt   +upstreamnt  +downstreamnt
atggcccaaccaaggactgttcaagatgtggaggaacgagttcaaaagacctttcctcat
ccaattgataaatgggctattgctgatgcgcagtctgccatagagaaacgaaaacgaaga
aatcctctcttactgcctgtggacaaaatccatccttcattgaaggaagttttagggtac
aaagtggactaccatgtgtccctatatattgtggctgtactagagtatatctcagctgat
attttgaaattggctggtaattatgtttttaatatccgacattatgaaatatcccaacag
gacattaaagtgtccatgtgtgcagataaggttttgatggacatgtttgatcaggatgac
ataggcttggtttctctttgtgaagatgaacctagttcttcaggtgaattaaactactat
gaccttgtcagaactgaaattgcagaagaaagacagtatctacgggaactaaatatgatc
ataaaagtgtttcgagaagcttttctttctgacagagacctgtttaaaccttctgatatt
gaaaagattttcagtaacattttagatatacatgaattgaccgtgaaacttttaggttta
attgaagacacagttgaaatgactgatgaaagcagccctcatcccttagctggaagctgt
tttgaagatttggcagaggaacaagcatttgatccttatgaaacattatcacaggatatt
ctttcaccaaaatttaatgaacatttcagtaagttgatggccagacctgcagtggctcta
cactttcagtccattgctgatggttttaaagaggcagttcgttatgtccttccacgcctt
atgctggtgccagtatatcattgttggcactattttgaattattaaagcaattgaaagca
tgtagtgaagagcatgaagacagagaatgtttgaaccaagctattactgctcttatgaat
ctccaaggtagtatggaccgaatttacaagcagtattcacctagacgccgacctggggat
cctgtttgccctttttataatcatcaattaagaagcaagcacctggctattaaaaaaatg
aatgaaattcagaaaaacatagatggatgggaaggcaaagatattggacagtgttgtaat
gaatttattatggaaggcccattgacaagaattggtgctaaacacgaacggcatattttt
ctctttgatggcttaatgattagctgcaaacccaatcatagccagtcacgccttccagga
tacagtagtgcagaatacagattaaaagaaaaatttgtcatgaggaaaatacaaatatgt
gataaagaagatacctgtgagtgcaaacatgcttttgaattggtatccaaagatgaaaac
agcataatatttgctgctaagtctgctgaagagaaaaataattggatggcagcacttatt
tctcttcattatcgtagtactctagatcgaatgctagattcagtattactgaaggaagaa
aatgaacaaccacttagattaccgagtcctgaagtgtatcgttttgtggtaaaagactct
gaggaaaacattgtttttgaagacaacttgcaaagtagaagtggaatccccattattaaa
ggaggaactgtggtgaaattaattgaaaggttaacatatcacatgtatgcagatcccaat
tttgttcgcacttttcttactacgtatcgttcattttgtaaaccacaggaattgctaagc
ttactgattgaacgatttgaaattccagagccagaacctactgaagcagataaattggca
gtagagaaaggcgagcagccaatcagtgcagaccttaaaaggtttcgcaaggaatacgtc
caaccagtacaacttaggatcttaaatgtgtttcggcactgggttgaacaccatttttat
gactttgaaagagacttggagttgcttgaaagactagaatccttcatttcaagtgtaaga
gggaaagctatgaagaaatgggtagagtcaattgctaagatcatcaagaggaagaaacaa
gctcaggcaaatggaataagccataatattacctttgaaagtccacctccaccaactgaa
tggcatatcagcagaccaggacagtttgaaacgtttgatctcatgacacttcatccaata
gaaattgcacgtcagctgacacttttggaatctgatctctacaggaaagttcaaccttct
gaacttgtagggagtgtatggaccaaagaagacaaagaaataaattctccaaatttatta
aaaatgattcgccacaccacaaatctcactctctggtttgaaaagtgcattgtggaagca
gaaaattttgaggaacgggtggcagtactaagtagaattatagaaattctgcaagttttt
caagatttgaataatttcaatggtgtattagagatcgtcagtgcagtaaattcagtatca
gtctatagactagatcatacctttgaggcattgcaggaaagaaaaaggaaaattttggat
gaagctgtggaattaagtcaagatcattttaaaaaatatctactgaaacttaagtcaatc
aatccaccttgtgtgcctttttttggaatatatttaacaaatattctgaagactgaagaa
gggaataatgattttttaaaaaagaaagggaaagatttaatcaatttcagtaagaggagg
aaagtagctgaaattactggagaaattcagcaatatcagaatcaaccttactgtttacgg
atagaaccagaaatgaggaggttctttgaaaaccttaaccccatgggaaatgcttctgaa
aaagagtttacagattatctgttcaacaagtcactagaaattgaaccccgaaactgcaaa
cagccacctcgatttcctaggaaatcaactttctctttaaaatctcctggaataaggcct
aatactggccgacatggctctacctcaggtactttacgaggtcatccaacaccattagaa
agagaaccatgtaaaataagctttagtcggattgctgaaacagagcttgaatcaacagtg
tcagcaccaacctctccaaatacaccatctactccaccagtatctgcttcttcagacctt
agtgtgtttttagatgtggatctcaacagttcctgtggaagcaatagcatctttgctcca
gtcctcttgcctcattcaaagtctttctttagttcgtgtggtagtttacataaactaagt
gaagagccactgattcctcctccacttcctcctcgaaaaaaatttgatcatgatgcttca
aattccaagggaactatgaaatctgatgatgacccccctgctattccgccaagacagcct
cctcctccaaaggtaaaacccagagttcctgctccttctggtgcatttgacgggcctctg
catagtccaccaccaccgccgccaagagatcctcttcccgatacccctccaccagttccc
cttcggcctccagaacactttataaactgtccgtttactcttcagccacctccactggga
catcttcacagagatccagactggttcagagacattaatacttgtccaaattctccaaac
actcctcctagcacaccctctccaagggtaccacgtcgatgctacgtgctcagttctagt
cagaataatcttgctcatcctcaagctccccctgttccaccaaggcagaattcaagccct
cacctaccaaaactgccaccaaagacttacaaacgggagctttcgcaccccccattgtat
agactgcctttgctagaaaatgcagaaactcctcaatga

DBGET integrated database retrieval system