KEGG   Mus musculus (house mouse): 20662
Entry
20662             CDS       T01002                                 
Symbol
Sos1, 4430401P03Rik, 9630010N06
Name
(RefSeq) SOS Ras/Rac guanine nucleotide exchange factor 1
  KO
K03099  son of sevenless
Organism
mmu  Mus musculus (house mouse)
Pathway
mmu01521  EGFR tyrosine kinase inhibitor resistance
mmu01522  Endocrine resistance
mmu04010  MAPK signaling pathway
mmu04012  ErbB signaling pathway
mmu04014  Ras signaling pathway
mmu04062  Chemokine signaling pathway
mmu04068  FoxO signaling pathway
mmu04072  Phospholipase D signaling pathway
mmu04150  mTOR signaling pathway
mmu04151  PI3K-Akt signaling pathway
mmu04510  Focal adhesion
mmu04540  Gap junction
mmu04630  JAK-STAT signaling pathway
mmu04650  Natural killer cell mediated cytotoxicity
mmu04660  T cell receptor signaling pathway
mmu04662  B cell receptor signaling pathway
mmu04664  Fc epsilon RI signaling pathway
mmu04714  Thermogenesis
mmu04722  Neurotrophin signaling pathway
mmu04810  Regulation of actin cytoskeleton
mmu04910  Insulin signaling pathway
mmu04912  GnRH signaling pathway
mmu04915  Estrogen signaling pathway
mmu04917  Prolactin signaling pathway
mmu04926  Relaxin signaling pathway
mmu04935  Growth hormone synthesis, secretion and action
mmu05034  Alcoholism
mmu05160  Hepatitis C
mmu05161  Hepatitis B
mmu05163  Human cytomegalovirus infection
mmu05165  Human papillomavirus infection
mmu05200  Pathways in cancer
mmu05205  Proteoglycans in cancer
mmu05206  MicroRNAs in cancer
mmu05207  Chemical carcinogenesis - receptor activation
mmu05208  Chemical carcinogenesis - reactive oxygen species
mmu05210  Colorectal cancer
mmu05211  Renal cell carcinoma
mmu05213  Endometrial cancer
mmu05214  Glioma
mmu05215  Prostate cancer
mmu05220  Chronic myeloid leukemia
mmu05221  Acute myeloid leukemia
mmu05223  Non-small cell lung cancer
mmu05224  Breast cancer
mmu05225  Hepatocellular carcinoma
mmu05226  Gastric cancer
mmu05231  Choline metabolism in cancer
Brite
KEGG Orthology (KO) [BR:mmu00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04010 MAPK signaling pathway
    20662 (Sos1)
   04012 ErbB signaling pathway
    20662 (Sos1)
   04014 Ras signaling pathway
    20662 (Sos1)
   04630 JAK-STAT signaling pathway
    20662 (Sos1)
   04068 FoxO signaling pathway
    20662 (Sos1)
   04072 Phospholipase D signaling pathway
    20662 (Sos1)
   04151 PI3K-Akt signaling pathway
    20662 (Sos1)
   04150 mTOR signaling pathway
    20662 (Sos1)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    20662 (Sos1)
   04540 Gap junction
    20662 (Sos1)
  09142 Cell motility
   04810 Regulation of actin cytoskeleton
    20662 (Sos1)
 09150 Organismal Systems
  09151 Immune system
   04650 Natural killer cell mediated cytotoxicity
    20662 (Sos1)
   04660 T cell receptor signaling pathway
    20662 (Sos1)
   04662 B cell receptor signaling pathway
    20662 (Sos1)
   04664 Fc epsilon RI signaling pathway
    20662 (Sos1)
   04062 Chemokine signaling pathway
    20662 (Sos1)
  09152 Endocrine system
   04910 Insulin signaling pathway
    20662 (Sos1)
   04912 GnRH signaling pathway
    20662 (Sos1)
   04915 Estrogen signaling pathway
    20662 (Sos1)
   04917 Prolactin signaling pathway
    20662 (Sos1)
   04926 Relaxin signaling pathway
    20662 (Sos1)
   04935 Growth hormone synthesis, secretion and action
    20662 (Sos1)
  09156 Nervous system
   04722 Neurotrophin signaling pathway
    20662 (Sos1)
  09159 Environmental adaptation
   04714 Thermogenesis
    20662 (Sos1)
 09160 Human Diseases
  09161 Cancer: overview
   05200 Pathways in cancer
    20662 (Sos1)
   05206 MicroRNAs in cancer
    20662 (Sos1)
   05205 Proteoglycans in cancer
    20662 (Sos1)
   05207 Chemical carcinogenesis - receptor activation
    20662 (Sos1)
   05208 Chemical carcinogenesis - reactive oxygen species
    20662 (Sos1)
   05231 Choline metabolism in cancer
    20662 (Sos1)
  09162 Cancer: specific types
   05210 Colorectal cancer
    20662 (Sos1)
   05225 Hepatocellular carcinoma
    20662 (Sos1)
   05226 Gastric cancer
    20662 (Sos1)
   05214 Glioma
    20662 (Sos1)
   05221 Acute myeloid leukemia
    20662 (Sos1)
   05220 Chronic myeloid leukemia
    20662 (Sos1)
   05211 Renal cell carcinoma
    20662 (Sos1)
   05215 Prostate cancer
    20662 (Sos1)
   05213 Endometrial cancer
    20662 (Sos1)
   05224 Breast cancer
    20662 (Sos1)
   05223 Non-small cell lung cancer
    20662 (Sos1)
  09172 Infectious disease: viral
   05161 Hepatitis B
    20662 (Sos1)
   05160 Hepatitis C
    20662 (Sos1)
   05163 Human cytomegalovirus infection
    20662 (Sos1)
   05165 Human papillomavirus infection
    20662 (Sos1)
  09165 Substance dependence
   05034 Alcoholism
    20662 (Sos1)
  09176 Drug resistance: antineoplastic
   01521 EGFR tyrosine kinase inhibitor resistance
    20662 (Sos1)
   01522 Endocrine resistance
    20662 (Sos1)
Motif
Pfam: RasGEF RasGEF_N RhoGEF SOS1_NGEF_PH PH PH_19 PH_10 IQ_SEC7_PH RHG20_PH DUF6018 Takusan
Other DBs
NCBI-GeneID: 20662
NCBI-ProteinID: NP_033257
MGI: 98354
Ensembl: ENSMUSG00000024241
UniProt: Q62245 Q3USK4
Position
17:complement(80701181..80787882)
AA seq 1319 aa
MQAQQLPYEFFSEENAPKWRGLLVPALKKVQGQVHPTLESNDDALQYVEELILQLLNMLC
QAQPRSASDVEERVQKSFPHPIDKWAIADAQSAIEKRKRRNPLSLPAERIHHLLREVLGY
KIDHQVSVYIVAVLEYISADILKLVGNYVRNIRHYEITKQDIKVAMCADKVLMDMFHQDV
EDINILSLTDEEPSTSGEQTYYDLVKAFMAEIRQYIRELNLIIKVFREPFVSNSKLFSSN
DVENIFSRIVDIHELSVKLLGHIEDTVEMTDEGSPHPLVGSCFEDLAEELAFDPYESYAR
DILRPGFHGHFLSQLSKPGAALYLQSIGEGFKEAVQYVLPRLLLAPVYHCLHYFELLKQL
EEKSEDQEDKECMKQAITALLNVQSGMEKICSKSLAKRRLSESACRFYSQQMKGKQLAIK
KMNEIQKNIDGWEGKDIGQCCNEFIMEGTLTRVGAKHERHIFLFDGLMICCKSNHGQPRL
PGASSAEYRLKEKFFMRKVQINDKDDTSEYKHAFEIILKDGNSVIFSAKSAEEKNNWMAA
LISLQYRSTLERMLDVTVLQEEKEEQMRLPSAEVYRFAEPDSEENILFEENVQPKAGIPI
IKAGTVLKLIERLTYHMYADPNFVRTFLTTYRSFCRPQELLSLLIERFEIPEPEPTEADR
IAIENGDQPLSAELKRFRKEYIQPVQLRVLNVCRHWVEHHFYDFERDADLLQRMEEFIGT
VRGKAMKKWVESITKIIQRKKIARDNGPGHNITFQSSPPTVEWHISRPGHIETFDLLTLH
PIEIARQLTLLESDLYRAVQPSELVGSVWTKEDKEINSPNLLKMIRHTTNLTLWFEKCIV
ETENLEERVAVVSRIIEILQVFQELNNFNGVLEVVSAMNSSPVYRLDHTFEQIPSRQKKI
LEEAHELSEDHYKKYLAKLRSINPPCVPFFGIYLTNILKTEEGNPEVLRRHGKELINFSK
RRRVAEITGEIQQYQNQPYCLRVEPDIKRFFENLNPMGNSMEKEFTDYLFNKSLEIEPRH
PKPLPRFPKKYSYPLKSPGVRPSNPRPGTMRHPTPLQQEPRKISYSRIPESETESTASAP
NSPRTPLTPPPASGTSSNTDVCSVFDSDHSASPFHSRSASVSSISLSKGTDEVPVPPPVP
PRRRPESAPAESSPSKIMSKHLDSPPAIPPRQPTSKAYSPRYSISDRTSISDPPESPPLL
PPREPVRTPDVFSSSPLHLQPPPLGKKSDHGNAFFPNSPSPFTPPPPQTPSPHGTRRHLP
SPPLTQEMDLHSIAGPPVPPRQSTSQLIPKLPPKTYKREHTHPSMHRDGPPLLENAHSS
NT seq 3960 nt
atgcaggcgcagcagctgccttacgagtttttcagcgaggagaacgcgcccaagtggcgg
gggctgctggtgcctgcgctgaaaaaggttcaggggcaagttcaccctactcttgagtct
aatgatgatgctcttcagtatgttgaagaattaattttgcaattactaaatatgctatgc
caagctcagccccggagtgcttcagatgtggaggaacgtgttcaaaagagttttcctcat
ccaattgataagtgggcaatagctgatgcccaatcagccattgaaaagaggaagagacga
aatcctttatcgctgccagcagaaagaattcatcatttattaagggaggtcctcggttat
aaaattgaccaccaggtttctgtttacatagtagcagtattagaatacatttctgcagat
attttaaagctcgtggggaattatgtaagaaatatacggcattatgaaattacaaaacaa
gacattaaagtggcaatgtgtgctgataaggtattgatggatatgtttcatcaagatgta
gaagatataaatatcttatctttaactgatgaagagccttccacctcaggagaacaaact
tattatgatttggtaaaagcattcatggcagaaattcgacagtatataagagaattaaat
ctaattataaaagtttttcgagagccctttgtctctaattccaaattgttttcatctaat
gatgtagaaaacatattcagtcgtatagtagatatacatgaacttagtgtaaagttactg
ggccatatagaagatactgtagaaatgacagatgaaggcagtccccacccattagtagga
agctgttttgaagacttagcagaagaactggcatttgacccgtatgagtcatatgctcgg
gatattttacgacccggattccatggccattttcttagtcagttatcaaagcctggggca
gcactttatttgcagtccataggcgaaggcttcaaagaagctgtccagtacgtcctgccc
cggctgctgcttgcccctgtgtaccactgtctgcattactttgaacttctgaagcagtta
gaagaaaagagtgaagatcaagaagacaaggagtgtatgaagcaagcaataacagccctg
cttaatgtccaaagtggcatggaaaaaatttgctccaaaagtcttgcaaaacgaagacta
agtgagtctgcatgtcggttttacagccagcagatgaaggggaaacagctagccatcaag
aagatgaacgagatccagaagaacattgatggctgggaggggaaggacattggacagtgt
tgcaatgagttcataatggaaggaactcttacacgtgtaggagccaaacacgagagacac
atatttctcttcgatggcttaatgatttgctgtaaatcaaaccatgggcagccaagactc
cctggtgctagcagtgcagaataccggcttaaagaaaagttttttatgcgaaaggtacag
attaatgataaagatgacaccagtgagtacaagcatgcttttgaaatcattctgaaagat
ggcaatagtgttatattttctgccaagtcagctgaagagaaaaacaactggatggcagca
ctgatctctttgcagtaccgcagcaccctggagaggatgctggacgtaacggtgctgcag
gaggagaaggaggagcagatgaggctgcccagtgctgaagtgtacaggtttgcagaacct
gactccgaggagaatattctattcgaagagaatgtgcagcccaaagctgggatccccatt
atcaaggcagggacagtgcttaagctcattgagaggcttacctaccacatgtacgcagat
ccaaattttgttcggacgtttcttacaacatacaggtccttttgcagacctcaagaacta
ctgagtcttctgatagaaagatttgaaattccagagcctgagccaacagaagctgatcgc
atagctatagagaatggagatcagcccctgagtgcagagctgaagaggtttagaaaggaa
tatattcagcctgtgcagttgagggtgttaaatgtgtgtcggcactgggtggagcaccat
ttctatgactttgaaagagatgcagaccttttacagagaatggaggaatttattggaaca
gtaagaggtaaagcaatgaaaaaatgggtcgaatccatcactaagataatccaaaggaaa
aaaattgcaagagacaatggcccaggtcataacattacatttcagagctcacctcccaca
gttgagtggcacataagcagacctgggcacatagagacttttgacttgctcaccttacac
ccaatagaaattgctcggcaactcactttacttgaatcagatctataccgggctgtgcag
ccatcagaattagttggaagtgtgtggacaaaagaagataaagaaattaattctcccaac
cttctgaagatgattcggcacaccactaacctcactttgtggtttgagaaatgtattgta
gaaacagaaaacttagaagaaagagtagctgtagtaagtcggataattgagattctacaa
gtctttcaagagctgaacaacttcaatggtgtcctggaagttgtcagtgctatgaactcg
tcacctgtttacagactagaccacacatttgagcaaataccaagcagacaaaagaaaatt
ttagaagaagctcatgaattgagtgaagatcactataagaaatatttggcaaaactcagg
tctattaatccaccgtgtgtgcctttctttggaatttatctcacaaatatcctgaagaca
gaagagggcaaccctgaggtcctgaggagacacgggaaagagcttattaacttcagcaag
aggaggagagtggccgagatcacaggcgagatccagcagtaccagaaccagccctactgc
ttacgggtggagccggacatcaagaggttctttgaaaacttgaatccaatgggaaacagc
atggagaaagaatttacagactatctgttcaacaaatccctagaaatagaaccccggcac
cctaagcctcttccgagattcccaaaaaaatacagctatcccctaaaatctcctggtgtt
cgtccatcaaatccaagaccaggaaccatgagacatcccacacctctgcagcaggagcca
agaaaaattagctacagtcggattcctgaaagtgagacggaaagcacagcatctgcacca
aactcccctcggaccccactgacgccgccccctgcatctggcacctccagcaacacagat
gtttgcagcgtgttcgattctgaccactcggcaagcccttttcattcaagatctgcttca
gtctcatctataagtttatccaagggcactgatgaagtgcctgtcccccctcctgtaccc
cctcgaagacgtccagagtctgccccagctgaatcctccccatccaagattatgtctaag
cacttggacagccccccagctattcctcctaggcaacccacatccaaagcctattcacca
cgctattcaatatcagatcggacctctatatcagatcctcctgaaagccctcccttgtta
ccaccacgggaacctgtgaggacacctgatgttttctcaagctcaccattacatctccaa
cctcctcctttgggcaaaaagagtgatcatggcaacgccttcttcccaaacagcccatcc
ccttttacaccgccacccccccaaaccccctctcctcatggcacgagaaggcatctgcca
tcaccaccactgacacaggagatggacctccattccattgctgggcctcctgttcctcca
cgacaaagcacttctcaacttatccccaaactccctccaaaaacttacaaaagggagcac
acacacccatccatgcatagagatggaccaccactgctggagaatgcccattcttcctga

KEGG   Mus musculus (house mouse): 20663
Entry
20663             CDS       T01002                                 
Symbol
Sos2, SOS-2, mSOS-2
Name
(RefSeq) SOS Ras/Rho guanine nucleotide exchange factor 2
  KO
K03099  son of sevenless
Organism
mmu  Mus musculus (house mouse)
Pathway
mmu01521  EGFR tyrosine kinase inhibitor resistance
mmu01522  Endocrine resistance
mmu04010  MAPK signaling pathway
mmu04012  ErbB signaling pathway
mmu04014  Ras signaling pathway
mmu04062  Chemokine signaling pathway
mmu04068  FoxO signaling pathway
mmu04072  Phospholipase D signaling pathway
mmu04150  mTOR signaling pathway
mmu04151  PI3K-Akt signaling pathway
mmu04510  Focal adhesion
mmu04540  Gap junction
mmu04630  JAK-STAT signaling pathway
mmu04650  Natural killer cell mediated cytotoxicity
mmu04660  T cell receptor signaling pathway
mmu04662  B cell receptor signaling pathway
mmu04664  Fc epsilon RI signaling pathway
mmu04714  Thermogenesis
mmu04722  Neurotrophin signaling pathway
mmu04810  Regulation of actin cytoskeleton
mmu04910  Insulin signaling pathway
mmu04912  GnRH signaling pathway
mmu04915  Estrogen signaling pathway
mmu04917  Prolactin signaling pathway
mmu04926  Relaxin signaling pathway
mmu04935  Growth hormone synthesis, secretion and action
mmu05034  Alcoholism
mmu05160  Hepatitis C
mmu05161  Hepatitis B
mmu05163  Human cytomegalovirus infection
mmu05165  Human papillomavirus infection
mmu05200  Pathways in cancer
mmu05205  Proteoglycans in cancer
mmu05206  MicroRNAs in cancer
mmu05207  Chemical carcinogenesis - receptor activation
mmu05208  Chemical carcinogenesis - reactive oxygen species
mmu05210  Colorectal cancer
mmu05211  Renal cell carcinoma
mmu05213  Endometrial cancer
mmu05214  Glioma
mmu05215  Prostate cancer
mmu05220  Chronic myeloid leukemia
mmu05221  Acute myeloid leukemia
mmu05223  Non-small cell lung cancer
mmu05224  Breast cancer
mmu05225  Hepatocellular carcinoma
mmu05226  Gastric cancer
mmu05231  Choline metabolism in cancer
Brite
KEGG Orthology (KO) [BR:mmu00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04010 MAPK signaling pathway
    20663 (Sos2)
   04012 ErbB signaling pathway
    20663 (Sos2)
   04014 Ras signaling pathway
    20663 (Sos2)
   04630 JAK-STAT signaling pathway
    20663 (Sos2)
   04068 FoxO signaling pathway
    20663 (Sos2)
   04072 Phospholipase D signaling pathway
    20663 (Sos2)
   04151 PI3K-Akt signaling pathway
    20663 (Sos2)
   04150 mTOR signaling pathway
    20663 (Sos2)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    20663 (Sos2)
   04540 Gap junction
    20663 (Sos2)
  09142 Cell motility
   04810 Regulation of actin cytoskeleton
    20663 (Sos2)
 09150 Organismal Systems
  09151 Immune system
   04650 Natural killer cell mediated cytotoxicity
    20663 (Sos2)
   04660 T cell receptor signaling pathway
    20663 (Sos2)
   04662 B cell receptor signaling pathway
    20663 (Sos2)
   04664 Fc epsilon RI signaling pathway
    20663 (Sos2)
   04062 Chemokine signaling pathway
    20663 (Sos2)
  09152 Endocrine system
   04910 Insulin signaling pathway
    20663 (Sos2)
   04912 GnRH signaling pathway
    20663 (Sos2)
   04915 Estrogen signaling pathway
    20663 (Sos2)
   04917 Prolactin signaling pathway
    20663 (Sos2)
   04926 Relaxin signaling pathway
    20663 (Sos2)
   04935 Growth hormone synthesis, secretion and action
    20663 (Sos2)
  09156 Nervous system
   04722 Neurotrophin signaling pathway
    20663 (Sos2)
  09159 Environmental adaptation
   04714 Thermogenesis
    20663 (Sos2)
 09160 Human Diseases
  09161 Cancer: overview
   05200 Pathways in cancer
    20663 (Sos2)
   05206 MicroRNAs in cancer
    20663 (Sos2)
   05205 Proteoglycans in cancer
    20663 (Sos2)
   05207 Chemical carcinogenesis - receptor activation
    20663 (Sos2)
   05208 Chemical carcinogenesis - reactive oxygen species
    20663 (Sos2)
   05231 Choline metabolism in cancer
    20663 (Sos2)
  09162 Cancer: specific types
   05210 Colorectal cancer
    20663 (Sos2)
   05225 Hepatocellular carcinoma
    20663 (Sos2)
   05226 Gastric cancer
    20663 (Sos2)
   05214 Glioma
    20663 (Sos2)
   05221 Acute myeloid leukemia
    20663 (Sos2)
   05220 Chronic myeloid leukemia
    20663 (Sos2)
   05211 Renal cell carcinoma
    20663 (Sos2)
   05215 Prostate cancer
    20663 (Sos2)
   05213 Endometrial cancer
    20663 (Sos2)
   05224 Breast cancer
    20663 (Sos2)
   05223 Non-small cell lung cancer
    20663 (Sos2)
  09172 Infectious disease: viral
   05161 Hepatitis B
    20663 (Sos2)
   05160 Hepatitis C
    20663 (Sos2)
   05163 Human cytomegalovirus infection
    20663 (Sos2)
   05165 Human papillomavirus infection
    20663 (Sos2)
  09165 Substance dependence
   05034 Alcoholism
    20663 (Sos2)
  09176 Drug resistance: antineoplastic
   01521 EGFR tyrosine kinase inhibitor resistance
    20663 (Sos2)
   01522 Endocrine resistance
    20663 (Sos2)
Motif
Pfam: RasGEF RasGEF_N RhoGEF SOS1_NGEF_PH PH IQ_SEC7_PH PH_19 RHG20_PH
Other DBs
NCBI-GeneID: 20663
NCBI-ProteinID: NP_001129031
MGI: 98355
Ensembl: ENSMUSG00000034801
UniProt: A0A0A0MQ87
Position
12:complement(69630535..69728626)
AA seq 1332 aa
MQQAPQPYEFFSEENSPKWRGLLVPALRKVQEQVHPTLSANEESLYYIEELIFQLLNKLC
MAQPRTVQDVEERVQKTFPHPIDKWAIADAQSAIEKRKRRNPLLLPVDKIHPSLKEVLGY
KVDYHVSLYIVAVLEYISADILKLAGNYVFNIRHYEISQQDIKVSMCADKVLMDMFDQDD
DIGLVSLCEDEPCSSGELNYYDLVRTEIAEERQYLRELNMIIKVFREAFLLDRKLFKPSE
IEKIFSNISDIHELTVKLLGLIEDTVEMTDESSPHPLAGSCFEDLAEEQAFDPYETLSQD
ILAPEFNDHFSKLMARPAVALHFQSIADGFKEAVRYVLPRLMLVPVYHCWHYFELLKLKA
CSEEQEDKECLNQAITALMNLQGSMDRIYKQHSPRRRPGDPVCLFYNRQLRSKHLAIKKM
NEIQKNIDGWEGKDIGQCCNEFIMEGPLTRIGAKHERHIFLFDGLMISCKPNHGQTRLPG
YSSAEYRLKEKFVMRKIQICDKEDACEYRHAFELVSKDENSVIFAAKSAEEKNNWMAALI
SLHYRSTLDRMLDSVLLKEENEQPLRLPSPDMYRFVVTDSEENIVFEDNLQSRSGIPIIK
GGTVVKLIERLTYHMYADPNFVRTFLTTYRSFCKPQELLNLLIERFEIPEPEPTEADKLA
LEKGEQPISADLKRFRKEYVQPVQLRVLNVFRHWVEHHYYDFERDLELLERLESFISSVR
GKAMKKWVESIAKIIKRKKQAQANGISHNITFESSPPPVEWHISRTGQFETFDLMTLHPI
EIARQLTLLESDLYRKVQPSELVGSVWTKEDKEINSPNLLKMIRHTTNLTLWFEKCIVEA
ENFEERVAVLSRIVEILQVFQDLNNFNGVLEIVSAVNSVSVYRLDHTFEALQERKRRILD
DAVELSQDHFKKYLVKLKSINPPCVPFFGIYLTNILKTEEGNSDFLKRKGKDLINFSKRR
KVAEITGEIQQYQNQPYCLRTEPEMRRFFENLNPMGILSEKEFTDYLFNKSLEIEPRNCK
QPPRFPRKSTFSLKSPGIRPNAGRHGSTSGTLRGHPTPLEREPYKISFSRIAETELESTV
SAPTSPNTPSTPPVSASSDHSVFLDVDLNSSCGSNTIFAPVLLPHSKTFFSSCGSLHKLS
EEPLIPPPLPPRKKFDHDALNSKGAVKSDDDPPAIPPRQPPPPKVKPRVPVLMGTFDGPV
PSPPPPPPRDPLPDTPPPVPLRPPEHFINCPFNLQPPPLGHPHRDPDWLRDVSTCPNSPS
TPPTTPSPRIPRSCHLLSSSHSSLAHLPAPPVPPRQNSSPLLPKLPPKTYKRELSHPPLY
RLPLLENAETPQ
NT seq 3999 nt
atgcagcaggcgccgcagccctacgagttcttcagcgaagaaaacagcccgaaatggcgg
ggactattggtcccggccctgcggaaggttcaggagcaagtacatcccaccctctcagct
aatgaagagtctctctattatattgaagaactgatttttcagctgcttaataagctatgc
atggctcaaccaaggactgttcaagatgttgaggaacgagttcaaaagacctttcctcat
cctattgataaatgggcaattgctgatgcacaatctgctatagagaaacgaaaacgaaga
aatcctctcttactacctgtggacaaaatccatccttccttgaaggaagttttggggtat
aaagtggactaccatgtgtccctctacattgtggctgtattggagtatatctcagcagat
attttgaaattggctggtaattatgtttttaatatccggcattatgaaatatctcagcaa
gacattaaagtgtccatgtgtgcagataaggttttgatggacatgttcgatcaggatgat
gatataggcttggtttctctctgtgaagatgagccttgttcttctggtgagctaaactat
tatgacctcgtcaggactgaaattgcagaagaaagacagtatctacgggagctgaatatg
atcattaaagtgttccgggaagcctttctcttggacagaaagttgttcaagccttctgaa
attgaaaagattttcagtaacatttcagatatacatgaattgactgtgaaacttttaggt
ttaattgaagacacagtagaaatgacagatgaaagtagtcctcatccattagctggtagc
tgttttgaagatttagcagaggagcaagcgtttgatccctatgaaacattatcacaggac
attcttgcaccagagtttaatgaccacttcagcaagttgatggccagacctgcagtcgct
ctacattttcagtccattgctgacggctttaaggaggctgttcgttatgtccttccacgc
ctcatgctggttcccgtgtatcactgttggcattactttgaattattaaagttgaaggca
tgcagtgaagagcaggaggacaaagagtgcttgaatcaggctataactgccctcatgaac
ctccaaggcagcatggaccgcatttacaagcagcactcccccagacgccggcctggggat
ccagtttgccttttttacaatcgtcaattaagaagcaaacacctggctatcaaaaaaatg
aatgaaattcagaaaaacatagatgggtgggaaggcaaagatatcggacagtgttgtaat
gagttcataatggaggggccactgaccagaattggtgctaaacacgaaaggcatatcttt
ctctttgatggcttaatgatcagctgtaaacccaatcatggccagacccggcttccagga
tatagcagtgcagaatacagattaaaggagaagtttgtcatgaggaaaattcaaatctgt
gataaggaagacgcctgtgagtacagacatgcttttgaattagtgtccaaagatgaaaac
agtgtaatatttgctgccaaatcagctgaagagaaaaacaactggatggcagccctcatt
tccctgcactatcgcagcactctagacagaatgctggactctgtgctgctgaaagaagag
aatgagcagcccctgaggctacccagtccagatatgtatcgctttgtggtaacagactct
gaggaaaacattgtgtttgaagacaacttgcaaagcagaagtgggatccccataattaaa
ggaggcactgtggtgaagttgatcgaaaggctaacataccacatgtatgcagatcccaat
tttgttcgtacttttcttactacatatcgttcattttgtaaaccacaggaattgctaaac
ttgctgatagaacggtttgaaattccagaaccagaacctactgaggcagacaagctggcg
ttagaaaaaggcgagcagccaatcagcgcagatctgaaaagattccgcaaggaatacgtc
caacctgtgcagcttagggtcttgaatgtctttcgccactgggttgagcatcattattat
gactttgaaagagacttggaactgcttgaaagactagaatccttcatttcaagtgtaaga
gggaaagccatgaagaaatgggtagaatccattgctaaaataatcaagaggaagaagcaa
gctcaggcaaatggaataagccataatatcacctttgaaagttcccccccaccagtggaa
tggcacatcagtagaacaggacagttcgaaacatttgaccttatgacacttcatccaata
gagatcgcacggcagctaacacttttggaatctgacctctacaggaaagtccagccctct
gaacttgtagggagtgtctggaccaaagaagataaagaaataaattctccaaacttatta
aaaatgattcgccatacaacaaacctcaccctatggtttgagaaatgcattgtggaagca
gaaaactttgaagaacgggtggcagtgctcagcagaatagtagaaattctgcaagtattt
caagacttgaataatttcaatggcgtgttggagatagtgagtgcagtcaactccgtgtca
gtgtacaggctagaccacacgtttgaggcactgcaggaaaggaagcggagaattttggat
gacgctgtggaactaagtcaggaccactttaaaaagtacctagtaaaacttaagtcaatc
aatccgccttgtgtgcctttttttggaatatatttaacaaatattctgaagactgaagaa
gggaacagtgactttctaaagaggaaagggaaagatttgatcaatttcagtaagaggagg
aaagtggctgaaataactggagagatccagcagtatcagaaccaaccgtactgcttacgg
acagaaccagaaatgaggagattctttgaaaacctcaaccccatgggaattttatctgaa
aaagagtttacagattatttgttcaacaaatcattagaaatcgaaccccgaaactgcaaa
caaccacctcgatttcctaggaagtcaaccttttccttaaaatctcctggaataaggccc
aatgctggccgccatggctctacctcaggcacgctacgaggtcacccaacgcctctggaa
agagagccttataagataagctttagccggatcgctgagacagagctagaatcaacagtg
tctgcaccaacctcccccaacactccatccaccccaccagtgtctgcttcttcagaccac
agcgtgtttctagatgtggacctcaatagctcctgtggcagcaacaccatctttgctcca
gtcctcttgccacactcaaagactttcttcagctcatgtggaagtttacacaaactgagt
gaagagccactaattcctcctccgcttccccctcggaaaaagtttgatcatgatgctctc
aattccaagggagctgtgaaatctgatgatgaccctcctgctattccaccaagacagccc
cctcctccgaaggtaaagccaagagttcctgtcctcatgggtacatttgatgggcctgtg
cccagtccacctccacctcctccaagagaccctcttcctgatacccctccaccagttcct
cttcggcctccggaacactttataaactgtccatttaatcttcagccgcctccactgggc
catcctcacagagacccagactggctcagagacgtcagcacgtgtcctaactcaccaagc
actcctcccactacgccctctccacggattccacgcagctgtcacttgctcagctccagt
cacagcagccttgctcatcttccagctcctcctgtcccaccaaggcagaattcaagccct
ctcttaccaaagctgccaccaaagacttacaaacgggagctttcccacccgccactgtat
agactgcctctgctggaaaatgcagaaactcctcaatga
