KEGG   Rattus norvegicus (rat): 313845
Entry
313845            CDS       T01003                                 

Gene name
Sos1
Definition
(RefSeq) SOS Ras/Rac guanine nucleotide exchange factor 1
KO
K03099  son of sevenless
Organism
rno  Rattus norvegicus (rat)
Pathway
rno01521  EGFR tyrosine kinase inhibitor resistance
rno01522  Endocrine resistance
rno04010  MAPK signaling pathway
rno04012  ErbB signaling pathway
rno04014  Ras signaling pathway
rno04062  Chemokine signaling pathway
rno04068  FoxO signaling pathway
rno04072  Phospholipase D signaling pathway
rno04150  mTOR signaling pathway
rno04151  PI3K-Akt signaling pathway
rno04510  Focal adhesion
rno04540  Gap junction
rno04630  JAK-STAT signaling pathway
rno04650  Natural killer cell mediated cytotoxicity
rno04660  T cell receptor signaling pathway
rno04662  B cell receptor signaling pathway
rno04664  Fc epsilon RI signaling pathway
rno04714  Thermogenesis
rno04722  Neurotrophin signaling pathway
rno04810  Regulation of actin cytoskeleton
rno04910  Insulin signaling pathway
rno04912  GnRH signaling pathway
rno04915  Estrogen signaling pathway
rno04917  Prolactin signaling pathway
rno04926  Relaxin signaling pathway
rno04935  Growth hormone synthesis, secretion and action
rno05034  Alcoholism
rno05160  Hepatitis C
rno05161  Hepatitis B
rno05163  Human cytomegalovirus infection
rno05165  Human papillomavirus infection
rno05200  Pathways in cancer
rno05205  Proteoglycans in cancer
rno05206  MicroRNAs in cancer
rno05210  Colorectal cancer
rno05211  Renal cell carcinoma
rno05213  Endometrial cancer
rno05214  Glioma
rno05215  Prostate cancer
rno05220  Chronic myeloid leukemia
rno05221  Acute myeloid leukemia
rno05223  Non-small cell lung cancer
rno05224  Breast cancer
rno05225  Hepatocellular carcinoma
rno05226  Gastric cancer
rno05231  Choline metabolism in cancer
Brite
KEGG Orthology (KO) [BR:rno00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04010 MAPK signaling pathway
    313845 (Sos1)
   04012 ErbB signaling pathway
    313845 (Sos1)
   04014 Ras signaling pathway
    313845 (Sos1)
   04630 JAK-STAT signaling pathway
    313845 (Sos1)
   04068 FoxO signaling pathway
    313845 (Sos1)
   04072 Phospholipase D signaling pathway
    313845 (Sos1)
   04151 PI3K-Akt signaling pathway
    313845 (Sos1)
   04150 mTOR signaling pathway
    313845 (Sos1)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    313845 (Sos1)
   04540 Gap junction
    313845 (Sos1)
  09142 Cell motility
   04810 Regulation of actin cytoskeleton
    313845 (Sos1)
 09150 Organismal Systems
  09151 Immune system
   04650 Natural killer cell mediated cytotoxicity
    313845 (Sos1)
   04660 T cell receptor signaling pathway
    313845 (Sos1)
   04662 B cell receptor signaling pathway
    313845 (Sos1)
   04664 Fc epsilon RI signaling pathway
    313845 (Sos1)
   04062 Chemokine signaling pathway
    313845 (Sos1)
  09152 Endocrine system
   04910 Insulin signaling pathway
    313845 (Sos1)
   04912 GnRH signaling pathway
    313845 (Sos1)
   04915 Estrogen signaling pathway
    313845 (Sos1)
   04917 Prolactin signaling pathway
    313845 (Sos1)
   04926 Relaxin signaling pathway
    313845 (Sos1)
   04935 Growth hormone synthesis, secretion and action
    313845 (Sos1)
  09156 Nervous system
   04722 Neurotrophin signaling pathway
    313845 (Sos1)
  09159 Environmental adaptation
   04714 Thermogenesis
    313845 (Sos1)
 09160 Human Diseases
  09161 Cancer: overview
   05200 Pathways in cancer
    313845 (Sos1)
   05206 MicroRNAs in cancer
    313845 (Sos1)
   05205 Proteoglycans in cancer
    313845 (Sos1)
   05231 Choline metabolism in cancer
    313845 (Sos1)
  09162 Cancer: specific types
   05210 Colorectal cancer
    313845 (Sos1)
   05225 Hepatocellular carcinoma
    313845 (Sos1)
   05226 Gastric cancer
    313845 (Sos1)
   05214 Glioma
    313845 (Sos1)
   05221 Acute myeloid leukemia
    313845 (Sos1)
   05220 Chronic myeloid leukemia
    313845 (Sos1)
   05211 Renal cell carcinoma
    313845 (Sos1)
   05215 Prostate cancer
    313845 (Sos1)
   05213 Endometrial cancer
    313845 (Sos1)
   05224 Breast cancer
    313845 (Sos1)
   05223 Non-small cell lung cancer
    313845 (Sos1)
  09165 Substance dependence
   05034 Alcoholism
    313845 (Sos1)
  09172 Infectious disease: viral
   05161 Hepatitis B
    313845 (Sos1)
   05160 Hepatitis C
    313845 (Sos1)
   05163 Human cytomegalovirus infection
    313845 (Sos1)
   05165 Human papillomavirus infection
    313845 (Sos1)
  09176 Drug resistance: antineoplastic
   01521 EGFR tyrosine kinase inhibitor resistance
    313845 (Sos1)
   01522 Endocrine resistance
    313845 (Sos1)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   04990 Domain-containing proteins not elsewhere classified [BR:rno04990]
    313845 (Sos1)
Domain-containing proteins not elsewhere classified [BR:rno04990]
 Pleckstrin homology (PH) domain-containing proteins
  Dbl-Like RhoGEF family proteins
   313845 (Sos1)
Motif
Pfam: RasGEF RasGEF_N RhoGEF Histone PH PH_19 PH_10 IQ_SEC7_PH PH_13
Other DBs
NCBI-GeneID: 313845
NCBI-ProteinID: NP_001094186
RGD: 1310949
Ensembl: ENSRNOG00000007106
UniProt: D4A3T0
Position
6q11
AA seq 1319 aa
MQAQQLPYEFFSEENAPKWRGLLVPALKKVQGQVHPTLESNEDALQYVEELILQLLNMLC
QAQPRSASDVEERVQKSFPHPIDKWAIADAQSAIEKRKRRNPLSLPAERIHHLLREVLGY
KIDHQVSVYIVAVLEYISADILKLVGNYVRNIRHYEITKQDIKVAMCADKVLMDMFHQDV
EDINILSLTDEEPSTSGEQTYYDLVKAFMAEIRQYIRELNLIIKVFREPFVSNSKLFSSY
DVENIFSRIVDIHELSVKLLGHIEDTVEMTDEGSPHPLVGSCFEDLAEELAFDPYESYAR
DVLRPGFHERFLSQLSKPGAALYLQSIGEGFKEAVQYVLPRLLLAPVYHCLHYFELLKQL
EEKSEDQEDKECMKQAITALLNVQSGMEKICSKNLAKRRLSESACRFYSQQMKGKQLAIK
KMNEIQKNIDGWEGKDIGQCCNEFIMEGTLTRVGAKHERHIFLFDGLMICCKSNHGQPRL
PGASNAEYRLKEKFFMRKVQINDKDDTSEYKHAFEIILKDGNSVIFSAKSAEEKNNWMAA
LISLQYRSTLERMLDVTMLQEEKEEHMRLPSADVYRFAEPDSEENILFEENVQPKAGIPI
IKAGTVVKLIERLTYHMYADPNFVRTFLTTYRSFCKPQELLNLLIERFEIPEPEPTEADR
IAIENGDQPLSAELKRFRKEYIQPVQLRVLNVCRHWVEHHFYDFERDVDLLQRMEEFIGT
VRGKAMKKWVESITKIIQRKKIARDNGPGHNITFQNSPPTVEWHISRPGHIETFDLLTLH
PIEIARQLTLLESDLYRAVQPSELVGSVWTKEDKEINSPNLLKMIRHTTNLTLWFEKCIV
ETENLEERVAVVSRIIEILQVFQELNNFNGVLEVVSAMNSSPVYRLDHTFEQIPSRQKKI
LEEAHELSEDHYKKYLAKLRSINPPCVPFFGIYLTNILKTEEGNPEVLRRHGKELINFSK
RRRVAEITGEIQQYQNQPYCLRVESDIKRFFENLNPMGNSMEKEFTDYLFNKSLEIEPRN
PKPLPRFPKKYSYPLKSPGVRPSNPRPGTMRHPTPLQQEPRKISYSRIPESETESTASAP
NSPRTPLTPPPASSASSNTDVCSVFDSDHSASPFHSRSASVSSISLSKGTEEVPVPPPVP
PRRRPESAPAESSPSKIMSKHLDSPPAIPPRQPTSKAYSPRYSISDRTSISDPPESPPLL
PPREPVRTPDVFSSSPLHLQPPPLGKKSDHGNAFFPNSPSPFTPPPPQTPSPHGTRRHLP
SPPLTQEVDLHSIAGPPVPPRQSTSQLIPKLPPKTYKREHTHPSMHRDGPPLLENAHSS
NT seq 3960 nt
atgcaggcgcagcagctgccctacgagtttttcagcgaggagaacgcgcccaagtggcgg
gggctgctggtgcctgcgctgaaaaaggttcaggggcaagttcaccctactcttgagtct
aatgaggatgctcttcagtatgttgaagaattaattttgcaattactaaatatgctatgc
caagctcagccccggagtgcttcagatgtggaggaacgtgttcaaaaaagtttccctcat
ccaattgataagtgggcaatagctgatgcccaatcagctattgaaaagaggaagagaagg
aaccctttatccctgccagcagaaagaattcatcatttattaagggaggtcctgggttat
aaaattgaccaccaggtttctgtttacatagtagcagtattagaatacatttctgcagat
attttaaagctggtggggaattatgtaagaaatatacggcattatgaaattacaaaacaa
gacattaaagtggcaatgtgtgctgataaggtattgatggatatgtttcatcaagatgta
gaagatataaatatcttatctttaactgatgaagaaccttccacctcaggagaacaaact
tactatgatttggtaaaagcattcatggcagaaattcgacaatatataagagaattaaat
ctaattataaaggtttttcgagaaccctttgtctctaattccaagttgttttcatcttat
gatgtagaaaacatatttagtcgtatagtggatatccatgaacttagtgtaaagttactg
ggccatatagaagacactgtagagatgacagatgaaggcagtccccacccgttagtcgga
agctgttttgaagacttagcagaagaactggcctttgacccatatgagtcctatgctcgg
gatgttctgcggcccggattccatgagcgttttctcagtcagctatcaaagcctggggca
gcactttatctgcagtccataggtgaaggcttcaaagaggccgttcagtatgtcctgccc
cggctgctgctcgcccctgtgtaccactgtctgcattactttgaacttctgaagcagtta
gaagaaaagagcgaagatcaagaagataaggagtgtatgaaacaagcaataacagccctg
cttaatgtccaaagtggcatggaaaaaatttgctccaaaaatcttgcaaaacgaagactg
agtgagtctgcatgtcggttttacagccagcaaatgaaggggaaacaactggccatcaag
aagatgaatgagattcagaagaacattgatggttgggaggggaaagacattggacaatgc
tgcaatgagtttataatggaaggaactcttacacgtgtaggagccaaacacgagagacac
atatttctcttcgatggcttaatgatttgctgtaaatcaaatcatgggcagccaagactt
cctggtgctagcaatgcagaatatcggcttaaagaaaagttttttatgcgaaaggtacag
attaacgataaggatgacaccagtgaatacaagcacgctttcgaaataattctgaaggat
ggaaatagtgttatattttctgccaagtcagctgaagagaagaataactggatggcagcc
ctgatatccctgcagtaccgcagcacgctggagagaatgctggacgttacaatgctgcag
gaggagaaggaggagcacatgaggctgccgagtgctgacgtgtacaggtttgcagagccc
gactctgaggagaacattctgtttgaagagaacgtgcagcccaaggctggcatccccatc
atcaaggcagggacagtggttaagctgattgagaggctcacgtaccacatgtacgcagat
ccaaattttgttcggacatttcttacaacatacagatcattttgcaaacctcaagaacta
ctgaatcttctaatagaaagatttgaaattccagagcctgagccaacagaagctgatcgc
atagctatagagaatggagatcagcccctgagcgcagagctgaaaaggtttagaaaggag
tatatccagcctgtacagctgagggtgttaaacgtgtgtcggcactgggtggagcaccat
ttctatgactttgaaagagatgtagaccttttacagagaatggaggaatttattggaaca
gtaagaggtaaagcaatgaaaaaatgggtcgaatccatcactaagataatccaaaggaaa
aaaattgcaagagacaatggcccaggtcataatattacatttcagaattcacctcccaca
gttgagtggcacataagcagacctgggcatatagagacttttgacttgctcaccctacac
ccaatagaaattgctcgacaactcactttacttgaatcagatctataccgagctgtgcag
ccatcagaattagttggaagtgtgtggacaaaagaagataaagaaattaactctcctaac
cttctgaaaatgattcggcacaccaccaacctcactctgtggtttgagaaatgtattgta
gaaacagagaacttagaagaaagagtagcagtagtgagtcgaataattgagattctacaa
gtctttcaagagctgaacaacttcaatggtgtcctggaagttgttagtgctatgaactcg
tcacctgtttacagactggaccacacatttgagcaaataccaagtagacaaaagaaaatt
ttagaagaagctcatgagttgagtgaagatcactataagaaatatttggcaaaactcagg
tctattaatccaccatgtgtgcctttctttggaatttatctcacaaatatcttgaaaaca
gaagaaggcaaccctgaggtcctaagaagacatggaaaagagcttatcaacttcagcaag
aggaggagagtggccgagataacaggagagatccagcagtaccaaaaccagccttactgc
ttacgggtagagtcagacatcaagagattctttgaaaacttgaatccaatgggaaatagc
atggagaaggaatttacagactatctattcaacaaatccctagaaatagaaccacggaac
cctaagcctcttccaagatttccaaaaaaatacagctatcccctaaaatctcctggtgtt
cgtccatcaaatccaagaccaggaaccatgaggcaccctacacctctgcagcaggagcca
aggaaaatcagctacagtcggattcctgagagcgagacagaaagtacagcgtcggcacca
aactccccgagaacaccgctgacgccgcctcctgcgtccagcgcctccagtaacacagac
gtctgcagcgtgttcgattccgaccactccgcaagcccttttcactcaagatctgcttca
gtctcatctataagtttatccaagggcaccgaggaagtgcctgtccctcctcctgtcccc
cctcgaagacgaccagagtctgccccagcggaatcctccccatccaagattatgtctaag
cacttggatagccccccagcgattcctcctaggcagcccacatcgaaagcctattcacca
cgatactcaatatcagatcggacctctatatcagatcctcccgagagccctcccttgtta
ccaccacgagaacctgtgaggacacccgatgttttctcaagctcgccattacatctccaa
cctcccccgttgggcaaaaagagtgaccatggcaatgccttcttcccaaacagcccatcc
ccctttacaccgccacctcctcaaaccccgtctcctcacggcacgagaaggcatctgcca
tcaccaccactgacacaggaagtggacctccattccattgctgggcctcccgttcctcca
cggcaaagcacttctcaacttatccccaaactccctccaaaaacttacaaaagggagcac
acacacccatccatgcacagagatggaccgccactgctggagaatgcccattcttcctga

KEGG   Rattus norvegicus (rat): 85384
Entry
85384             CDS       T01003                                 

Gene name
Sos2, Sos1
Definition
(RefSeq) SOS Ras/Rho guanine nucleotide exchange factor 2
KO
K03099  son of sevenless
Organism
rno  Rattus norvegicus (rat)
Pathway
rno01521  EGFR tyrosine kinase inhibitor resistance
rno01522  Endocrine resistance
rno04010  MAPK signaling pathway
rno04012  ErbB signaling pathway
rno04014  Ras signaling pathway
rno04062  Chemokine signaling pathway
rno04068  FoxO signaling pathway
rno04072  Phospholipase D signaling pathway
rno04150  mTOR signaling pathway
rno04151  PI3K-Akt signaling pathway
rno04510  Focal adhesion
rno04540  Gap junction
rno04630  JAK-STAT signaling pathway
rno04650  Natural killer cell mediated cytotoxicity
rno04660  T cell receptor signaling pathway
rno04662  B cell receptor signaling pathway
rno04664  Fc epsilon RI signaling pathway
rno04714  Thermogenesis
rno04722  Neurotrophin signaling pathway
rno04810  Regulation of actin cytoskeleton
rno04910  Insulin signaling pathway
rno04912  GnRH signaling pathway
rno04915  Estrogen signaling pathway
rno04917  Prolactin signaling pathway
rno04926  Relaxin signaling pathway
rno04935  Growth hormone synthesis, secretion and action
rno05034  Alcoholism
rno05160  Hepatitis C
rno05161  Hepatitis B
rno05163  Human cytomegalovirus infection
rno05165  Human papillomavirus infection
rno05200  Pathways in cancer
rno05205  Proteoglycans in cancer
rno05206  MicroRNAs in cancer
rno05210  Colorectal cancer
rno05211  Renal cell carcinoma
rno05213  Endometrial cancer
rno05214  Glioma
rno05215  Prostate cancer
rno05220  Chronic myeloid leukemia
rno05221  Acute myeloid leukemia
rno05223  Non-small cell lung cancer
rno05224  Breast cancer
rno05225  Hepatocellular carcinoma
rno05226  Gastric cancer
rno05231  Choline metabolism in cancer
Brite
KEGG Orthology (KO) [BR:rno00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04010 MAPK signaling pathway
    85384 (Sos2)
   04012 ErbB signaling pathway
    85384 (Sos2)
   04014 Ras signaling pathway
    85384 (Sos2)
   04630 JAK-STAT signaling pathway
    85384 (Sos2)
   04068 FoxO signaling pathway
    85384 (Sos2)
   04072 Phospholipase D signaling pathway
    85384 (Sos2)
   04151 PI3K-Akt signaling pathway
    85384 (Sos2)
   04150 mTOR signaling pathway
    85384 (Sos2)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    85384 (Sos2)
   04540 Gap junction
    85384 (Sos2)
  09142 Cell motility
   04810 Regulation of actin cytoskeleton
    85384 (Sos2)
 09150 Organismal Systems
  09151 Immune system
   04650 Natural killer cell mediated cytotoxicity
    85384 (Sos2)
   04660 T cell receptor signaling pathway
    85384 (Sos2)
   04662 B cell receptor signaling pathway
    85384 (Sos2)
   04664 Fc epsilon RI signaling pathway
    85384 (Sos2)
   04062 Chemokine signaling pathway
    85384 (Sos2)
  09152 Endocrine system
   04910 Insulin signaling pathway
    85384 (Sos2)
   04912 GnRH signaling pathway
    85384 (Sos2)
   04915 Estrogen signaling pathway
    85384 (Sos2)
   04917 Prolactin signaling pathway
    85384 (Sos2)
   04926 Relaxin signaling pathway
    85384 (Sos2)
   04935 Growth hormone synthesis, secretion and action
    85384 (Sos2)
  09156 Nervous system
   04722 Neurotrophin signaling pathway
    85384 (Sos2)
  09159 Environmental adaptation
   04714 Thermogenesis
    85384 (Sos2)
 09160 Human Diseases
  09161 Cancer: overview
   05200 Pathways in cancer
    85384 (Sos2)
   05206 MicroRNAs in cancer
    85384 (Sos2)
   05205 Proteoglycans in cancer
    85384 (Sos2)
   05231 Choline metabolism in cancer
    85384 (Sos2)
  09162 Cancer: specific types
   05210 Colorectal cancer
    85384 (Sos2)
   05225 Hepatocellular carcinoma
    85384 (Sos2)
   05226 Gastric cancer
    85384 (Sos2)
   05214 Glioma
    85384 (Sos2)
   05221 Acute myeloid leukemia
    85384 (Sos2)
   05220 Chronic myeloid leukemia
    85384 (Sos2)
   05211 Renal cell carcinoma
    85384 (Sos2)
   05215 Prostate cancer
    85384 (Sos2)
   05213 Endometrial cancer
    85384 (Sos2)
   05224 Breast cancer
    85384 (Sos2)
   05223 Non-small cell lung cancer
    85384 (Sos2)
  09165 Substance dependence
   05034 Alcoholism
    85384 (Sos2)
  09172 Infectious disease: viral
   05161 Hepatitis B
    85384 (Sos2)
   05160 Hepatitis C
    85384 (Sos2)
   05163 Human cytomegalovirus infection
    85384 (Sos2)
   05165 Human papillomavirus infection
    85384 (Sos2)
  09176 Drug resistance: antineoplastic
   01521 EGFR tyrosine kinase inhibitor resistance
    85384 (Sos2)
   01522 Endocrine resistance
    85384 (Sos2)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   04990 Domain-containing proteins not elsewhere classified [BR:rno04990]
    85384 (Sos2)
Domain-containing proteins not elsewhere classified [BR:rno04990]
 Pleckstrin homology (PH) domain-containing proteins
  Dbl-Like RhoGEF family proteins
   85384 (Sos2)
Motif
Pfam: RasGEF RasGEF_N RhoGEF Histone PH IQ_SEC7_PH PH_19
Other DBs
NCBI-GeneID: 85384
NCBI-ProteinID: NP_001129033
RGD: 620435
Ensembl: ENSRNOG00000004826
UniProt: F1MAI3
Position
6q24
AA seq 1333 aa
MQQAPQPYEFFSEENSPKWRGLLVPALRKVQEQVHPTLSANEESLYYIEELIFQLLNKLC
LAQPRTVQDVEERVQKTFPHPIDKWAIADAQSAIEKRKRRNPLLLPVDKIHPSLKEVLGY
KVDYHVSLYIVAVLEYISADILKLAGNYVFNIRHYEISQQDIKVSMCADKVLMDMFDQDD
DIGLVSLCEEEPCSSGELNYYDLVRTEIAEERQYLRELNMIIKVFREAFLSDKKLFKSSE
IEKIFSNISDIHELTVKLLGLIEDTVEMTDESSPHPLAGSCFEDLAEEQAFDPYETLSQD
ILAPEFNDHFSKLMARPAVALHFQSIADGFKEAVRYVLPRLMLVPVYHCWHYFELLKQLK
ACSEEQEDRECLNQAITALMNLQGSMDRIYKQHSPRRRPGDPVCLFYNRQLRSKHLAIKK
MNEIQKNIDGWEGKDIGQCCNEFIMEGPLTRIGAKHERHIFLFDGLMISCKPNHCQTRLP
GYSSAEYRLKEKFVMRKIQICDKEDACEYKHAFELVSKDENSVIFAAKSAEEKNNWMAAL
ISLHYRSTLDRMLDSVLLKEENEQPLRLPRPDVYRFVVTDSEENIVFEDNLQSRSGIPII
KGGTVVKLIERLTYHMYADPNFVRTFLTTYRSFCKPQELLSLLIERFEIPDPEPTEADKL
ALEKGEQPISADLKRFRKEYVQPVQLRVLNVFRHWVEHHFYDFERDLELLERLEAFISSV
RGKAMKKWVESIAKIIKRKKQAQANGVSHNITFESPPPPVEWHISRAGQSETFDLMTLHP
IEIARQLTLLESDLYRRVQPSELVGSVWTKEDKEINSPNLLKMIRHTTNLTLWFEKCIVE
AENFEERVAVLSRIVEILQVFQDLNNFNGVLEIVSAVNSVSVYRLDHTFEALQERKRRVL
DDAVELSQDHFKKYLVKLKSINPPCVPFFGIYLTNILKTEEGNSDFLKRKGKDLINFSKR
RKVAEITGEIQQYQNQPYCLRTEPEMRRFFENLNPMGLLSEKEFTDYLFNKSHEIEPRNC
KQPPRFPRKSTFSLKSPGIRPNASRHGSTSGTFRGHPTPLEREPYKISFSRIAETELEST
VSAPTSPNTPSTPPVSASSDHSVFLDVDLNSSCGSNTIFAPVLLPHSKSFFSSCGSLHKL
SEEPLVPPPLPPRKKFDHDAPNSKGAMKSDDDPPAIPPRQPPPPKVKPRAPVLTGTFEGP
VPSPPPPPPRDPLPDTPPPVPLRPPEHFINCPFNLQPPPLGHPHRDPDWLRDVSTCPNSP
STPPPTPSPRIPRRCHLPSPSHNNLAHPPAPPVPPRQNSSPHLPKLPPKTYKRELSHPPL
YRLPLFENAETPQ
NT seq 4002 nt
atgcagcaggcgccgcagccctacgagttcttcagcgaagagaacagcccgaaatggcgg
ggactgttggtcccggccctgcggaaggttcaggagcaagtgcaccccaccctgtcggct
aacgaagagtctctctattacatcgaagaactcatctttcaactgctcaataagctgtgc
ctggctcaaccgaggactgttcaagatgttgaggaacgagttcaaaagacttttcctcat
cctattgataaatgggcaattgctgatgcacagtctgccatagagaaacgaaaacgaaga
aatcctctcttactacctgtggacaaaatccatccttccttgaaggaagttttggggtat
aaagtggactaccatgtatccctctacattgtggctgtattggagtatatctcagcagat
attttgaaattggctggtaattatgtttttaatatccggcattatgaaatatctcagcaa
gacattaaagtgtcaatgtgtgcagataaggttttgatggacatgtttgaccaggatgac
gatataggcttggtttctctctgtgaagaggaaccttgttcttctggtgagctaaactat
tatgaccttgttagaactgaaatcgcagaagaaagacagtatctacgggaactgaatatg
atcattaaagtattccgggaagcctttctttcagacaaaaagttgttcaagtcttctgaa
attgaaaagattttcagtaacatttcagatatacatgaactgactgtgaaacttttaggt
ttgattgaagacacagtagaaatgacagatgaaagcagtcctcacccgctagctggtagc
tgctttgaagatttagcagaggagcaggcttttgacccctatgaaacgttatcacaggac
attcttgctccagagtttaatgaccacttcagcaagttgatggccagacctgcagttgct
ctgcactttcagtccattgctgacggctttaaggaggctgttcgatatgtccttccgcgc
ctcatgctggtgccggtgtatcattgttggcattactttgaattattaaagcagttgaaa
gcatgtagcgaagaacaggaagaccgagagtgtttgaatcaggctatcactgccctcatg
aacctccaaggcagcatggaccggatttacaagcagcactcgccaaggcgccggccaggg
gatcctgtttgccttttttacaatcgtcaattaagaagcaagcacctggctatcaaaaaa
atgaatgaaattcagaaaaacattgatgggtgggaaggcaaagatattggacagtgttgt
aatgagttcataatggaaggaccattgaccagaattggtgctaagcatgaaaggcatatt
ttcctcttcgatggcttaatgatcagctgtaaacccaatcattgccagactcgacttcca
gggtatagtagtgcagaatacaggttaaaagagaagtttgtcatgaggaaaattcaaatc
tgtgataaagaagacgcttgtgaatacaaacatgcttttgaattagtatccaaagatgaa
aacagtgtaatatttgctgccaagtcagctgaagagaaaaacaactggatggcagccctt
atctcccttcactatcggagcactctagatagaatgctggactctgtgttactgaaagag
gagaacgagcagcccctgcggctaccgcggccagacgtgtaccgcttcgtggtgacagac
tcagaggagaacatagtttttgaagacaacttgcaaagcagaagtggaatccccattatt
aaaggaggcaccgtggtgaaactgatcgaaaggctaacatatcacatgtatgcagatccc
aattttgttcgtactttccttactacgtatcgttcattttgtaagccacaggaattgcta
agcttgctgatagaacggtttgaaattccagatccagagcctactgaggccgacaaactg
gcgctagagaaaggcgagcagccaatcagcgcagatctgaaaagattccgcaaggaatat
gtccaacccgtgcaacttagggtcttgaatgtctttcgccactgggttgagcatcatttt
tatgactttgaaagagacttggaattgcttgaaagactggaagccttcatttcaagtgta
agagggaaagccatgaagaagtgggtagaatccattgctaaaatcatcaagaggaagaag
caagctcaggccaatggagtaagccataatatcacctttgaaagtccgcctccaccggtt
gaatggcacataagtagggcgggacagtctgaaacctttgaccttatgacacttcatccc
atagagattgcacggcagttaacactcttggaatctgatctctacaggagagtccagcct
tctgaacttgtagggagtgtctggaccaaagaagataaagaaataaattctccaaatcta
ttaaaaatgattcgccatacaacaaacctcactttgtggtttgaaaaatgcattgtggaa
gcagaaaactttgaagaacgggtggcagtgctcagcagaatagtagaaattctgcaagtg
tttcaagatctgaataatttcaatggcgtgttggagatagtcagtgcagtcaattcagtg
tcagtgtacagactagaccacacatttgaggcattgcaggaaaggaagcggagagttttg
gatgacgctgtggaattaagtcaggatcactttaaaaaatacctagtaaaacttaaatca
atcaatccaccttgtgtgcctttttttggaatatatttaacaaatattctgaagactgaa
gaagggaacagcgactttctaaagaggaaagggaaagatttgatcaatttcagtaagagg
agaaaagtggctgaaataaccggagagattcagcagtatcagaaccaaccatactgctta
cggacagaaccagagatgaggagattctttgaaaacctcaatcccatgggacttttatct
gaaaaagagtttacagattatttgttcaacaaatctcatgaaattgaaccccgaaactgc
aaacaaccacctcgatttcctaggaagtcaaccttttccttaaaatctcctggaataagg
cccaatgccagccgccatggctctacctcaggcacgttccgaggtcacccaacccctctg
gaaagggagccttataagataagctttagcaggatcgccgagacggagctagaatccacg
gtgtcagcaccgacatctcccaacaccccatccaccccacctgtgtctgcttcttcagac
cacagcgtgtttctagatgtggacctcaatagctcctgtggcagcaataccatctttgct
ccagtcctcttgccacactcaaagtctttcttcagctcatgtggaagtctacacaaactg
agtgaagagccactagttcctcctccgcttccccctcggaaaaagtttgatcacgatgct
ccgaactccaagggagctatgaaatcggatgatgaccctcctgctattccaccaagacaa
ccccctcctccaaaggtaaagccaagagctcctgtcctcacgggtacatttgaagggcct
gtgcccagtccacctccacctcctcccagagaccctcttcctgatactcctccaccagtt
cctcttcggcctccggaacactttataaactgtccgtttaatctccagccgcctccgctg
ggacatcctcacagagacccagactggctcagagacgttagcacatgccctaattcacca
agcactcctccccctacgccctcgccacggattccacgcagatgccatttgcccagcccc
agtcacaacaatcttgctcaccctccagctcctcccgttccaccaaggcagaattcaagc
cctcacctaccaaaactgccaccaaagacttacaagcgggagctttcacacccgccattg
tacagactacctttgtttgaaaatgcagaaactcctcagtga
