KEGG   Pongo abelii (Sumatran orangutan): 100434331
Entry
100434331         CDS       T01416                                 

Gene name
SOS2
Definition
(RefSeq) son of sevenless homolog 2 isoform X1
KO
K03099  son of sevenless
Organism
pon  Pongo abelii (Sumatran orangutan)
Pathway
pon01521  EGFR tyrosine kinase inhibitor resistance
pon01522  Endocrine resistance
pon04010  MAPK signaling pathway
pon04012  ErbB signaling pathway
pon04014  Ras signaling pathway
pon04062  Chemokine signaling pathway
pon04068  FoxO signaling pathway
pon04072  Phospholipase D signaling pathway
pon04150  mTOR signaling pathway
pon04151  PI3K-Akt signaling pathway
pon04510  Focal adhesion
pon04540  Gap junction
pon04630  JAK-STAT signaling pathway
pon04650  Natural killer cell mediated cytotoxicity
pon04660  T cell receptor signaling pathway
pon04662  B cell receptor signaling pathway
pon04664  Fc epsilon RI signaling pathway
pon04714  Thermogenesis
pon04722  Neurotrophin signaling pathway
pon04810  Regulation of actin cytoskeleton
pon04910  Insulin signaling pathway
pon04912  GnRH signaling pathway
pon04915  Estrogen signaling pathway
pon04917  Prolactin signaling pathway
pon04926  Relaxin signaling pathway
pon04935  Growth hormone synthesis, secretion and action
pon05034  Alcoholism
pon05160  Hepatitis C
pon05161  Hepatitis B
pon05163  Human cytomegalovirus infection
pon05165  Human papillomavirus infection
pon05200  Pathways in cancer
pon05205  Proteoglycans in cancer
pon05206  MicroRNAs in cancer
pon05210  Colorectal cancer
pon05211  Renal cell carcinoma
pon05213  Endometrial cancer
pon05214  Glioma
pon05215  Prostate cancer
pon05220  Chronic myeloid leukemia
pon05221  Acute myeloid leukemia
pon05223  Non-small cell lung cancer
pon05224  Breast cancer
pon05225  Hepatocellular carcinoma
pon05226  Gastric cancer
pon05231  Choline metabolism in cancer
Brite
KEGG Orthology (KO) [BR:pon00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04010 MAPK signaling pathway
    100434331 (SOS2)
   04012 ErbB signaling pathway
    100434331 (SOS2)
   04014 Ras signaling pathway
    100434331 (SOS2)
   04630 JAK-STAT signaling pathway
    100434331 (SOS2)
   04068 FoxO signaling pathway
    100434331 (SOS2)
   04072 Phospholipase D signaling pathway
    100434331 (SOS2)
   04151 PI3K-Akt signaling pathway
    100434331 (SOS2)
   04150 mTOR signaling pathway
    100434331 (SOS2)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    100434331 (SOS2)
   04540 Gap junction
    100434331 (SOS2)
  09142 Cell motility
   04810 Regulation of actin cytoskeleton
    100434331 (SOS2)
 09150 Organismal Systems
  09151 Immune system
   04650 Natural killer cell mediated cytotoxicity
    100434331 (SOS2)
   04660 T cell receptor signaling pathway
    100434331 (SOS2)
   04662 B cell receptor signaling pathway
    100434331 (SOS2)
   04664 Fc epsilon RI signaling pathway
    100434331 (SOS2)
   04062 Chemokine signaling pathway
    100434331 (SOS2)
  09152 Endocrine system
   04910 Insulin signaling pathway
    100434331 (SOS2)
   04912 GnRH signaling pathway
    100434331 (SOS2)
   04915 Estrogen signaling pathway
    100434331 (SOS2)
   04917 Prolactin signaling pathway
    100434331 (SOS2)
   04926 Relaxin signaling pathway
    100434331 (SOS2)
   04935 Growth hormone synthesis, secretion and action
    100434331 (SOS2)
  09156 Nervous system
   04722 Neurotrophin signaling pathway
    100434331 (SOS2)
  09159 Environmental adaptation
   04714 Thermogenesis
    100434331 (SOS2)
 09160 Human Diseases
  09161 Cancer: overview
   05200 Pathways in cancer
    100434331 (SOS2)
   05206 MicroRNAs in cancer
    100434331 (SOS2)
   05205 Proteoglycans in cancer
    100434331 (SOS2)
   05231 Choline metabolism in cancer
    100434331 (SOS2)
  09162 Cancer: specific types
   05210 Colorectal cancer
    100434331 (SOS2)
   05225 Hepatocellular carcinoma
    100434331 (SOS2)
   05226 Gastric cancer
    100434331 (SOS2)
   05214 Glioma
    100434331 (SOS2)
   05221 Acute myeloid leukemia
    100434331 (SOS2)
   05220 Chronic myeloid leukemia
    100434331 (SOS2)
   05211 Renal cell carcinoma
    100434331 (SOS2)
   05215 Prostate cancer
    100434331 (SOS2)
   05213 Endometrial cancer
    100434331 (SOS2)
   05224 Breast cancer
    100434331 (SOS2)
   05223 Non-small cell lung cancer
    100434331 (SOS2)
  09165 Substance dependence
   05034 Alcoholism
    100434331 (SOS2)
  09172 Infectious disease: viral
   05161 Hepatitis B
    100434331 (SOS2)
   05160 Hepatitis C
    100434331 (SOS2)
   05163 Human cytomegalovirus infection
    100434331 (SOS2)
   05165 Human papillomavirus infection
    100434331 (SOS2)
  09176 Drug resistance: antineoplastic
   01521 EGFR tyrosine kinase inhibitor resistance
    100434331 (SOS2)
   01522 Endocrine resistance
    100434331 (SOS2)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   04990 Domain-containing proteins not elsewhere classified [BR:pon04990]
    100434331 (SOS2)
Domain-containing proteins not elsewhere classified [BR:pon04990]
 Pleckstrin homology (PH) domain-containing proteins
  Dbl-Like RhoGEF family proteins
   100434331 (SOS2)
Motif
Pfam: RasGEF RasGEF_N RhoGEF PH Histone IQ_SEC7_PH PH_19 PH_13
Other DBs
NCBI-GeneID: 100434331
NCBI-ProteinID: XP_002824770
Ensembl: ENSPPYG00000005799
UniProt: A0A6D2WP06 H2NL72
Position
14
AA seq 1332 aa
MQQAPQPYEFFSEENSPKWRGLLVSALRKVQEQVHPTLSANEESLYYIEELIFQLLNKLC
MAQPRTVQDVEERVQKTFPHPIDKWAIADAQSAIEKRKRRNPLLLPVDKIHPSLKEVLGY
KVDYHVSLYIVAVLEYISADILKLAGNYVFNIRHYEISQQDIKVSMCADKVLMDMFDQDD
IGLVSLCEDEPSSSGELNYYDLVRTEIAEERQYLRELNMIIKVFREAFLSDRKLFKPYDI
EKIFSNISDIHELTVKLLGLIEDTVEMTDESSPHPLAGSCFEDLAEEQAFDPYETLSQDI
LSPEFHEHFNKLMARPAVALHFQSIADGFKEAVRYVLPRLMLVPVYHCWHYFELLKQLKA
CSEEQEDRECLNQAITALMNLQGSMDRIYKQYSPRRRPGDPVCPFYSHQLRSKHLAIKKM
NEIQKNIDGWEGKDIGQCCNEFIMEGPLTRIGAKHERHIFLFDGLMISCKPNHGQTRLPG
YSSAEYRLKEKFVMRKIQICDKEDTCEYKHAFELVSKDENSIIFAAKSAEEKNNWMAALI
SLHYRSTLDRMLDSVLLKEENEQPLRLPSPEVYRFVVKDSEENIVFEDNLQSRSGIPIIK
GGTVVKLIERLTYHMYADPNFVRTFLTTYRSFCKPQELLSLLIERFEIPEPEPTDADKLA
IEKGEQPISADLKRFRKEYVQPVQLRILNVFRHWVEHHFYDFERDLELLERLESFISSVR
GKAMKKWVESIAKIIRRKKQAQANGISHNITFESPPPPIEWHISKPGQFETFDLMTLHPI
EIARQLTLLESDLYRKVQPSELVGSVWTKEDKEINSPNLLKMIRHTTNLTLWFEKCIVEA
ENFEERVAVLSRIIEILQVFQDLNNFNGVLEIVSAVNSVSVYRLDHTFEALQERKRKILD
EAVELSQDHFKKYLVKLKSINPPCVPFFGIYLTNILKTEEGNNDFLKKKGKDLINFSKRR
KVAEITGEIQQYQNQPYCLRIEPDMRRFFENLNPMGSASEKEFTDYLFNKSLEIEPRNCK
QPPRFPRKSTFSLKSPGIRPNTGRHGSTSGTLRGHPTPLEREPCKISFSRIAETELESTV
SAPTSPNTPSTPPVSASSDLSVFLDVDLNSSCGSNSVFAPVLLPHSKSFFSSCGSLHKLS
EEPLIPPPLPPRKKFDHDASNSKGNMKSDDDPPAIPPRQPPPPKIKPRVPVPTGAFDGPL
HSPPPPPPRDPLPDTPPPVPLRPPEHFINCPFNLQPPPLGHLHRDSDWLRDISTCPNSPS
TPPSTPSPRVPRRCYVLSSSQNNLAHPPAPPVPPRQNSSPHLPKLPPKTYKREFSHPPLY
RLPLLENAETPQ
NT seq 3999 nt
atgcagcaggcgccgcagccttacgagttcttcagcgaggagaacagtccgaaatggcgg
ggactgttggtctcggccctgcggaaggttcaggaacaagtgcatcccactctctcagct
aatgaagagtctctctattatattgaagagctgatttttcagctgcttaataaattatgc
atggcccagccaaggactgttcaagatgtagaggagcgagttcagaagacctttcctcac
ccaattgataaatgggccattgctgatgcacaatctgctatagaaaaacgaaaacgaaga
aatcctcttttactgcctgtggacaaaatccatccttcgttgaaggaagtgttagggtac
aaagtggactaccatgtatccctatatattgtggctgtactagagtatatctcagctgat
attttaaaattggctggtaattatgtttttaatatccggcattatgaaatatctcagcag
gacattaaagtgtcaatgtgtgcggataaggttttgatggacatgtttgatcaggatgac
ataggtttggtttctctctgtgaagatgaacctagttcttctggtgaattaaactactat
gatcttgtcagaactgaaatcgcagaagaaagacagtatctacgggaattaaatatgatc
ataaaagtgtttcgagaagcctttctttctgataggaagctctttaaaccttatgatatc
gaaaagatttttagtaacatttcagatatacatgaattgactgtgaaacttttaggtttg
attgaagacacagttgaaatgactgatgaaagcagtcctcatcccttagctggcagctgt
tttgaagatttggcagaagagcaagcatttgatccttatgaaacattatcacaggacatt
ctttcaccagagtttcatgaacatttcaataaattgatggccagacctgcagttgctcta
cactttcagtccattgctgatggttttaaagaggcagttcgttatgtccttccacgtctt
atgctggtgccagtgtaccactgttggcactactttgagttattaaagcaattgaaagca
tgtagtgaagaacaagaagacagagaatgtttgaaccaagctattactgctctcatgaat
ctccaaggtagcatggaccgaatttacaagcagtattcacctagacgtcgacctggagat
cctgtttgccctttttatagtcatcaattaagaagcaaacacctggctatcaaaaaaatg
aatgaaattcagaaaaatatcgatggatgggaaggcaaagatattggacagtgttgtaat
gaattcattatggagggaccattgacaagaatcggtgccaaacatgaacggcatattttt
ctgtttgatggcttaatgatcagttgtaaacctaatcatggccagactcggcttccaggt
tacagtagtgcagaatacaggttaaaagaaaaatttgtcatgaggaaaatacaaatttgt
gataaagaagatacttgtgagtacaagcatgcatttgaattagtatccaaagatgagaac
agcataatatttgctgctaagtctgctgaagaaaaaaacaactggatggcagcccttatt
tctcttcattatcgtagtactctagatcgaatgttagattcagtattattgaaagaagaa
aatgagcaaccactgagattaccaagtcctgaagtatatcgttttgtagtaaaagactct
gaggaaaacattgtttttgaagacaacttgcaaagtagaagtggcatccccattattaaa
ggaggaactgtagtgaaattaattgaaaggttaacataccatatgtatgcagatcccaat
tttgttcgtacttttcttaccacatatcgttcattttgtaaaccacaggaattgctgagc
ttactgattgaacggtttgaaattccagagccagaacctactgacgcagacaaattggca
atagagaaaggcgagcagccaatcagtgcagaccttaaaagatttcgcaaggaatacgtc
caaccagtacaacttaggatcttaaatgtatttcggcattgggttgaacatcatttttat
gactttgaaagagacttggaattgcttgaaagactagaatccttcatttcaagtgtaaga
gggaaagctatgaaaaaatgggtagagtcaatcgctaaaatcatcaggaggaagaagcaa
gctcaggcaaacggaataagccataatattacctttgaaagtccacctccaccaattgaa
tggcatatcagcaaaccaggacagtttgaaacatttgatctcatgacacttcatccaata
gaaattgcacgtcagctgacacttttggagtctgatctctacaggaaagttcaaccgtct
gaacttgtagggagtgtgtggaccaaagaagataaagaaataaattctccaaatttatta
aaaatgattcgccataccacaaatctcaccctctggtttgaaaaatgcattgtggaagca
gaaaattttgaagaacgggtggcagtactaagtagaattatagaaattctgcaagttttt
caagatttgaataatttcaatggcgtattggagatagtcagtgcagtaaattcagtgtca
gtatacagactagaccatacctttgaggcattgcaggaaagaaaaaggaaaattttggat
gaggctgtggaattaagtcaagatcactttaaaaaatacctagtaaaacttaagtcaatc
aacccaccttgtgtgcctttttttggaatatatttaacaaatattctgaagaccgaagaa
gggaataatgattttttaaaaaagaaagggaaagatttaatcaatttcagtaagaggagg
aaagtagctgaaattactggagaaattcagcagtatcagaatcagccttactgtttacgg
atagaaccagatatgaggagattctttgaaaaccttaaccccatgggaagtgcatctgaa
aaagagtttacagattatttgttcaacaagtcactagaaattgaacctcgaaactgcaaa
cagccacctcgatttcctaggaaatcaactttctccttaaaatctcctggaatacggcct
aacaccggccgacatggctctacctcaggtactttacgaggtcacccaacaccattagaa
agagaaccatgtaaaataagctttagtcggattgctgaaactgagctggaatcaacagtg
tcagcaccaacctctccaaatacaccatctactccaccagtatctgcttcttcagacctt
agtgtatttttagatgtggatctcaacagctcctgtggcagcaatagcgtctttgctcca
gtgcttttgccacattcaaagtctttctttagttcatgtggtagtttacataaactaagt
gaagagcccctgattcctcctcctcttcctcctcgaaaaaagtttgatcatgatgcttca
aattccaagggaaatatgaaatctgatgatgatcctcctgctattccaccaagacagcct
cctcctccaaagataaaacccagagttcctgttcctactggtgcatttgatgggcctctg
catagtccacctccgccaccaccaagagatcctcttcctgatacccctccaccggttccc
cttcggcctccagaacactttataaactgtccatttaatcttcagccacctccactgggg
catcttcacagagattcagactggcttagagacattagtacgtgtccaaattcgccaagc
actcctcctagcacaccctctccaagggtaccgcgtcgatgctatgtgctcagttctagt
cagaataatcttgctcatcctccagctccccctgttccaccaaggcagaattcaagccct
catctgccaaaactgccaccaaagacttacaaacgggagttttcgcaccccccattgtac
agactgcctttgctagaaaatgcagaaactccccaatga

KEGG   Pongo abelii (Sumatran orangutan): 100458733
Entry
100458733         CDS       T01416                                 

Gene name
SOS1
Definition
(RefSeq) son of sevenless homolog 1 isoform X1
KO
K03099  son of sevenless
Organism
pon  Pongo abelii (Sumatran orangutan)
Pathway
pon01521  EGFR tyrosine kinase inhibitor resistance
pon01522  Endocrine resistance
pon04010  MAPK signaling pathway
pon04012  ErbB signaling pathway
pon04014  Ras signaling pathway
pon04062  Chemokine signaling pathway
pon04068  FoxO signaling pathway
pon04072  Phospholipase D signaling pathway
pon04150  mTOR signaling pathway
pon04151  PI3K-Akt signaling pathway
pon04510  Focal adhesion
pon04540  Gap junction
pon04630  JAK-STAT signaling pathway
pon04650  Natural killer cell mediated cytotoxicity
pon04660  T cell receptor signaling pathway
pon04662  B cell receptor signaling pathway
pon04664  Fc epsilon RI signaling pathway
pon04714  Thermogenesis
pon04722  Neurotrophin signaling pathway
pon04810  Regulation of actin cytoskeleton
pon04910  Insulin signaling pathway
pon04912  GnRH signaling pathway
pon04915  Estrogen signaling pathway
pon04917  Prolactin signaling pathway
pon04926  Relaxin signaling pathway
pon04935  Growth hormone synthesis, secretion and action
pon05034  Alcoholism
pon05160  Hepatitis C
pon05161  Hepatitis B
pon05163  Human cytomegalovirus infection
pon05165  Human papillomavirus infection
pon05200  Pathways in cancer
pon05205  Proteoglycans in cancer
pon05206  MicroRNAs in cancer
pon05210  Colorectal cancer
pon05211  Renal cell carcinoma
pon05213  Endometrial cancer
pon05214  Glioma
pon05215  Prostate cancer
pon05220  Chronic myeloid leukemia
pon05221  Acute myeloid leukemia
pon05223  Non-small cell lung cancer
pon05224  Breast cancer
pon05225  Hepatocellular carcinoma
pon05226  Gastric cancer
pon05231  Choline metabolism in cancer
Brite
KEGG Orthology (KO) [BR:pon00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04010 MAPK signaling pathway
    100458733 (SOS1)
   04012 ErbB signaling pathway
    100458733 (SOS1)
   04014 Ras signaling pathway
    100458733 (SOS1)
   04630 JAK-STAT signaling pathway
    100458733 (SOS1)
   04068 FoxO signaling pathway
    100458733 (SOS1)
   04072 Phospholipase D signaling pathway
    100458733 (SOS1)
   04151 PI3K-Akt signaling pathway
    100458733 (SOS1)
   04150 mTOR signaling pathway
    100458733 (SOS1)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    100458733 (SOS1)
   04540 Gap junction
    100458733 (SOS1)
  09142 Cell motility
   04810 Regulation of actin cytoskeleton
    100458733 (SOS1)
 09150 Organismal Systems
  09151 Immune system
   04650 Natural killer cell mediated cytotoxicity
    100458733 (SOS1)
   04660 T cell receptor signaling pathway
    100458733 (SOS1)
   04662 B cell receptor signaling pathway
    100458733 (SOS1)
   04664 Fc epsilon RI signaling pathway
    100458733 (SOS1)
   04062 Chemokine signaling pathway
    100458733 (SOS1)
  09152 Endocrine system
   04910 Insulin signaling pathway
    100458733 (SOS1)
   04912 GnRH signaling pathway
    100458733 (SOS1)
   04915 Estrogen signaling pathway
    100458733 (SOS1)
   04917 Prolactin signaling pathway
    100458733 (SOS1)
   04926 Relaxin signaling pathway
    100458733 (SOS1)
   04935 Growth hormone synthesis, secretion and action
    100458733 (SOS1)
  09156 Nervous system
   04722 Neurotrophin signaling pathway
    100458733 (SOS1)
  09159 Environmental adaptation
   04714 Thermogenesis
    100458733 (SOS1)
 09160 Human Diseases
  09161 Cancer: overview
   05200 Pathways in cancer
    100458733 (SOS1)
   05206 MicroRNAs in cancer
    100458733 (SOS1)
   05205 Proteoglycans in cancer
    100458733 (SOS1)
   05231 Choline metabolism in cancer
    100458733 (SOS1)
  09162 Cancer: specific types
   05210 Colorectal cancer
    100458733 (SOS1)
   05225 Hepatocellular carcinoma
    100458733 (SOS1)
   05226 Gastric cancer
    100458733 (SOS1)
   05214 Glioma
    100458733 (SOS1)
   05221 Acute myeloid leukemia
    100458733 (SOS1)
   05220 Chronic myeloid leukemia
    100458733 (SOS1)
   05211 Renal cell carcinoma
    100458733 (SOS1)
   05215 Prostate cancer
    100458733 (SOS1)
   05213 Endometrial cancer
    100458733 (SOS1)
   05224 Breast cancer
    100458733 (SOS1)
   05223 Non-small cell lung cancer
    100458733 (SOS1)
  09165 Substance dependence
   05034 Alcoholism
    100458733 (SOS1)
  09172 Infectious disease: viral
   05161 Hepatitis B
    100458733 (SOS1)
   05160 Hepatitis C
    100458733 (SOS1)
   05163 Human cytomegalovirus infection
    100458733 (SOS1)
   05165 Human papillomavirus infection
    100458733 (SOS1)
  09176 Drug resistance: antineoplastic
   01521 EGFR tyrosine kinase inhibitor resistance
    100458733 (SOS1)
   01522 Endocrine resistance
    100458733 (SOS1)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   04990 Domain-containing proteins not elsewhere classified [BR:pon04990]
    100458733 (SOS1)
Domain-containing proteins not elsewhere classified [BR:pon04990]
 Pleckstrin homology (PH) domain-containing proteins
  Dbl-Like RhoGEF family proteins
   100458733 (SOS1)
Motif
Pfam: RasGEF RasGEF_N RhoGEF Histone PH PH_19 PH_10 IQ_SEC7_PH PH_13
Other DBs
NCBI-GeneID: 100458733
NCBI-ProteinID: XP_024098134
Ensembl: ENSPPYG00000012474
Position
2A
AA seq 1333 aa
MQAQQLPYEFFSEENAPKWRGLLVPALKKVQGQVHPTLESNDDALQYVEELILQLLNMLC
QAQPRSASDVEERVQKSFPHPIDKWAIADAQSAIEKRKRRNPLSLPVEKIHPLLKEVLGY
KIDHQVSVYIVAVLEYISADILKLVGNYVRNIRHYEITKQDIKVAMCADKVLMDMFHQDV
EDINVLSLTDEEPSTSGEQTYYDLVKAFMAEIRQYIRELNLIIKVFREPFVSNSKLFSAN
DVENIFSRIVDIHELSVKLLGHIEDTVEMTDEGSPHPLVGSCFEDLAEELAFDPYESYAR
DILRPGFHDRFLSQLSKPGAALYLQSIGEGFKEAVQYVLPRLLLAPVYHCLHYFELLKQL
EEKSEDQEDKECLKQAITALLNVQSGMEKICSKSLAKRRLSESACRFYSQQMKGKQLAIK
KMNEIQKNIDGWEGKDIGQCCNEFIMEGTLTRVGAKHERHIFLFDGLMICCKSNHGQPRL
PGASNAEYRLKEKFFMRKVQINDKDDTNEYKHAFEIILKDENSVIFSAKSAEEKNNWMAA
LISLQYRSTLERMLDVTMLQEEKEEQMRLPSADVYRFAEPDSEENIIFEENMQPKAGIPI
IKAGTVIKLIERLTYHMYADPNFVRTFLTTYRSFCKPQELLSLIIERFEIPEPEPTEADR
IAIENGDQPLSAELKRFRKEYIQPVQLRVLNVCRHWVEHHFYDFERDAYLLQRMEEFIGT
VRGKAMKKWVESITKIIQRKKIARDNGPGHNITFQSSPPTVEWHISRPGHIETFDLLTLH
PIEIARQLTLLESDLYRAVQPSELVGSVWTKEDKEINSPNLLKMIRHTTNLTLWFEKCIV
ETENLEERVAVVSRIIEILQVFQELNNFNGVLEVVSAMNSSPVYRLDHTFEQIPSRQKKI
LEEAHELSEDHYKKYLAKLRSINPPCVPFFGIYLTNILKTEEGNPEVLKRHGKELINFSK
RRKVAEITGEIQQYQNQPYCLRVESDIKRFFENLNPMGNSMEKEFTDYLFNKSLEIEPRN
PKPLPRFPKKYSYPLKSPGVRPSNPRPGTMRHPTPLQQEPRKISYSRIPESETESTASAP
NSPRTPLTPPPASGASSTTDVCSVFDSDHSSPFHSSNDTVFIQVTLPHGPRSASVSSISL
TKGTDEVPVPPPVPPRRRPESAPAESSPSKIMSKHLDSPPAIPPRQPTSKAYSPRYSISD
RTSVSDPPESPPLLPPREPVRTPDVFSSSPLHLQPPPLGKKSDHGNAFFPNSPSPFTPPP
PQTPSPHGTRRHLPSPPLTQEVDLHSIAGPPVPPRQSTSQHIPKLPPKTYKREHTHPSMH
RDGPPLLENAHSS
NT seq 4002 nt
atgcaggcgcagcagctgccctacgagtttttcagcgaagagaacgcgcccaagtggcgg
ggactgctggtgcctgcgctgaaaaaggtccaggggcaagttcatcctactctcgagtct
aatgacgatgctcttcagtatgttgaagaattaattttgcaattattaaatatgctatgc
caagctcagccccgaagtgcttcagatgtagaggaacgtgttcaaaaaagtttccctcat
ccaattgataaatgggcaatagctgatgcccaatcagctattgaaaagaggaagcgaaga
aaccctttatctctcccagtagaaaaaattcatcctttattaaaggaggtcctaggttat
aaaattgaccaccaggtttctgtttacatagtagcagtcttagaatacatttctgcagac
attttaaagctggttgggaattacgtaagaaatatacggcattatgaaattacaaaacaa
gatattaaagtggcaatgtgtgctgacaaggtattgatggatatgtttcatcaagatgta
gaagatattaatgtattatctttaactgatgaagagccttccacctcaggagaacaaact
tactatgatttggtaaaagcatttatggcagaaattcgacaatatataagggaactaaat
ctaattataaaagtttttagagagccctttgtctccaattcaaaattgttttcagctaat
gatgtagaaaatatatttagtcgcatagtagatatacatgaacttagtgtaaagttactg
ggccatatagaagatacagtagaaatgacagatgaaggcagtccccatccactagtagga
agctgctttgaagacttagcagaggaactggcatttgatccatatgaatcgtatgctcga
gatattttgcgacctggttttcatgatcgtttccttagtcagttatcaaagcctggggca
gcactttatttgcagtcaataggcgaaggtttcaaagaagctgttcaatacgttttaccc
aggctgcttctggcccctgtttaccactgtctccattactttgaacttttgaagcagtta
gaagaaaaaagtgaagatcaagaagacaaggaatgtttaaaacaagcaataacagctttg
cttaatgttcagagtggtatggaaaaaatatgttctaaaagtcttgcaaaacgaagactg
agtgaatctgcatgtcggttttatagtcagcaaatgaaggggaaacaactagcaatcaag
aaaatgaacgagattcagaagaatattgatggttgggagggaaaagacattggacagtgt
tgcaatgaatttataatggaaggaactcttacacgtgtaggagccaaacatgagagacac
atatttctctttgatggcttaatgatttgctgtaaatcaaatcatgggcagccaagactt
cctggtgctagcaatgcagaatatcgtcttaaagaaaagttttttatgcgaaaggtacaa
attaatgataaagatgacaccaatgaatacaaacatgcttttgaaataattttaaaagat
gaaaatagtgttatattttctgccaagtcagctgaagagaaaaacaattggatggcagca
ttgatatctttacagtaccggagtacactggaaaggatgcttgatgtaacaatgctacag
gaagagaaagaggagcagatgaggctgcctagtgccgatgtttatagatttgcagagcct
gactctgaagagaatattatatttgaagagaacatgcagcccaaggctggaattccaatt
atcaaagcaggaactgttattaaacttatagagaggcttacgtaccatatgtacgcagat
cccaattttgttcggacatttcttacaacatacagatccttttgcaaacctcaagaacta
ctgagtcttataatagaaaggtttgaaattccagagcctgagccaacagaagctgatcgc
atagctatagagaatggagatcaacccttgagtgcagaactaaaaagatttagaaaagaa
tatatacagcctgtgcaactgcgagtattaaatgtatgtcggcactgggtagagcaccac
ttctatgattttgaaagagatgcatatcttttgcaacgaatggaagaatttattggaaca
gtaagaggtaaagcaatgaaaaaatgggttgaatccatcactaaaataatccaaaggaaa
aaaattgcaagagacaatggaccaggtcataatattacatttcagagttcacctcccaca
gttgagtggcatataagcagacctgggcacatagagacttttgacctgctcaccttacac
ccaatagaaattgctcgacaactcactttacttgaatcagatctataccgagctgtacag
ccatcagaattagttggaagtgtgtggacaaaagaagacaaagaaattaactctcctaat
cttctgaaaatgattcgacataccaccaacctcactctgtggtttgagaaatgtattgta
gaaactgaaaatttagaagaaagagtagctgtggtgagtcgaataattgagattctacaa
gtctttcaagagttgaacaactttaatggtgtccttgaggtcgtcagtgctatgaattca
tcacctgtttacagactagaccacacatttgagcaaataccaagtcgccagaagaaaatt
ttagaagaagctcatgaattgagtgaagatcactataagaaatatttggcaaaactcagg
tctattaatccaccatgtgtgcctttctttggaatttatctcactaatatcttgaaaaca
gaagaaggcaaccctgaggtcctaaaaagacatgggaaagagcttataaactttagcaaa
aggaggaaagtagcagaaataacaggagagatccagcagtaccaaaatcagccttactgt
ttacgagtagaatcagatatcaaaaggttctttgaaaacttgaatccgatgggaaatagc
atggagaaagaatttacagattatcttttcaacaaatccctagaaatagaaccacgaaac
cctaagcctctcccaagatttccaaaaaaatatagctatcccctaaaatctcctggtgtt
cgtccatcaaacccaagaccaggtaccatgaggcatcccacacctctgcagcaggagcca
aggaaaattagttatagtaggatccctgaaagtgaaacagaaagtacagcatctgcacca
aattctccaagaacaccgttaacacctccgcctgcttctggtgcttccagtaccacagat
gtttgcagtgtatttgattccgatcattcgagcccttttcactcaagcaatgataccgtc
tttatccaagttactctgccccatggcccaagatctgcttctgtatcatctataagttta
accaaaggcactgatgaagtgcctgtccctcctcctgttcctccacgaagacgaccagaa
tctgccccagcagaatcttcaccatctaagattatgtctaagcatttggacagtccccca
gccattcctcctaggcaacccacatcaaaagcctattcaccacgatattcaatatcagac
cggacctctgtctcagaccctcctgaaagccctcccttattaccaccacgagaacctgtg
aggacacctgatgttttctcaagctcaccactacatctccaacctccccctttgggcaaa
aaaagtgaccatggcaatgccttcttcccaaacagcccttccccttttacaccacctcct
cctcaaacaccttctcctcacggcacaagaaggcatctgccatcaccaccattgacacaa
gaagtggaccttcattccattgctgggccgcctgttcctccacgacaaagcacttctcaa
catatccctaaactccctccaaaaacttacaaaagggagcacacacacccatccatgcac
agagatggaccaccactgttggagaatgcccattcttcctga
