Pan troglodytes (chimpanzee): 452897
Help
Entry
452897 CDS
T01005
Symbol
SOS2
Name
(RefSeq) son of sevenless homolog 2 isoform X1
KO
K03099
son of sevenless
Organism
ptr
Pan troglodytes (chimpanzee)
Pathway
ptr01521
EGFR tyrosine kinase inhibitor resistance
ptr01522
Endocrine resistance
ptr04010
MAPK signaling pathway
ptr04012
ErbB signaling pathway
ptr04014
Ras signaling pathway
ptr04062
Chemokine signaling pathway
ptr04068
FoxO signaling pathway
ptr04072
Phospholipase D signaling pathway
ptr04150
mTOR signaling pathway
ptr04151
PI3K-Akt signaling pathway
ptr04510
Focal adhesion
ptr04540
Gap junction
ptr04630
JAK-STAT signaling pathway
ptr04650
Natural killer cell mediated cytotoxicity
ptr04660
T cell receptor signaling pathway
ptr04662
B cell receptor signaling pathway
ptr04664
Fc epsilon RI signaling pathway
ptr04714
Thermogenesis
ptr04722
Neurotrophin signaling pathway
ptr04810
Regulation of actin cytoskeleton
ptr04910
Insulin signaling pathway
ptr04912
GnRH signaling pathway
ptr04915
Estrogen signaling pathway
ptr04917
Prolactin signaling pathway
ptr04926
Relaxin signaling pathway
ptr04935
Growth hormone synthesis, secretion and action
ptr05034
Alcoholism
ptr05160
Hepatitis C
ptr05161
Hepatitis B
ptr05163
Human cytomegalovirus infection
ptr05165
Human papillomavirus infection
ptr05200
Pathways in cancer
ptr05205
Proteoglycans in cancer
ptr05206
MicroRNAs in cancer
ptr05207
Chemical carcinogenesis - receptor activation
ptr05208
Chemical carcinogenesis - reactive oxygen species
ptr05210
Colorectal cancer
ptr05211
Renal cell carcinoma
ptr05213
Endometrial cancer
ptr05214
Glioma
ptr05215
Prostate cancer
ptr05220
Chronic myeloid leukemia
ptr05221
Acute myeloid leukemia
ptr05223
Non-small cell lung cancer
ptr05224
Breast cancer
ptr05225
Hepatocellular carcinoma
ptr05226
Gastric cancer
ptr05231
Choline metabolism in cancer
Brite
KEGG Orthology (KO) [BR:
ptr00001
]
09130 Environmental Information Processing
09132 Signal transduction
04010 MAPK signaling pathway
452897 (SOS2)
04012 ErbB signaling pathway
452897 (SOS2)
04014 Ras signaling pathway
452897 (SOS2)
04630 JAK-STAT signaling pathway
452897 (SOS2)
04068 FoxO signaling pathway
452897 (SOS2)
04072 Phospholipase D signaling pathway
452897 (SOS2)
04151 PI3K-Akt signaling pathway
452897 (SOS2)
04150 mTOR signaling pathway
452897 (SOS2)
09140 Cellular Processes
09144 Cellular community - eukaryotes
04510 Focal adhesion
452897 (SOS2)
04540 Gap junction
452897 (SOS2)
09142 Cell motility
04810 Regulation of actin cytoskeleton
452897 (SOS2)
09150 Organismal Systems
09151 Immune system
04650 Natural killer cell mediated cytotoxicity
452897 (SOS2)
04660 T cell receptor signaling pathway
452897 (SOS2)
04662 B cell receptor signaling pathway
452897 (SOS2)
04664 Fc epsilon RI signaling pathway
452897 (SOS2)
04062 Chemokine signaling pathway
452897 (SOS2)
09152 Endocrine system
04910 Insulin signaling pathway
452897 (SOS2)
04912 GnRH signaling pathway
452897 (SOS2)
04915 Estrogen signaling pathway
452897 (SOS2)
04917 Prolactin signaling pathway
452897 (SOS2)
04926 Relaxin signaling pathway
452897 (SOS2)
04935 Growth hormone synthesis, secretion and action
452897 (SOS2)
09156 Nervous system
04722 Neurotrophin signaling pathway
452897 (SOS2)
09159 Environmental adaptation
04714 Thermogenesis
452897 (SOS2)
09160 Human Diseases
09161 Cancer: overview
05200 Pathways in cancer
452897 (SOS2)
05206 MicroRNAs in cancer
452897 (SOS2)
05205 Proteoglycans in cancer
452897 (SOS2)
05207 Chemical carcinogenesis - receptor activation
452897 (SOS2)
05208 Chemical carcinogenesis - reactive oxygen species
452897 (SOS2)
05231 Choline metabolism in cancer
452897 (SOS2)
09162 Cancer: specific types
05210 Colorectal cancer
452897 (SOS2)
05225 Hepatocellular carcinoma
452897 (SOS2)
05226 Gastric cancer
452897 (SOS2)
05214 Glioma
452897 (SOS2)
05221 Acute myeloid leukemia
452897 (SOS2)
05220 Chronic myeloid leukemia
452897 (SOS2)
05211 Renal cell carcinoma
452897 (SOS2)
05215 Prostate cancer
452897 (SOS2)
05213 Endometrial cancer
452897 (SOS2)
05224 Breast cancer
452897 (SOS2)
05223 Non-small cell lung cancer
452897 (SOS2)
09172 Infectious disease: viral
05161 Hepatitis B
452897 (SOS2)
05160 Hepatitis C
452897 (SOS2)
05163 Human cytomegalovirus infection
452897 (SOS2)
05165 Human papillomavirus infection
452897 (SOS2)
09165 Substance dependence
05034 Alcoholism
452897 (SOS2)
09176 Drug resistance: antineoplastic
01521 EGFR tyrosine kinase inhibitor resistance
452897 (SOS2)
01522 Endocrine resistance
452897 (SOS2)
BRITE hierarchy
SSDB
Ortholog
Paralog
Gene cluster
GFIT
Motif
Pfam:
RasGEF
RasGEF_N
RhoGEF
SOS1_NGEF_PH
PH
IQ_SEC7_PH
PH_19
RHG20_PH
PH_13
Motif
Other DBs
NCBI-GeneID:
452897
NCBI-ProteinID:
XP_016781538
Ensembl:
ENSPTRG00000006328
VGNC:
8216
UniProt:
K7CZM1
LinkDB
All DBs
Position
15:complement(48135652..48248484)
Genome browser
AA seq
1332 aa
AA seq
DB search
MQQAPQPYEFFSEENSPKWRGLLVSALRKVQEQVHPTLSANEESLYYIEELIFQLLNKLC
MAQPRTVQDVEERVQKTFPHPIDKWAIADAQSAIEKRKRRNPLLLPVDKIHPSLKEVLGY
KVDYHVSLYIVAVLEYISADILKLAGNYVFNIRHYEISQQDIKVSMCADKVLMDMFDQDD
IGLVSLCEDEPSSSGELNYYDLVRTEIAEERQYLRELNMIIKVFREAFLSDRKLFKPSDI
EKIFSNISDIHELTVKLLGLIEDTVEMTDESSPHPLAGSCFEDLAEEQAFDPYETLSQDI
LSPEFHEHFNKLMARPAVALHFQSIADGFKEAVRYVLPRLMLVPVYHCWHYFELLKQLKA
CSEEQEDRECLNQAITALMNLQGSMDRIYKQYSPRRRPGDPVCPFYSHQLRSKHLAIKKM
NEIQKNIDGWEGKDIGQCCNEFIMEGPLTRIGAKHERHIFLFDGLMISCKPNHGQTRLPG
YSSAEYRLKEKFVMRKIQICDKEDTCEYKHAFELVSKDENSIIFAAKSAEEKNNWMAALI
SLHYRSTLDRMLDSVLLKEENEQPLRLPSPEVYRFVVKDSEENIVFEDNLQSRSGIPIIK
GGTVVKLIERLTYHMYADPNFVRTFLTTYRSFCKPQELLSLLIERFEIPEPEPTDADKLA
IEKGEQPISADLKRFRKEYVQPVQLRILNVFRHWVEHHFYDFERDLELLERLESFISSVR
GKAMKKWVESIAKIIRRKKQAQANGISHNITFESPPPPIEWHISKPGQFETFDLMTLHPI
EIARQLTLLESDLYRKVQPSELVGSVWTKEDKEINSPNLLKMIRHTTNLTLWFEKCIVEA
ENFEERVAVLSRIIEILQVFQDLNNFNGVLEIVSAVNSVSVYRLDHTFEALQERKRKILD
EAVELSQDHFKKYLVKLKSINPPCVPFFGIYLTNILKTEEGNNDFLKKKGKDLINFSKRR
KVAEITGEIQQYQNQPYCLRIEPDMRRFFENLNPMGSASEKEFTDYLFNKSLEIEPRNCK
QPPRFPRKSTFSLKSPGIRPNTGRHGSTSGTLRGHPTPLEREPCKISFSRIAETELESTV
SAPTSPNTPSTPPVSASSDLSVFLDVDLNSSCGSNSIFAPVLLPHSKSFFSSCGSLHKLS
EEPLIPPPLPPRKKFDHDASNSKGNMKSDDDPPAIPPRQPPPPKVKPRVPVPTGAFDGPL
HSPPPPPPRDPLPDTPPPVPLRPPEHFINCPFNLQPPPLGHLHRDSDWLRDISTCPNSPS
TPPSTPSPRVPRRCYVLSSSQNNLAHPPAPPVPPRQNSSPHLPKLPPKTYKRELSHPPLY
RLPLLENAETPQ
NT seq
3999 nt
NT seq
+upstream
nt +downstream
nt
atgcagcaggcgccgcagccttacgagttcttcagcgaggagaacagtccgaaatggcgg
ggactgttggtctcggccctgcggaaggttcaggaacaagtgcatcccactctctcagct
aatgaagagtctctctattatattgaagagctgatttttcagctgcttaataaattatgc
atggcccagccaaggactgttcaagatgtagaggagcgagttcagaagacctttcctcac
ccaattgataaatgggccattgctgatgcacaatctgctatagaaaaacgaaaacgaaga
aatcctcttttactgcctgtggacaaaatccatccttcgttgaaggaagtattagggtac
aaagtggactaccatgtgtccctatatattgtggctgtactagagtatatctcagctgat
attttaaaattggctggtaattatgtttttaatatccggcattatgaaatatctcagcag
gacattaaagtgtcaatgtgtgcggataaggttttgatggacatgtttgatcaggatgac
ataggtttggtttctctctgtgaagatgaacctagttcttctggtgaattaaactactat
gatcttgtcagaactgaaatcgcagaagaaagacagtatctacgggaattaaatatgatc
ataaaagtgtttcgagaagcctttctttctgatagaaagctgtttaaaccttctgatatc
gaaaagatttttagtaacatttcagatatacatgaattgactgtgaaacttttaggtttg
attgaagacacagttgaaatgactgatgaaagcagtcctcatcccttagctggcagctgt
tttgaagatttggcagaagagcaagcatttgatccttatgaaacattatcacaggacatt
ctttcaccagagtttcatgaacatttcaataaattaatggccagacctgcagttgctcta
cactttcagtccattgctgatggttttaaagaggcagttcgttatgtccttccacgtctt
atgctggtgccagtgtatcactgttggcactactttgagttactaaagcaattgaaagca
tgtagtgaagaacaagaagacagagaatgtttgaaccaagctattactgctctcatgaat
ctccaaggtagcatggaccgaatttacaagcagtattcacctagacgtcgacctggagat
cctgtttgccctttttatagtcaccaattaagaagcaaacacctggctatcaaaaaaatg
aatgaaattcagaaaaatattgatggatgggaaggcaaagatattggacagtgttgtaat
gaattcattatggagggaccattgacaagaatcggtgccaaacatgaacggcatattttt
ctgtttgatggcttaatgatcagttgtaaacctaatcatggccagactcggcttccaggt
tacagtagtgcagaatacaggttaaaagaaaaatttgtcatgaggaaaatacaaatttgt
gataaagaagatacttgtgagtacaagcatgcatttgaattagtatccaaagatgagaac
agcataatatttgctgctaagtctgctgaagaaaaaaacaactggatggcagcccttatt
tctcttcattatcgtagtactctagatcgaatgttagattcagtattattgaaagaagaa
aatgagcaaccactgagattaccaagtcctgaagtatatcgttttgtagtaaaagactct
gaggaaaacattgtttttgaagacaacttgcaaagtagaagtggcatccccattattaaa
ggaggaactgtagtgaaattaattgaaaggttaacatatcatatgtatgcagatcccaat
tttgttcgtacttttcttaccacatatcgttcattttgtaaaccacaggaattgctgagc
ttactgattgaacggtttgaaattccagagccagaacctactgatgcagacaaattggca
atagagaaaggcgagcagccaatcagtgcagaccttaaaagatttcgcaaggaatatgtc
caaccagtacaacttaggatcttaaatgtatttcggcattgggttgaacatcatttttat
gactttgaaagagacttggaattgcttgaaagactagaatccttcatttcaagtgtaaga
gggaaagctatgaaaaaatgggtagagtcaatcgctaagatcatcaggaggaagaagcaa
gctcaggcaaatggaataagccataatattacctttgaaagtccacctccaccaattgaa
tggcatatcagcaaaccaggacagtttgaaacatttgatctcatgacacttcatccaata
gaaattgcacgtcagctgacacttttggagtctgatctctacaggaaagttcaaccgtct
gaacttgtagggagtgtgtggaccaaagaagataaagaaataaattctccaaatttatta
aaaatgattcgccataccacaaatctcaccctctggtttgaaaaatgcattgtggaagca
gaaaattttgaagaacgggtggcagtactaagtagaattatagaaattctgcaagttttt
caagatttgaataatttcaatggcgtattggagatagtcagtgcagtaaattcagtatca
gtatacagactagaccatacctttgaggcattgcaggaaagaaaaaggaaaattttggac
gaagctgtggaattaagtcaagatcactttaaaaaatacctagtaaaacttaagtcaatc
aatccaccttgtgtgcctttttttggaatatatttaacaaatattctgaagaccgaagaa
gggaataatgattttttaaaaaagaaagggaaagatttaatcaatttcagtaagaggagg
aaagtagctgaaattactggagaaattcagcagtatcagaatcagccttactgtttacgg
atagaaccagatatgaggagattctttgaaaaccttaacccaatgggaagtgcatctgaa
aaagagtttacagattatttgttcaacaagtcactagaaattgaacctcgaaactgcaaa
cagccacctcgatttcctaggaaatcaactttttccttaaaatctcctggaataaggcct
aacacaggccgacatggctctacctcaggtactttacgaggtcacccaacaccattagaa
agagaaccatgtaaaataagctttagtcggattgctgaaactgagctggaatcaacagtg
tcagcaccaacctctccaaatacaccatctactccaccagtatctgcttcttcagacctt
agtgtatttttagatgtggatctcaacagctcctgtggcagcaatagcatcttcgctcca
gtgcttttgccacattcaaagtctttctttagttcatgtggtagtttacataaactaagt
gaagagcccctgattcctcctcctcttcctcctcgaaaaaagtttgatcatgatgcttca
aattccaagggaaatatgaaatctgatgatgatcctcctgctattccaccgagacagcct
cctcctccaaaggtaaaacccagagttcctgttcctactggtgcatttgatgggcctctg
catagtccacctccgccaccaccaagagatcctcttcctgatacccctccaccagttccc
cttcggcctccagaacactttataaactgtccatttaatcttcagccacctccactgggg
catcttcacagagattcagactggctcagagacattagtacgtgtccaaattcgccaagc
actcctcctagcacaccctctccaagggtaccgcgtcgatgctatgtgctcagttctagt
cagaataatcttgctcatcctccagctccccctgttccaccaaggcagaattcaagccct
catctgccaaaactgccaccaaagacttacaaacgggagctttcgcaccccccattgtac
agactgcctttgctagaaaatgcagaaactccccaatga
Pan troglodytes (chimpanzee): 459171
Help
Entry
459171 CDS
T01005
Symbol
SOS1
Name
(RefSeq) son of sevenless homolog 1 isoform X2
KO
K03099
son of sevenless
Organism
ptr
Pan troglodytes (chimpanzee)
Pathway
ptr01521
EGFR tyrosine kinase inhibitor resistance
ptr01522
Endocrine resistance
ptr04010
MAPK signaling pathway
ptr04012
ErbB signaling pathway
ptr04014
Ras signaling pathway
ptr04062
Chemokine signaling pathway
ptr04068
FoxO signaling pathway
ptr04072
Phospholipase D signaling pathway
ptr04150
mTOR signaling pathway
ptr04151
PI3K-Akt signaling pathway
ptr04510
Focal adhesion
ptr04540
Gap junction
ptr04630
JAK-STAT signaling pathway
ptr04650
Natural killer cell mediated cytotoxicity
ptr04660
T cell receptor signaling pathway
ptr04662
B cell receptor signaling pathway
ptr04664
Fc epsilon RI signaling pathway
ptr04714
Thermogenesis
ptr04722
Neurotrophin signaling pathway
ptr04810
Regulation of actin cytoskeleton
ptr04910
Insulin signaling pathway
ptr04912
GnRH signaling pathway
ptr04915
Estrogen signaling pathway
ptr04917
Prolactin signaling pathway
ptr04926
Relaxin signaling pathway
ptr04935
Growth hormone synthesis, secretion and action
ptr05034
Alcoholism
ptr05160
Hepatitis C
ptr05161
Hepatitis B
ptr05163
Human cytomegalovirus infection
ptr05165
Human papillomavirus infection
ptr05200
Pathways in cancer
ptr05205
Proteoglycans in cancer
ptr05206
MicroRNAs in cancer
ptr05207
Chemical carcinogenesis - receptor activation
ptr05208
Chemical carcinogenesis - reactive oxygen species
ptr05210
Colorectal cancer
ptr05211
Renal cell carcinoma
ptr05213
Endometrial cancer
ptr05214
Glioma
ptr05215
Prostate cancer
ptr05220
Chronic myeloid leukemia
ptr05221
Acute myeloid leukemia
ptr05223
Non-small cell lung cancer
ptr05224
Breast cancer
ptr05225
Hepatocellular carcinoma
ptr05226
Gastric cancer
ptr05231
Choline metabolism in cancer
Brite
KEGG Orthology (KO) [BR:
ptr00001
]
09130 Environmental Information Processing
09132 Signal transduction
04010 MAPK signaling pathway
459171 (SOS1)
04012 ErbB signaling pathway
459171 (SOS1)
04014 Ras signaling pathway
459171 (SOS1)
04630 JAK-STAT signaling pathway
459171 (SOS1)
04068 FoxO signaling pathway
459171 (SOS1)
04072 Phospholipase D signaling pathway
459171 (SOS1)
04151 PI3K-Akt signaling pathway
459171 (SOS1)
04150 mTOR signaling pathway
459171 (SOS1)
09140 Cellular Processes
09144 Cellular community - eukaryotes
04510 Focal adhesion
459171 (SOS1)
04540 Gap junction
459171 (SOS1)
09142 Cell motility
04810 Regulation of actin cytoskeleton
459171 (SOS1)
09150 Organismal Systems
09151 Immune system
04650 Natural killer cell mediated cytotoxicity
459171 (SOS1)
04660 T cell receptor signaling pathway
459171 (SOS1)
04662 B cell receptor signaling pathway
459171 (SOS1)
04664 Fc epsilon RI signaling pathway
459171 (SOS1)
04062 Chemokine signaling pathway
459171 (SOS1)
09152 Endocrine system
04910 Insulin signaling pathway
459171 (SOS1)
04912 GnRH signaling pathway
459171 (SOS1)
04915 Estrogen signaling pathway
459171 (SOS1)
04917 Prolactin signaling pathway
459171 (SOS1)
04926 Relaxin signaling pathway
459171 (SOS1)
04935 Growth hormone synthesis, secretion and action
459171 (SOS1)
09156 Nervous system
04722 Neurotrophin signaling pathway
459171 (SOS1)
09159 Environmental adaptation
04714 Thermogenesis
459171 (SOS1)
09160 Human Diseases
09161 Cancer: overview
05200 Pathways in cancer
459171 (SOS1)
05206 MicroRNAs in cancer
459171 (SOS1)
05205 Proteoglycans in cancer
459171 (SOS1)
05207 Chemical carcinogenesis - receptor activation
459171 (SOS1)
05208 Chemical carcinogenesis - reactive oxygen species
459171 (SOS1)
05231 Choline metabolism in cancer
459171 (SOS1)
09162 Cancer: specific types
05210 Colorectal cancer
459171 (SOS1)
05225 Hepatocellular carcinoma
459171 (SOS1)
05226 Gastric cancer
459171 (SOS1)
05214 Glioma
459171 (SOS1)
05221 Acute myeloid leukemia
459171 (SOS1)
05220 Chronic myeloid leukemia
459171 (SOS1)
05211 Renal cell carcinoma
459171 (SOS1)
05215 Prostate cancer
459171 (SOS1)
05213 Endometrial cancer
459171 (SOS1)
05224 Breast cancer
459171 (SOS1)
05223 Non-small cell lung cancer
459171 (SOS1)
09172 Infectious disease: viral
05161 Hepatitis B
459171 (SOS1)
05160 Hepatitis C
459171 (SOS1)
05163 Human cytomegalovirus infection
459171 (SOS1)
05165 Human papillomavirus infection
459171 (SOS1)
09165 Substance dependence
05034 Alcoholism
459171 (SOS1)
09176 Drug resistance: antineoplastic
01521 EGFR tyrosine kinase inhibitor resistance
459171 (SOS1)
01522 Endocrine resistance
459171 (SOS1)
BRITE hierarchy
SSDB
Ortholog
Paralog
Gene cluster
GFIT
Motif
Pfam:
RasGEF
RasGEF_N
RhoGEF
SOS1_NGEF_PH
PH
PH_10
PH_19
IQ_SEC7_PH
RHG20_PH
Takusan
Motif
Other DBs
NCBI-GeneID:
459171
NCBI-ProteinID:
XP_009440628
Ensembl:
ENSPTRG00000011857
VGNC:
6658
UniProt:
A0A2I3T5C9
A0A6D2YAU4
LinkDB
All DBs
Position
12:83655379..83798295
Genome browser
AA seq
1318 aa
AA seq
DB search
MQAQQLPYEFFSEENAPKWRGLLVPALKKVQGQVHPTLESNDDALQYVEELILQLLNMLC
QAQPRSASDVEERVQKSFPHPIDKWAIADAQSAIEKRKRRNPLSLPVEKIHPLLKEVLGY
KIDHQVSVYIVAVLEYISADILKLVGNYVRNIRHYEITKQDIKVAMCADKVLMDMFHQDV
EDINILSLTDEEPSTSGEQTYYDLVKAFMAEIRQYIRELNLIIKVFREPFVSNSKLFSAN
DVENIFSRIVDIHELSVKLLGHIEDTVEMTDEGSPHPLVGSCFEDLAEELAFDPYESYAR
DILRPGFHDRFLSQLSKPGAALYLQSIGEGFKEAVQYVLPRLLLAPVYHCLHYFELLKQL
EEKSEDQEDKECLKQAITALLNVQSGMEKICSKSLAKRRLSESACRFYSQQMKGKQLAIK
KMNEIQKNIDGWEGKDIGQCCNEFIMEGTLTRVGAKHERHIFLFDGLMICCKSNHGQPRL
PGASNAEYRLKEKFFMRKVQINDKDDTNEYKHAFEIILKDENSVIFSAKSAEEKNNWMAA
LISLQYRSTLERMLDVTMLQEEKEEQMRLPSADVYRFAEPDSEENIIFEENMQPKAGIPI
IKAGTVIKLIERLTYHMYADPNFVRTFLTTYRSFCKPQELLSLIIERFEIPEPEPTEADR
IAIENGDQPLSAELKRFRKEYIQPVQLRVLNVCRHWVEHHFYDFERDAYLLQRMEEFIGT
VRGKAMKKWVESITKIIQRKKIARDNGPGHNITFQSSPPTVEWHISRPGHIETFDLLTLH
PIEIARQLTLLESDLYRAVQPSELVGSVWTKEDKEINSPNLLKMIRHTTNLTLWFEKCIV
ETENLEERVAVVSRIIEILQVFQELNNFNGVLEVVSAMNSSPVYRLDHTFEQIPSRQKKI
LEEAHELSEDHYKKYLAKLRSINPPCVPFFGIYLTNILKTEEGNPEVLKRHGKELINFSK
RRKVAEITGEIQQYQNQPYCLRVESDIKRFFENLNPMGNSMEKEFTDYLFNKSLEIEPRN
PKPLPRFPKKYSYPLKSPGVRPSNPRPGTMRHPTPLQQEPRKISYSRIPESETESTASAP
NSPRTPLTPPPASGASSTTDVCSVFDSDHSSPFHSRSASVSSISLTKGTDEVPVPPPVPP
RRRPESAPAESSPSKIMSKHLDSPPAIPPRQPTSKAYSPRYSLSDRTSISDPPESPPLLP
PREPVRTPDVFSSSPLHLQPPPLGKKSDHGNAFFPNSPSPFTPPPPQTPSPHGTRRHLPS
PPLTQEVDLHSIAGPPVPPRQSTSQHIPKLPPKTYKREHTHPSMHRDGPPLLENAHSS
NT seq
3957 nt
NT seq
+upstream
nt +downstream
nt
atgcaggcgcagcagctgccctacgagtttttcagcgaagagaacgcgcccaagtggcgg
ggactgctggtgcctgcgctgaaaaaggtccaggggcaagttcatcctactctcgagtct
aatgatgatgctcttcagtatgttgaagaattaattttgcaattattaaatatgctatgc
caagctcagccccgaagtgcttcagatgtagaggaacgtgttcaaaaaagtttccctcat
ccaattgataaatgggcaatagctgatgcccaatcagctattgaaaagaggaagcgaaga
aaccctttatctctcccagtagaaaaaattcatcctttattaaaggaggtcctaggttat
aaaattgaccaccaggtttctgtttacatagtagcagtcttagaatacatttctgcagac
attttaaagctggttgggaattatgtaagaaatatacggcattatgaaattacaaaacaa
gatattaaagtggcaatgtgtgctgacaaggtattgatggatatgtttcatcaagatgta
gaagatattaatatattatctttaactgatgaagagccttccacctcaggagaacaaact
tactatgatttggtaaaagcatttatggcagaaattcgacaatatataagggaactaaat
ctaattataaaagtttttagagagccctttgtctccaattcaaaattgttttcagctaat
gatgtagaaaatatatttagtcgcatagtagatatacatgaacttagtgtaaagttactg
ggccatatagaagatacggtagaaatgacagatgaaggcagtccccatccactagtagga
agctgctttgaagacttagcagaggaactggcatttgatccatatgaatcgtatgctcga
gatattttgcgacctggttttcatgatcgtttccttagtcagttatcaaagcctggggca
gcactttatttgcagtcaataggcgaaggtttcaaagaagctgttcaatatgttttaccc
aggctgcttctggcccctgtttaccactgtctccattactttgaacttttgaagcagtta
gaagaaaaaagtgaagatcaagaagacaaggaatgtttaaaacaagcaataacagctttg
cttaatgttcagagtggtatggaaaaaatatgttctaaaagtcttgcaaaacgaagactg
agtgaatctgcatgtcggttttatagtcagcaaatgaaggggaaacaactagcaatcaag
aaaatgaacgagattcagaagaatattgatggttgggagggaaaagacattggacagtgt
tgtaatgaatttataatggaaggaactcttacacgtgtaggagccaaacacgagagacac
atatttctctttgatggcttaatgatttgctgtaaatcaaatcacgggcagccaagactt
cctggtgctagcaatgcagaatatcgtcttaaagaaaagttttttatgcgaaaggtacaa
attaatgataaagatgacaccaatgaatacaagcatgcttttgaaataattttaaaagat
gaaaatagtgttatattttctgccaagtcagctgaagagaaaaacaattggatggcagca
ttgatatctttacagtaccggagtacactggaaaggatgcttgatgtaacaatgctacag
gaagagaaagaggagcagatgaggctgcctagtgctgatgtttatagatttgcagagcct
gactctgaagagaatattatatttgaagagaacatgcagcccaaggctggaattccaatt
atcaaagcaggaactgttattaaacttatagagaggcttacgtaccatatgtacgcagat
cccaattttgttcggacatttcttacaacatacagatccttttgcaaacctcaagaacta
ctgagtcttataatagaaaggtttgaaattccagagcctgagccaacagaagctgatcgc
atagctatagagaatggagatcaacccttgagtgcagaactgaaaagatttagaaaagaa
tatatacagcctgtgcaactgcgagtattaaatgtatgtcggcactgggtagagcaccac
ttctatgattttgaaagagatgcatatcttttgcaacgaatggaagaatttattggaaca
gtaagaggtaaagcaatgaaaaaatgggttgaatccatcactaaaataatccaaaggaaa
aaaattgcaagagacaatggaccaggtcataatattacatttcagagttcacctcccaca
gttgagtggcatataagcagacctgggcacatagagacttttgacctgctcaccttacac
ccaatagaaattgctcgacaactcactttacttgaatcagatctataccgagctgtacag
ccatcagaattagttggaagtgtgtggacaaaagaagacaaagaaattaactctcctaat
cttctgaaaatgattcgacataccaccaacctcactctgtggtttgagaaatgtattgta
gaaactgaaaatttagaagaaagagtagctgtggtgagtcgaattattgagattctacaa
gtctttcaagagttgaacaactttaatggtgtccttgaggttgtcagtgctatgaattca
tcacctgtttacagactagaccacacatttgagcaaataccaagtcgccagaagaaaatt
ttagaagaagctcatgaattgagtgaagatcactataagaaatatttggcaaaactcagg
tctattaatccaccatgtgtgcctttctttggaatttatctcactaatatcttgaaaaca
gaagaaggcaaccctgaggtcctaaaaagacatggaaaagagcttataaactttagcaaa
aggaggaaagtagcagaaataacaggagagatccagcagtaccaaaatcagccttactgt
ttacgagtagaatcagatatcaaaaggttctttgaaaacttgaatccgatgggaaatagc
atggagaaggaatttacagattatcttttcaacaaatccctagaaatagaaccacgaaac
cctaagcctctcccaagatttccaaaaaaatatagctatcccctaaaatctcctggtgtt
cgtccatcaaacccaagaccaggtaccatgaggcatcccacacctctgcagcaggagcca
aggaaaattagttatagtaggatccctgaaagtgaaacagaaagtacagcatctgcacca
aattctccaagaacaccgttaacacctccgcctgcttctggtgcttccagtaccacagat
gtttgcagtgtatttgattccgatcattcgagcccttttcactcaagatctgcttctgta
tcatctataagtttaaccaaaggcactgatgaagtgcctgtccctcctcctgttcctcca
cgaagacgaccagaatctgccccagcagaatcttcaccatctaagattatgtctaagcat
ttggacagtcccccagccattcctcctaggcaacccacatcaaaagcctattcaccacga
tattcactatcagaccggacctctatctcagaccctcctgaaagccctcccttattacca
ccacgagaacctgtgaggacacctgatgttttctcaagctcaccactacatctccaacct
ccccctttgggcaaaaaaagtgaccatggcaatgccttcttcccaaacagcccttccccc
tttacaccacctcctcctcaaacaccttctcctcacggcacaagaaggcatctgccatca
ccaccattgacacaagaagtggaccttcattccattgctgggccgcctgttcctccacga
caaagcacttctcaacatatccctaaactccctccaaaaacttacaaaagagagcacaca
cacccatccatgcacagagatggaccaccactgttggagaatgcccattcttcctga
DBGET
integrated database retrieval system