Equus caballus (horse): 100053889
Help
Entry
100053889 CDS
T01058
Gene name
SOS1
Definition
(RefSeq) son of sevenless homolog 1 isoform X1
KO
K03099
son of sevenless
Organism
ecb
Equus caballus (horse)
Pathway
ecb01521
EGFR tyrosine kinase inhibitor resistance
ecb01522
Endocrine resistance
ecb04010
MAPK signaling pathway
ecb04012
ErbB signaling pathway
ecb04014
Ras signaling pathway
ecb04062
Chemokine signaling pathway
ecb04068
FoxO signaling pathway
ecb04072
Phospholipase D signaling pathway
ecb04150
mTOR signaling pathway
ecb04151
PI3K-Akt signaling pathway
ecb04510
Focal adhesion
ecb04540
Gap junction
ecb04630
JAK-STAT signaling pathway
ecb04650
Natural killer cell mediated cytotoxicity
ecb04660
T cell receptor signaling pathway
ecb04662
B cell receptor signaling pathway
ecb04664
Fc epsilon RI signaling pathway
ecb04714
Thermogenesis
ecb04722
Neurotrophin signaling pathway
ecb04810
Regulation of actin cytoskeleton
ecb04910
Insulin signaling pathway
ecb04912
GnRH signaling pathway
ecb04915
Estrogen signaling pathway
ecb04917
Prolactin signaling pathway
ecb04926
Relaxin signaling pathway
ecb04935
Growth hormone synthesis, secretion and action
ecb05034
Alcoholism
ecb05160
Hepatitis C
ecb05161
Hepatitis B
ecb05163
Human cytomegalovirus infection
ecb05165
Human papillomavirus infection
ecb05200
Pathways in cancer
ecb05205
Proteoglycans in cancer
ecb05206
MicroRNAs in cancer
ecb05210
Colorectal cancer
ecb05211
Renal cell carcinoma
ecb05213
Endometrial cancer
ecb05214
Glioma
ecb05215
Prostate cancer
ecb05220
Chronic myeloid leukemia
ecb05221
Acute myeloid leukemia
ecb05223
Non-small cell lung cancer
ecb05224
Breast cancer
ecb05225
Hepatocellular carcinoma
ecb05226
Gastric cancer
ecb05231
Choline metabolism in cancer
Brite
KEGG Orthology (KO) [BR:
ecb00001
]
09130 Environmental Information Processing
09132 Signal transduction
04010 MAPK signaling pathway
100053889 (SOS1)
04012 ErbB signaling pathway
100053889 (SOS1)
04014 Ras signaling pathway
100053889 (SOS1)
04630 JAK-STAT signaling pathway
100053889 (SOS1)
04068 FoxO signaling pathway
100053889 (SOS1)
04072 Phospholipase D signaling pathway
100053889 (SOS1)
04151 PI3K-Akt signaling pathway
100053889 (SOS1)
04150 mTOR signaling pathway
100053889 (SOS1)
09140 Cellular Processes
09144 Cellular community - eukaryotes
04510 Focal adhesion
100053889 (SOS1)
04540 Gap junction
100053889 (SOS1)
09142 Cell motility
04810 Regulation of actin cytoskeleton
100053889 (SOS1)
09150 Organismal Systems
09151 Immune system
04650 Natural killer cell mediated cytotoxicity
100053889 (SOS1)
04660 T cell receptor signaling pathway
100053889 (SOS1)
04662 B cell receptor signaling pathway
100053889 (SOS1)
04664 Fc epsilon RI signaling pathway
100053889 (SOS1)
04062 Chemokine signaling pathway
100053889 (SOS1)
09152 Endocrine system
04910 Insulin signaling pathway
100053889 (SOS1)
04912 GnRH signaling pathway
100053889 (SOS1)
04915 Estrogen signaling pathway
100053889 (SOS1)
04917 Prolactin signaling pathway
100053889 (SOS1)
04926 Relaxin signaling pathway
100053889 (SOS1)
04935 Growth hormone synthesis, secretion and action
100053889 (SOS1)
09156 Nervous system
04722 Neurotrophin signaling pathway
100053889 (SOS1)
09159 Environmental adaptation
04714 Thermogenesis
100053889 (SOS1)
09160 Human Diseases
09161 Cancer: overview
05200 Pathways in cancer
100053889 (SOS1)
05206 MicroRNAs in cancer
100053889 (SOS1)
05205 Proteoglycans in cancer
100053889 (SOS1)
05231 Choline metabolism in cancer
100053889 (SOS1)
09162 Cancer: specific types
05210 Colorectal cancer
100053889 (SOS1)
05225 Hepatocellular carcinoma
100053889 (SOS1)
05226 Gastric cancer
100053889 (SOS1)
05214 Glioma
100053889 (SOS1)
05221 Acute myeloid leukemia
100053889 (SOS1)
05220 Chronic myeloid leukemia
100053889 (SOS1)
05211 Renal cell carcinoma
100053889 (SOS1)
05215 Prostate cancer
100053889 (SOS1)
05213 Endometrial cancer
100053889 (SOS1)
05224 Breast cancer
100053889 (SOS1)
05223 Non-small cell lung cancer
100053889 (SOS1)
09165 Substance dependence
05034 Alcoholism
100053889 (SOS1)
09172 Infectious disease: viral
05161 Hepatitis B
100053889 (SOS1)
05160 Hepatitis C
100053889 (SOS1)
05163 Human cytomegalovirus infection
100053889 (SOS1)
05165 Human papillomavirus infection
100053889 (SOS1)
09176 Drug resistance: antineoplastic
01521 EGFR tyrosine kinase inhibitor resistance
100053889 (SOS1)
01522 Endocrine resistance
100053889 (SOS1)
09180 Brite Hierarchies
09183 Protein families: signaling and cellular processes
04990 Domain-containing proteins not elsewhere classified [BR:
ecb04990
]
100053889 (SOS1)
Domain-containing proteins not elsewhere classified [BR:
ecb04990
]
Pleckstrin homology (PH) domain-containing proteins
Dbl-Like RhoGEF family proteins
100053889 (SOS1)
BRITE hierarchy
SSDB
Ortholog
Paralog
GFIT
Motif
Pfam:
RasGEF
RasGEF_N
RhoGEF
Histone
PH
PH_19
PH_10
IQ_SEC7_PH
PH_13
Motif
Other DBs
NCBI-GeneID:
100053889
NCBI-ProteinID:
XP_005600089
Ensembl:
ENSECAG00000017604
VGNC:
23454
UniProt:
F6RQQ4
LinkDB
All DBs
Position
15
AA seq
1333 aa
AA seq
DB search
MQAQQLPYEFFSEENAPKWRGLLVPALKKVQGQVHPTLESSDDALQYVEELILQLLNMLC
QAQPRSASDVEERVQKSFPHPIDKWAIADAQSAIEKRKRRNPLSLPVEKIHPLLKEVLGY
KIDHQVSVYIVAVLEYISADILKLVGNYVRNIRHYEITKQDIKVAMCADKVLMDMFHQDV
EDINILSLTDEEPSTSGEQTYYDLVKAFMAEIRQYIRELNLIIKVFREPFVSNSKLFSAN
DVENIFSRIVDIHELSVKLLGHIEDTVEMTDEGSPHPLVGSCFEDLAEELAFDPYESYAR
DILRPGFHDHFLSQLSKPGAALYLQSIGEGFKEAVQYVLPRLLLAPVYHCLHYFELLKQL
EEKSEDQEDKECLKQAITALLNVQSGMEKICSKSLAKRRLSESACRFYSQQMKGKQLAIK
KMNEIQKNIDGWEGKDIGQCCNEFIMEGTLTRVGAKHERHIFLFDGLMICCKSNHGQPRL
PGASNAEYRLKEKFFMRKVQINDKDDTSEYKHAFEIILKDENSVIFSAKSAEEKNNWMAA
LISLQYRSTLERMLDVTMLQEEKEEQMRLPSADVYRFAEPDSEENIIFEENMQPKAGIPI
IKAGTVIKLIERLTYHMYADPNFVRTFLTTYRSFCKPQELLSLIIERFEIPEPEPTEADR
IAIENGDQPLSAELKRFRKEYIQPVQLRVLNVCRHWVEHHFYDFERDADLLQRMEEFIGT
VRGKAMKKWVESITKIIQRKKIARDNGPGHNITFQSSPPTVEWHISRPGHIETFDLLTLH
PIEIARQLTLLESDLYRAVQPSELVGSVWTKEDKEINSPNLLKMIRHTTNLTLWFEKCIV
ETENLEERVAVVNRIIEILQVFQELNNFNGVLEVVSAMNSSPVYRLDHTFEQIPSRQKKI
LEEAHELSEDHYKKYLAKLRSINPPCVPFFGIYLTNILKTEEGNPEVLKRHGKDLINFSK
RRKVAEITGEIQQYQNQPYCLRVESDIKRFFENLNPMGNSMEKEFTDYLFNKSLEIEPRN
PKPLPRFPKKYSYPLKSPGVRPSNPRPGTMRHPTPLQQEPRKISYSRIPESETESTASAP
NSPRTPLTPPPASGASSTTDVCSVFDSDHSSPFHSSSDTVFIQVTLPHGPRSASVSSISL
TKGTDEVPVPPPVPPRRRPESAPAESSPSKIMSKHLDSPPAIPPRQPTSKAYSPRYSISE
RTSISDPPESPPLLPPREPVRTPDVFSSSPLHLQPPPLGKKSDHGNAFFPNSPSPFTPPP
PQTPSPHGTRRHLPSPPLTQEVDLHSIAGPPVPPRQSTSQHIPKLPPKTYKREHTHPSMH
RDGPPLLENAHSS
NT seq
4002 nt
NT seq
+upstream
nt +downstream
nt
atgcaggcgcagcagctgccgtacgagtttttcagcgaggagaacgcgcccaagtggcgg
gggctgctggtgccggcgctgaaaaaggtccaggggcaagttcatcctactcttgagtct
agtgatgatgctcttcagtatgttgaggaattaattttgcagttattaaatatgctatgc
caagctcagcctcgaagtgcttcagatgtagaggaacgtgttcaaaaaagtttccctcat
ccgattgataagtgggcaatagctgatgcccagtcagctattgaaaagaggaagcgaaga
aaccctttatctctcccagtagaaaaaattcatcctttattaaaggaggtcctaggttat
aaaattgaccaccaggtttctgtttacatagtagcagtattagagtatatttctgcagac
attttaaagctggtagggaattatgtacggaatatacggcattatgaaattacaaaacaa
gatattaaagtggcaatgtgtgctgataaggtattgatggatatgtttcatcaagatgta
gaagatataaatatattatctttaactgatgaagaaccttccacctcaggagagcaaact
tattatgatttggtaaaagcatttatggcagaaattcgacaatacataagggaactaaat
ttaattataaaagtttttagagagccctttgtctccaattcaaaattgttttcagctaat
gatgtagaaaatatatttagtcgtatagtagatatacacgaacttagtgtaaagttactg
ggccacatagaagatactgtagaaatgacagatgaaggcagtccccatccattagtagga
agctgctttgaagacttagcagaggaactggcatttgatccatatgaatcatatgctcga
gatattttacgacctggctttcatgatcatttccttagtcagctatcaaagcctggagca
gcactctatttgcagtcaataggcgaaggtttcaaagaagctgttcagtatgttttaccc
aggctacttctagcccctgtttaccactgtctacattactttgaacttttgaagcagtta
gaagaaaagagtgaagatcaagaagacaaggaatgtttgaaacaagcaataacagctttg
cttaatgttcagagtggtatggaaaaaatatgttctaaaagtcttgcaaaacgaagactg
agtgaatctgcatgtcggttttatagtcagcaaatgaaggggaaacaactagcaatcaag
aaaatgaacgagattcagaagaatattgatggatgggagggaaaagacattggacagtgt
tgcaatgagtttataatggaaggaactcttacacgtgtaggagccaaacatgagagacac
atatttctctttgatggcttaatgatttgctgtaaatcgaatcatgggcaaccaagactt
cctggtgctagcaatgcagaatatcgtcttaaagaaaagttttttatgagaaaggtacaa
attaatgacaaagatgacaccagtgagtacaagcatgcttttgaaataattttaaaagat
gaaaatagtgttatattttctgccaagtcagctgaagagaaaaacaattggatggcagca
ttgatatctttacagtaccggagtacactggaaaggatgcttgatgtgacaatgctacag
gaggagaaggaggagcagatgaggctccctagtgctgatgtttatagatttgcagagccc
gactctgaagagaatattatatttgaagaaaacatgcagcccaaggctggaattccaatt
atcaaagcaggaactgttattaaacttatagagaggctcacataccacatgtatgcagat
cccaattttgttcggacatttcttacaacatacagatccttttgtaaacctcaagaactg
ctgagtcttataatagaaaggtttgaaattccagagcctgagccaacagaagctgatcgc
atagctatagagaatggagatcaacccttgagtgcagaactaaaaaggtttagaaaagag
tatatacagcctgtacaactgcgagtattaaatgtatgtcggcactgggtagaacaccac
ttctatgattttgaaagagatgcagatcttttgcagcgaatggaagaatttattggaaca
gtaagaggtaaagcaatgaaaaaatgggtcgaatccatcactaaaataatccaaaggaaa
aaaattgcaagagacaatggaccaggtcataatattacatttcagagttcacctcccaca
gttgagtggcatataagcagacctgggcacatagagacttttgacctgctcaccttacac
ccaatagaaattgctcgacaacttactttacttgaatcagatctgtatcgagctgtacag
ccatcagaattagttggaagtgtgtggacaaaagaagacaaagaaattaattctcctaat
cttctgaaaatgatccggcacaccactaatctcactttgtggtttgagaaatgtattgta
gaaactgaaaacttagaagaaagagtagctgtggtgaatcgaataattgagattctgcaa
gtctttcaagagctgaacaacttcaatggcgtccttgaggttgttagtgctatgaactca
tcacctgtttacagactagaccacacgtttgagcaaataccaagtcgccaaaagaaaatt
ttagaagaagctcatgaattaagtgaagatcactataagaaatatttggcaaaactcagg
tctattaatccaccatgtgtgcctttctttggaatttatctaactaatatcttgaaaaca
gaagaaggcaaccctgaggtcctgaaaaggcatggaaaagaccttataaactttagcaaa
aggaggaaggtagcagaaattacaggagagatccagcagtaccaaaatcagccttattgt
ttacgagtagaatcggatatcaaaaggttctttgaaaacttgaatccaatgggaaatagc
atggagaaagaatttacagattatcttttcaacaaatccctagaaatagaaccacgaaat
cctaagcctctcccaagatttccaaaaaaatatagctatcccctaaaatctcctggtgtt
cgtccatcaaacccaagaccaggtaccatgagacatcccacacctctgcagcaggagcca
aggaaaattagttatagtaggatccctgaaagtgaaacagagagtacagcatctgcacca
aattctccaagaacaccgttaacacctcctcctgcttctggtgcttccagtaccacagat
gtttgcagcgtatttgattctgatcattcgagcccttttcactcaagcagcgataccgtc
tttatccaagttacactgccccatggcccaagatctgcttcagtatcatctataagttta
accaaaggcactgatgaagtgcctgtcccccctcctgttcctccacgaagacgaccagaa
tctgccccagcggaatcttcgccatctaagattatgtctaagcatttggacagcccccca
gcaattcctcctaggcaacccacatcaaaagcctattcaccacgctattcaatatcagag
cggacctctatatcagaccctcctgagagccctcccttattaccaccacgagaacctgtg
aggacacctgatgttttctcaagctcaccactacatctccaacctccccctttgggcaaa
aaaagtgaccatggcaatgccttcttcccaaacagcccttctccctttacaccacctcct
cctcaaacaccttctcctcacggcacgagaaggcatctgccgtcaccaccattgacacaa
gaagtggaccttcattccattgctgggccgcccgttcctcctcgacaaagcacttctcag
catatccctaaactccctcccaaaacttacaaaagggagcacacacacccatccatgcac
agagatggaccaccactgttggagaatgcccattcttcctga
Equus caballus (horse): 100066282
Help
Entry
100066282 CDS
T01058
Gene name
SOS2
Definition
(RefSeq) son of sevenless homolog 2 isoform X1
KO
K03099
son of sevenless
Organism
ecb
Equus caballus (horse)
Pathway
ecb01521
EGFR tyrosine kinase inhibitor resistance
ecb01522
Endocrine resistance
ecb04010
MAPK signaling pathway
ecb04012
ErbB signaling pathway
ecb04014
Ras signaling pathway
ecb04062
Chemokine signaling pathway
ecb04068
FoxO signaling pathway
ecb04072
Phospholipase D signaling pathway
ecb04150
mTOR signaling pathway
ecb04151
PI3K-Akt signaling pathway
ecb04510
Focal adhesion
ecb04540
Gap junction
ecb04630
JAK-STAT signaling pathway
ecb04650
Natural killer cell mediated cytotoxicity
ecb04660
T cell receptor signaling pathway
ecb04662
B cell receptor signaling pathway
ecb04664
Fc epsilon RI signaling pathway
ecb04714
Thermogenesis
ecb04722
Neurotrophin signaling pathway
ecb04810
Regulation of actin cytoskeleton
ecb04910
Insulin signaling pathway
ecb04912
GnRH signaling pathway
ecb04915
Estrogen signaling pathway
ecb04917
Prolactin signaling pathway
ecb04926
Relaxin signaling pathway
ecb04935
Growth hormone synthesis, secretion and action
ecb05034
Alcoholism
ecb05160
Hepatitis C
ecb05161
Hepatitis B
ecb05163
Human cytomegalovirus infection
ecb05165
Human papillomavirus infection
ecb05200
Pathways in cancer
ecb05205
Proteoglycans in cancer
ecb05206
MicroRNAs in cancer
ecb05210
Colorectal cancer
ecb05211
Renal cell carcinoma
ecb05213
Endometrial cancer
ecb05214
Glioma
ecb05215
Prostate cancer
ecb05220
Chronic myeloid leukemia
ecb05221
Acute myeloid leukemia
ecb05223
Non-small cell lung cancer
ecb05224
Breast cancer
ecb05225
Hepatocellular carcinoma
ecb05226
Gastric cancer
ecb05231
Choline metabolism in cancer
Brite
KEGG Orthology (KO) [BR:
ecb00001
]
09130 Environmental Information Processing
09132 Signal transduction
04010 MAPK signaling pathway
100066282 (SOS2)
04012 ErbB signaling pathway
100066282 (SOS2)
04014 Ras signaling pathway
100066282 (SOS2)
04630 JAK-STAT signaling pathway
100066282 (SOS2)
04068 FoxO signaling pathway
100066282 (SOS2)
04072 Phospholipase D signaling pathway
100066282 (SOS2)
04151 PI3K-Akt signaling pathway
100066282 (SOS2)
04150 mTOR signaling pathway
100066282 (SOS2)
09140 Cellular Processes
09144 Cellular community - eukaryotes
04510 Focal adhesion
100066282 (SOS2)
04540 Gap junction
100066282 (SOS2)
09142 Cell motility
04810 Regulation of actin cytoskeleton
100066282 (SOS2)
09150 Organismal Systems
09151 Immune system
04650 Natural killer cell mediated cytotoxicity
100066282 (SOS2)
04660 T cell receptor signaling pathway
100066282 (SOS2)
04662 B cell receptor signaling pathway
100066282 (SOS2)
04664 Fc epsilon RI signaling pathway
100066282 (SOS2)
04062 Chemokine signaling pathway
100066282 (SOS2)
09152 Endocrine system
04910 Insulin signaling pathway
100066282 (SOS2)
04912 GnRH signaling pathway
100066282 (SOS2)
04915 Estrogen signaling pathway
100066282 (SOS2)
04917 Prolactin signaling pathway
100066282 (SOS2)
04926 Relaxin signaling pathway
100066282 (SOS2)
04935 Growth hormone synthesis, secretion and action
100066282 (SOS2)
09156 Nervous system
04722 Neurotrophin signaling pathway
100066282 (SOS2)
09159 Environmental adaptation
04714 Thermogenesis
100066282 (SOS2)
09160 Human Diseases
09161 Cancer: overview
05200 Pathways in cancer
100066282 (SOS2)
05206 MicroRNAs in cancer
100066282 (SOS2)
05205 Proteoglycans in cancer
100066282 (SOS2)
05231 Choline metabolism in cancer
100066282 (SOS2)
09162 Cancer: specific types
05210 Colorectal cancer
100066282 (SOS2)
05225 Hepatocellular carcinoma
100066282 (SOS2)
05226 Gastric cancer
100066282 (SOS2)
05214 Glioma
100066282 (SOS2)
05221 Acute myeloid leukemia
100066282 (SOS2)
05220 Chronic myeloid leukemia
100066282 (SOS2)
05211 Renal cell carcinoma
100066282 (SOS2)
05215 Prostate cancer
100066282 (SOS2)
05213 Endometrial cancer
100066282 (SOS2)
05224 Breast cancer
100066282 (SOS2)
05223 Non-small cell lung cancer
100066282 (SOS2)
09165 Substance dependence
05034 Alcoholism
100066282 (SOS2)
09172 Infectious disease: viral
05161 Hepatitis B
100066282 (SOS2)
05160 Hepatitis C
100066282 (SOS2)
05163 Human cytomegalovirus infection
100066282 (SOS2)
05165 Human papillomavirus infection
100066282 (SOS2)
09176 Drug resistance: antineoplastic
01521 EGFR tyrosine kinase inhibitor resistance
100066282 (SOS2)
01522 Endocrine resistance
100066282 (SOS2)
09180 Brite Hierarchies
09183 Protein families: signaling and cellular processes
04990 Domain-containing proteins not elsewhere classified [BR:
ecb04990
]
100066282 (SOS2)
Domain-containing proteins not elsewhere classified [BR:
ecb04990
]
Pleckstrin homology (PH) domain-containing proteins
Dbl-Like RhoGEF family proteins
100066282 (SOS2)
BRITE hierarchy
SSDB
Ortholog
Paralog
GFIT
Motif
Pfam:
RasGEF
RasGEF_N
RhoGEF
PH
Histone
PH_19
IQ_SEC7_PH
PH_13
Motif
Other DBs
NCBI-GeneID:
100066282
NCBI-ProteinID:
XP_023480621
Ensembl:
ENSECAG00000004420
VGNC:
23455
LinkDB
All DBs
Position
1
AA seq
1332 aa
AA seq
DB search
MQQAPQPYEFFSEENSPRWRGLLVPALRKVQEQVHPNLSANEESLYYIEELIFQLLNKLC
MAQPRTVQDVEERVQKTFPHPIDKWAIADAQSAIEKRKRRHPLLLPVDKIHPSLKEVLGY
KVDYHVSLYIVAVLEYISADILKLAGNYVFNIRHYEISQQDIKVSMCADKVLMDMFDQDD
IGLVSLCEDEPSSSGELNYYDLVRTEIAEERQYLRELNMIIKVFREAFLSDRKLFKPSDI
EKIFSNILDIHELTVKLLGLIEDTVEMTDESSPHPLAGSCFEDLAEEQAFDPYETLSQDI
LSPKFNEHFSKLMARPAVALHFQSIADGFREAVRYVLPRLMLVPVYHCWHYFELLKQLKA
CSEEQEDRECLNQAITALMNLQGSMDRIYKQYSPRRRPGDPVCPFYNRQLRSKHLAIKKM
NEIQKNIDGWEGKDIGQCCNEFIMEGPLTKIGAKHERHIFLFDGLMISCKPNHSQSRLPG
YSSAEYRLKEKFVMRKIQICDKEDTCECRHAFELVSKDENSITFAAKSAEEKNNWMAALI
SLHYRSTLDRMLDSVLLKEENEQPLRLPSPEMYRFVVKDSEENIVFEDSLQSRSGIPIIK
GGTVVKLIERLTYHMYADPNFVRTFLTTYRSFCKPQELLSLLIERFEIPEPEPTEADKLA
VEKGEQPISADLKRFRKEYVQPVQLRILNVFRHWVEHHFYDFERDLELLERLESFISSVR
GKAMKKWVESIAKIIKRKKQTQANGISHNITFESPPPPIEWHISRPGQFETFDLMTLHPI
EIARQLTLLESDLYRKVQPSELVGSVWTKEDKEINSPNLLKMIRHTTNLTLWFEKCIVEA
ENFEERVAVLSRVIEILQVFQDLNNFNGVLEIVSAVNSVSVYRLDHTFEALQERKRRILD
EAVELSQDHFKKYLVKLKSINPPCVPFFGIYLTNILKTEEGNNDFLKKKGKDLINFSKRR
KVAEITGEIQQYQNQPYCLRIEPEMRRFFENLNPMGSSSEKEFTDYLFNKSLEIEPRNCK
QPPRFPRKSTFSLKSPGIRPNTGRHGSTSGTLRGHPTPLEREPCKISFSRIAEAELGSAV
SAPTSPNTPSTPPASAASDLSVFPDVDLSASCGSNSIFAPVLLPHSKSFFSSCGSLHKLS
DEPLIPPPLPPRKKFDHDASNPKGNMKSDDDPPAIPPRQPPPPKVKPRVPAPSGAFEGSL
HSPPPPPPREPLPDTPPPVPLRPPEHFINCPFNLQPPPLGHLHRDPDWFRDVSTRPDSPN
TPPSTPSPRVPRRCCVLSSSHSNLTYPQAPPVPPRQNSSPHLPKLPPKTYKRELSHPPMY
RLSLLENAETPQ
NT seq
3999 nt
NT seq
+upstream
nt +downstream
nt
atgcagcaggcgccgcagccgtacgagttcttcagcgaggagaacagcccgagatggcgg
gggctgctggtgccggccctgcggaaggttcaggagcaggtacatcccaatctctcagct
aatgaagaatctctctattacattgaagagctgatttttcaactgcttaataaattatgc
atggctcagccaaggactgttcaagatgtggaggaacgagttcaaaagacctttccgcat
ccaattgataaatgggctattgctgatgcacaatccgccatagagaaacgaaaacgaaga
caccctctcttactgcctgtggacaaaatccatccttcattgaaggaggttttagggtac
aaagtggactaccatgtgtccctgtatattgtggctgtactggagtatatctcagctgat
attttaaaattggctggtaattatgtttttaatattcgacattatgaaatatctcagcag
gacattaaagtgtcaatgtgtgcagataaggttttgatggacatgtttgatcaggatgac
ataggtttggtttctctctgtgaagatgagccgagttcttcaggtgaattgaattactat
gaccttgtcagaactgaaattgcagaagaaagacagtatctccgggaactaaatatgatc
ataaaagtatttcgagaagcctttctttctgacagaaagctgtttaaaccttctgatatt
gagaagattttcagtaacattttagatatacatgaattgactgtgaaacttctaggtttg
attgaggacacagttgagatgactgatgaaagcagtcctcatcccttagctggcagctgt
ttcgaagatttggcagaagagcaggcttttgatccttatgaaacattatcacaggacatc
ctttcaccaaaattcaatgaacatttcagtaagttgatggccagacctgcagtggctcta
cactttcagtccattgctgatgggttcagagaggcggttcgttatgtccttccacgcctg
atgcttgtgccagtgtatcattgttggcactacttcgaattattaaagcaattgaaagca
tgtagtgaagaacaagaagacagagaatgtttgaaccaagctattactgctctcatgaat
ctccagggtagtatggaccgaatttataagcagtattcgcctagacgtcgacctggggat
cctgtttgccctttttataatcgtcaattaagaagcaagcacctggctattaaaaaaatg
aatgaaattcagaaaaacatagatggatgggaaggcaaagacatcggacagtgttgtaat
gaattcattatggaaggtcctttgacaaaaattggtgctaaacatgaacggcatattttt
ctctttgatggcttaatgattagctgcaaacccaatcatagccagtcgcggcttccaggg
tacagtagtgcagaatacagactgaaagagaagtttgtcatgaggaaaatacaaatatgt
gataaagaagacacttgcgagtgcagacatgcttttgaattagtctccaaagatgaaaac
agcatcacatttgctgccaagtctgccgaagagaagaataactggatggcagcactcatt
tctcttcattaccgtagcacactggaccgaatgctggactcagtgttactgaaggaagaa
aatgagcagccactgaggttaccaagtcctgagatgtatcgtttcgtggtgaaagactct
gaggaaaacatagtttttgaagacagcttgcaaagtagaagtggaatccccattattaaa
ggaggaactgtggtgaaattaattgaaaggttgacatatcatatgtatgcagatcccaat
tttgttcgtacttttcttactacatatcgttcattttgtaaaccacaggaattgctaagc
ttactaattgaacgatttgaaattccagagccagaacctactgaagcagataaattggca
gtagagaaaggcgagcagcccatcagtgcagaccttaaaaggtttcgcaaggaatacgtc
caaccagtacaacttaggatcttaaatgtgtttcggcactgggttgaacaccatttttat
gattttgaaagagatttggagttgcttgaacggctagaatccttcatttcaagtgtaaga
gggaaagctatgaagaaatgggtagagtcaattgctaagatcatcaagaggaagaaacag
actcaggcaaacggaataagccataatatcacctttgaaagcccaccccccccgattgaa
tggcacatcagcagacccgggcagttcgagacctttgatctcatgacacttcatccaata
gaaattgcacgccagctgacgcttttggaatctgatctctataggaaagtccagccttct
gaacttgtagggagtgtgtggaccaaagaagataaagaaataaattctccaaatttatta
aaaatgattcgccacaccacaaatctcaccctctggtttgaaaagtgcattgtggaagca
gaaaactttgaggaacgagtggcagtactaagtagagttatagaaattctgcaagttttt
caagatttgaataatttcaatggcgtactggagatagtcagtgcagtgaattcagtgtca
gtgtacagactagaccataccttcgaggcattgcaggaaagaaaaaggagaattttggat
gaagctgtggaattaagtcaagatcactttaaaaaatacctagtaaaacttaagtcaatc
aatccaccttgtgtgcctttttttggaatatatttaacaaatattctgaagactgaagaa
gggaataatgattttttaaaaaagaaagggaaagatttaatcaatttcagtaagaggagg
aaagtagctgaaattactggagaaattcagcagtatcagaatcaaccttactgtttacgg
atagaaccagagatgaggcggttctttgaaaaccttaaccccatgggaagttcttctgaa
aaagagtttacggattatttgttcaacaaatcactagaaattgaaccccgcaactgcaaa
cagccacctcgatttcctaggaaatcaactttctctttaaaatctcctggaataaggcct
aacacgggccgacatggctctacctcaggcactttacgaggtcatcccaccccgttagaa
agagaaccgtgtaaaatcagctttagtcggattgctgaggctgagctgggatcagcagtg
tcggcaccaacctctcccaacacgccgtccaccccgccggcgtctgctgcttcagacctc
agtgtgttcccagacgtggacctcagcgcttcctgtggcagcaatagcatctttgctcca
gtcctcttgccacactcaaagtccttcttcagttcatgcggtagtttacataaactaagt
gacgagcccctgattcctcctccgcttcctcctcggaagaagtttgatcacgatgcttcg
aatcccaagggaaatatgaaatctgatgatgaccctcctgctattccaccaagacaacct
cctcctccaaaggtgaaacccagagttcccgctccttctggtgcatttgaggggtctctg
cacagcccacctccaccgccgcccagagagcctcttcctgacacgcctccaccggttccc
cttcggcctccagaacactttataaactgtccgtttaatcttcagccacctcccctggga
catcttcacagagatccagactggttcagagacgttagcacacgaccagattcgcccaac
actcctcccagcacaccgtctccacgggtgccacgtcgatgctgtgtgctcagttctagt
cacagtaatctcacttatcctcaagctccccctgttccaccaaggcagaattcaagccct
cacctaccaaaactgccaccaaagacttacaaacgggagctttcgcaccccccaatgtac
agactgtctttgctagaaaacgcggaaactcctcaatga
DBGET
integrated database retrieval system