Desmodus rotundus (common vampire bat): 112297971
Entry
112297971 CDS
T05907
Gene name
SOS1
Definition
(RefSeq) son of sevenless homolog 1 isoform X1
KO
K03099  son of sevenless
Organism
dro  Desmodus rotundus (common vampire bat)
Pathway
dro01521  EGFR tyrosine kinase inhibitor resistance
dro01522  Endocrine resistance
dro04010  MAPK signaling pathway
dro04012  ErbB signaling pathway
dro04014  Ras signaling pathway
dro04062  Chemokine signaling pathway
dro04068  FoxO signaling pathway
dro04072  Phospholipase D signaling pathway
dro04150  mTOR signaling pathway
dro04151  PI3K-Akt signaling pathway
dro04510  Focal adhesion
dro04540  Gap junction
dro04630  JAK-STAT signaling pathway
dro04650  Natural killer cell mediated cytotoxicity
dro04660  T cell receptor signaling pathway
dro04662  B cell receptor signaling pathway
dro04664  Fc epsilon RI signaling pathway
dro04714  Thermogenesis
dro04722  Neurotrophin signaling pathway
dro04810  Regulation of actin cytoskeleton
dro04910  Insulin signaling pathway
dro04912  GnRH signaling pathway
dro04915  Estrogen signaling pathway
dro04917  Prolactin signaling pathway
dro04926  Relaxin signaling pathway
dro04935  Growth hormone synthesis, secretion and action
dro05034  Alcoholism
dro05160  Hepatitis C
dro05161  Hepatitis B
dro05163  Human cytomegalovirus infection
dro05165  Human papillomavirus infection
dro05200  Pathways in cancer
dro05205  Proteoglycans in cancer
dro05206  MicroRNAs in cancer
dro05210  Colorectal cancer
dro05211  Renal cell carcinoma
dro05213  Endometrial cancer
dro05214  Glioma
dro05215  Prostate cancer
dro05220  Chronic myeloid leukemia
dro05221  Acute myeloid leukemia
dro05223  Non-small cell lung cancer
dro05224  Breast cancer
dro05225  Hepatocellular carcinoma
dro05226  Gastric cancer
dro05231  Choline metabolism in cancer
Brite
KEGG Orthology (KO) [BR:dro00001]
09130 Environmental Information Processing
09132 Signal transduction
04010 MAPK signaling pathway
112297971 (SOS1)
04012 ErbB signaling pathway
112297971 (SOS1)
04014 Ras signaling pathway
112297971 (SOS1)
04630 JAK-STAT signaling pathway
112297971 (SOS1)
04068 FoxO signaling pathway
112297971 (SOS1)
04072 Phospholipase D signaling pathway
112297971 (SOS1)
04151 PI3K-Akt signaling pathway
112297971 (SOS1)
04150 mTOR signaling pathway
112297971 (SOS1)
09140 Cellular Processes
09144 Cellular community - eukaryotes
04510 Focal adhesion
112297971 (SOS1)
04540 Gap junction
112297971 (SOS1)
09142 Cell motility
04810 Regulation of actin cytoskeleton
112297971 (SOS1)
09150 Organismal Systems
09151 Immune system
04650 Natural killer cell mediated cytotoxicity
112297971 (SOS1)
04660 T cell receptor signaling pathway
112297971 (SOS1)
04662 B cell receptor signaling pathway
112297971 (SOS1)
04664 Fc epsilon RI signaling pathway
112297971 (SOS1)
04062 Chemokine signaling pathway
112297971 (SOS1)
09152 Endocrine system
04910 Insulin signaling pathway
112297971 (SOS1)
04912 GnRH signaling pathway
112297971 (SOS1)
04915 Estrogen signaling pathway
112297971 (SOS1)
04917 Prolactin signaling pathway
112297971 (SOS1)
04926 Relaxin signaling pathway
112297971 (SOS1)
04935 Growth hormone synthesis, secretion and action
112297971 (SOS1)
09156 Nervous system
04722 Neurotrophin signaling pathway
112297971 (SOS1)
09159 Environmental adaptation
04714 Thermogenesis
112297971 (SOS1)
09160 Human Diseases
09161 Cancer: overview
05200 Pathways in cancer
112297971 (SOS1)
05206 MicroRNAs in cancer
112297971 (SOS1)
05205 Proteoglycans in cancer
112297971 (SOS1)
05231 Choline metabolism in cancer
112297971 (SOS1)
09162 Cancer: specific types
05210 Colorectal cancer
112297971 (SOS1)
05225 Hepatocellular carcinoma
112297971 (SOS1)
05226 Gastric cancer
112297971 (SOS1)
05214 Glioma
112297971 (SOS1)
05221 Acute myeloid leukemia
112297971 (SOS1)
05220 Chronic myeloid leukemia
112297971 (SOS1)
05211 Renal cell carcinoma
112297971 (SOS1)
05215 Prostate cancer
112297971 (SOS1)
05213 Endometrial cancer
112297971 (SOS1)
05224 Breast cancer
112297971 (SOS1)
05223 Non-small cell lung cancer
112297971 (SOS1)
09165 Substance dependence
05034 Alcoholism
112297971 (SOS1)
09172 Infectious disease: viral
05161 Hepatitis B
112297971 (SOS1)
05160 Hepatitis C
112297971 (SOS1)
05163 Human cytomegalovirus infection
112297971 (SOS1)
05165 Human papillomavirus infection
112297971 (SOS1)
09176 Drug resistance: antineoplastic
01521 EGFR tyrosine kinase inhibitor resistance
112297971 (SOS1)
01522 Endocrine resistance
112297971 (SOS1)
09180 Brite Hierarchies
09183 Protein families: signaling and cellular processes
04990 Domain-containing proteins not elsewhere classified [BR:dro04990]
112297971 (SOS1)
Domain-containing proteins not elsewhere classified [BR:dro04990]
Pleckstrin homology (PH) domain-containing proteins
Dbl-Like RhoGEF family proteins
112297971 (SOS1)
Motif
Pfam: RasGEF RasGEF_N RhoGEF Histone PH PH_19 PH_10 IQ_SEC7_PH PH_13
Other DBs
NCBI-GeneID: 112297971
NCBI-ProteinID: XP_024408671
Position
Unknown
AA seq
1333 aa
MQAQQLPYEFFSEENVSKWRGLLVSALKKVQGQVHPTLESSDDALQYVEELILQLLNMLC
QAQPRSVSDVEERVQKSFPHPIDKWAIADAQSAIEKRKRRNPLSLPVEKIHPLLKEVLGY
KIDHQVSVYIVAVLEYISADILKLVGNYVRNIRHYEITKQDIKVAMCADKVLMDMFHQDV
EDINILSLTDEEPSTSGEQTYYDLVKAFMAEIRQYIRELNLIIKVFREPFVSNSKLFSAN
DVENIFSRIVDIHELSVKLLGHIEDTVEMTDEGSPHPLVGSCFEDLAEELAFDPYESYAR
DILRPGFHDHFLSQLSKPGAALYLQSIGEGFKEAVQYVLPRLLLAPVYHCLHYFELLKQL
EEKSEDQEDKECLKQAITALLNVQSGMEKICSKSLAKRRLSESACRFYSQQMKGKQLAIK
KMNEIQKNIDGWEGKDIGQCCNEFIMEGTLTRVGAKHERHIFLFDGLMICCKSNHGQPRL
PGASNAEYRLKEKFFMRKVQINDKDDTSEYKHAFEIILKDENSVIFSAKSAEEKNNWMAA
LISLQYRSTLERMLDVTMLQEEKEEQMRLPSADIYRFAEPDSEENIIFEENMQPKAGIPI
IKAGTVIKLIERLTYHMYADPNFVRTFLTTYRSFCKPQELLSLIIERFEIPEPEPTEADR
IAIENGDQPLSAELKRFRKEYIQPVQLRVLNVCRHWVEHHFYDFERDTDLLRRMEEFIGT
VRGKAMRKWVESITKIIQRKKMARDSGPGHNITFQSSPPTVEWHISRPGHIETFDLLTLH
PIEIARQLTLLESDLYRAVQPSELVGSVWTKEDKEINSPNLLKMIRHTTNLTLWFEKCIV
ETENLEERVAVVSRIIEILQVFQELNNFNGVLEVVSAMNSSPVYRLDHTFEQIPSRQKKI
LEEAHELSEDHYKKYLAKLRSINPPCVPFFGIYLTNILKTEEGNPEVLKRHGKELINFSK
RRKVAEITGEIQQYQNQPYCLRVESDIKRFFENLNPMGNSMEKEFTDYLFNKSLEIEPRN
PKPLPRFPKKYSYPLKSPGVRPSNPRPGTMRHPTPLQQEPRKISYSRIPESETESTASAP
NSPRTPLTPPPASGASSTTDVCSVFDSDHSSPFHSSSDTVFIQVTLSHGPRSASVSSISL
TKGTDEVPVPPPVPPRRRPESAPAESSPSKMMSKHLDSPPAIPPRQPTSKAYSPRYSVSD
RTSLSDAPESPPLLPPREPVRTPDVFSSSPLHLQPPPLGKKSDHSNAFFPNSPSPFTPPP
PQTPSPHGTRRHLPSPPLTQEVDLHSMAGPPVPPRQSTSQHIPKLPPKTYKREHTHPSMH
RDGPPLLENAHSS
NT seq
4002 nt
atgcaggcgcagcagctgccctacgagtttttcagcgaggagaacgtgtccaagtggcgg
gggctgctagtgtctgcgctgaaaaaggtccaggggcaagttcatcctactcttgagtcc
agtgatgatgcgcttcagtatgttgaagaattaattttgcagttattaaatatgctatgc
caagctcagccccgaagtgtttcagatgtagaggaacgtgttcaaaaaagtttccctcat
ccaatcgataagtgggcaatagctgatgcccagtcggccattgaaaagaggaaacgaaga
aaccctttatctctcccagtagaaaaaatccatcctttactaaaggaggtcctaggttac
aaaattgaccaccaggtttctgtttacatagtagcagtgttagaatacatttctgcagac
attttaaagctggtgggtaattatgtacgaaatatacggcattatgaaatcacaaaacaa
gatattaaagtagcaatgtgtgctgataaggtattgatggatatgtttcatcaagatgta
gaagatataaatatattatctttaactgatgaagaaccttccacctcaggagagcaaact
tattatgatttggtaaaagcatttatggcagaaattcgacaatacataagggaactaaat
ttaattataaaagtttttagagaaccctttgtctccaactcaaaattgttttcagctaat
gatgtagaaaatatatttagtcgtatagtagatatacatgaacttagtgtaaagttactg
ggccacatagaagatactgtagaaatgacagatgaaggcagtccccacccactagtagga
agctgctttgaagatttagcagaggaactggcatttgatccatatgaatcatatgctcga
gatattttacgacctggttttcatgatcatttccttagtcagttatccaagcctggagca
gcactttatttgcagtcaataggcgaaggcttcaaagaagcggttcaatacgttttgccc
aggctacttctagccccggtttaccactgtctacattactttgagcttttaaagcaatta
gaagagaagagtgaagatcaagaagataaggaatgtttgaaacaagcaataacagctttg
cttaatgttcagagtggtatggaaaaaatatgttctaaaagtcttgcaaaacgaagactt
agtgaatctgcatgtcggttttatagtcagcaaatgaaggggaaacaactagcaatcaag
aaaatgaatgagattcagaagaatattgatggttgggagggaaaagacattggacagtgt
tgcaatgagtttataatggaaggaactcttacacgtgtaggagccaaacacgagagacac
atatttctctttgatggcttaatgatttgctgtaaatcaaatcatgggcagccaagactt
cctggagctagcaatgcagaatatcgtcttaaagaaaagttttttatgagaaaggtacaa
attaatgacaaagatgacaccagtgagtacaagcatgcttttgagatcattttaaaagat
gaaaatagtgttatattttctgccaagtcagctgaagagaaaaacaactggatggcagca
ctgatatctctgcagtaccggagcacactggaaaggatgcttgatgtgacgatgctgcag
gaagagaaggaggagcagatgaggctccctagtgctgacatttatagatttgcagagcct
gactctgaagagaatataatatttgaagaaaatatgcagcccaaggctggaattccaatt
atcaaagcaggaactgttattaaactcatagagaggctcacataccatatgtacgcagat
cccaattttgttcggacatttcttacaacatacagatccttttgtaaacctcaagaacta
ctgagtcttataatagaaaggtttgaaattccagagcctgagccaacagaagctgaccgt
atagctatagagaatggagaccaacccctgagtgcagagctaaaaagatttagaaaagaa
tacatacagcctgtacagctgcgagtgctgaatgtgtgtcggcactgggtagagcaccac
ttctatgacttcgagagagatacagatcttttgcggcgaatggaggaatttattggaaca
gtaagaggtaaagcaatgagaaaatgggttgaatccatcactaaaataatccagaggaaa
aaaatggcaagagacagtggaccgggtcataatattacatttcagagttcacctcccaca
gttgaatggcatataagcagacctgggcacatagagacttttgacctgctcaccttacac
ccaatagaaattgctcggcaactcactttacttgaatcagatctgtatcgagctgtccag
ccatccgaattagttggaagtgtgtggacaaaagaagacaaagaaattaattctcctaat
cttctgaaaatgatccggcataccactaatctcactctgtggttcgagaaatgcattgta
gaaactgaaaacttagaagaaagagtggccgtggtgagtcgaataattgagattctgcaa
gtctttcaggagctgaacaacttcaatggtgtccttgaggttgtcagtgctatgaactcg
tcacctgtttacaggctggaccacacatttgagcaaataccaagtcgccaaaagaaaatt
ttagaagaagctcatgaattaagtgaagatcactataagaaatatttggcaaaactcagg
tctattaatccaccatgtgtgcctttctttggaatttacctaactaatatcttgaaaaca
gaagaaggcaaccctgaggtcctaaaaaggcatggaaaagagcttataaattttagcaaa
aggagaaaagtagcagaaataacaggagagatacagcagtaccaaaatcagccttattgt
ttacgagtagaatcagatatcaaaaggttcttcgaaaacttgaatccaatgggaaatagc
atggaaaaagaatttacagattatcttttcaacaaatccctagaaatagaaccacgaaac
cctaaacctctcccaagattcccaaaaaaatatagctatcccctaaaatctcctggtgtt
cgtccatcaaacccaagaccaggtaccatgagacatcccacgcctctgcagcaggagcca
agaaaaattagttatagtaggatacccgagagtgaaacagaaagtacagcctctgcacca
aattctccaagaacaccgttaacacctcctcctgcttctggtgcttctagtaccacggat
gtttgcagtgtatttgattccgatcattcaagcccttttcactcaagcagcgataccgtc
tttatccaagttacactgtcccatggcccaagatctgcttcagtatcatctataagttta
accaaaggcactgatgaagtgccggtcccccctcctgttcctccacggagacgaccagaa
tctgcccccgcggaatcgtcaccatctaagatgatgtctaagcacttggacagcccccca
gcaattcctcctaggcaacccacatcgaaagcctattcaccacgatactcagtatcagac
cggacctctctatcagatgctcctgaaagccctcccctacttccaccacgagaacctgtg
aggacacctgatgttttctcaagctcaccactacatctccaacctcctcctttgggcaaa
aaaagtgatcacagcaatgccttctttccaaacagcccttccccctttacaccaccacct
cctcaaacaccttctcctcacggcacaagaaggcatctgccgtcaccaccgttaacacaa
gaagtggaccttcattccatggctgggccacctgttcctccacgacaaagcacttctcag
catatccctaaactccctccaaaaacttacaaaagggagcatacacacccatccatgcac
agagatggaccaccactgttggagaatgcccattcttcctga
Desmodus rotundus (common vampire bat): 112310995
Entry
112310995 CDS
T05907
Gene name
SOS2
Definition
(RefSeq) son of sevenless homolog 2
KO
K03099  son of sevenless
Organism
dro  Desmodus rotundus (common vampire bat)
Pathway
dro01521  EGFR tyrosine kinase inhibitor resistance
dro01522  Endocrine resistance
dro04010  MAPK signaling pathway
dro04012  ErbB signaling pathway
dro04014  Ras signaling pathway
dro04062  Chemokine signaling pathway
dro04068  FoxO signaling pathway
dro04072  Phospholipase D signaling pathway
dro04150  mTOR signaling pathway
dro04151  PI3K-Akt signaling pathway
dro04510  Focal adhesion
dro04540  Gap junction
dro04630  JAK-STAT signaling pathway
dro04650  Natural killer cell mediated cytotoxicity
dro04660  T cell receptor signaling pathway
dro04662  B cell receptor signaling pathway
dro04664  Fc epsilon RI signaling pathway
dro04714  Thermogenesis
dro04722  Neurotrophin signaling pathway
dro04810  Regulation of actin cytoskeleton
dro04910  Insulin signaling pathway
dro04912  GnRH signaling pathway
dro04915  Estrogen signaling pathway
dro04917  Prolactin signaling pathway
dro04926  Relaxin signaling pathway
dro04935  Growth hormone synthesis, secretion and action
dro05034  Alcoholism
dro05160  Hepatitis C
dro05161  Hepatitis B
dro05163  Human cytomegalovirus infection
dro05165  Human papillomavirus infection
dro05200  Pathways in cancer
dro05205  Proteoglycans in cancer
dro05206  MicroRNAs in cancer
dro05210  Colorectal cancer
dro05211  Renal cell carcinoma
dro05213  Endometrial cancer
dro05214  Glioma
dro05215  Prostate cancer
dro05220  Chronic myeloid leukemia
dro05221  Acute myeloid leukemia
dro05223  Non-small cell lung cancer
dro05224  Breast cancer
dro05225  Hepatocellular carcinoma
dro05226  Gastric cancer
dro05231  Choline metabolism in cancer
Brite
KEGG Orthology (KO) [BR:dro00001]
09130 Environmental Information Processing
09132 Signal transduction
04010 MAPK signaling pathway
112310995 (SOS2)
04012 ErbB signaling pathway
112310995 (SOS2)
04014 Ras signaling pathway
112310995 (SOS2)
04630 JAK-STAT signaling pathway
112310995 (SOS2)
04068 FoxO signaling pathway
112310995 (SOS2)
04072 Phospholipase D signaling pathway
112310995 (SOS2)
04151 PI3K-Akt signaling pathway
112310995 (SOS2)
04150 mTOR signaling pathway
112310995 (SOS2)
09140 Cellular Processes
09144 Cellular community - eukaryotes
04510 Focal adhesion
112310995 (SOS2)
04540 Gap junction
112310995 (SOS2)
09142 Cell motility
04810 Regulation of actin cytoskeleton
112310995 (SOS2)
09150 Organismal Systems
09151 Immune system
04650 Natural killer cell mediated cytotoxicity
112310995 (SOS2)
04660 T cell receptor signaling pathway
112310995 (SOS2)
04662 B cell receptor signaling pathway
112310995 (SOS2)
04664 Fc epsilon RI signaling pathway
112310995 (SOS2)
04062 Chemokine signaling pathway
112310995 (SOS2)
09152 Endocrine system
04910 Insulin signaling pathway
112310995 (SOS2)
04912 GnRH signaling pathway
112310995 (SOS2)
04915 Estrogen signaling pathway
112310995 (SOS2)
04917 Prolactin signaling pathway
112310995 (SOS2)
04926 Relaxin signaling pathway
112310995 (SOS2)
04935 Growth hormone synthesis, secretion and action
112310995 (SOS2)
09156 Nervous system
04722 Neurotrophin signaling pathway
112310995 (SOS2)
09159 Environmental adaptation
04714 Thermogenesis
112310995 (SOS2)
09160 Human Diseases
09161 Cancer: overview
05200 Pathways in cancer
112310995 (SOS2)
05206 MicroRNAs in cancer
112310995 (SOS2)
05205 Proteoglycans in cancer
112310995 (SOS2)
05231 Choline metabolism in cancer
112310995 (SOS2)
09162 Cancer: specific types
05210 Colorectal cancer
112310995 (SOS2)
05225 Hepatocellular carcinoma
112310995 (SOS2)
05226 Gastric cancer
112310995 (SOS2)
05214 Glioma
112310995 (SOS2)
05221 Acute myeloid leukemia
112310995 (SOS2)
05220 Chronic myeloid leukemia
112310995 (SOS2)
05211 Renal cell carcinoma
112310995 (SOS2)
05215 Prostate cancer
112310995 (SOS2)
05213 Endometrial cancer
112310995 (SOS2)
05224 Breast cancer
112310995 (SOS2)
05223 Non-small cell lung cancer
112310995 (SOS2)
09165 Substance dependence
05034 Alcoholism
112310995 (SOS2)
09172 Infectious disease: viral
05161 Hepatitis B
112310995 (SOS2)
05160 Hepatitis C
112310995 (SOS2)
05163 Human cytomegalovirus infection
112310995 (SOS2)
05165 Human papillomavirus infection
112310995 (SOS2)
09176 Drug resistance: antineoplastic
01521 EGFR tyrosine kinase inhibitor resistance
112310995 (SOS2)
01522 Endocrine resistance
112310995 (SOS2)
09180 Brite Hierarchies
09183 Protein families: signaling and cellular processes
04990 Domain-containing proteins not elsewhere classified [BR:dro04990]
112310995 (SOS2)
Domain-containing proteins not elsewhere classified [BR:dro04990]
Pleckstrin homology (PH) domain-containing proteins
Dbl-Like RhoGEF family proteins
112310995 (SOS2)
Motif
Pfam: RasGEF RasGEF_N RhoGEF Histone PH IQ_SEC7_PH PH_19
Other DBs
NCBI-GeneID: 112310995
NCBI-ProteinID: XP_024423038
Position
Unknown
AA seq
1312 aa
MGVPEQEESVQEQVHPNLSANEESLYYIEELIFQLLNKLCMAQPRTVQDVEERVQKTFPH
PIDKWAIADAQSAIEKRKRRNPLLLPVDKIHPSLKEVLGYKVDYHVSLYIVAVLEYISAD
ILKLAGNYVFNIRHYEISQQDIKVSMCADKVLMDMFDQDDIGLVSLCEDEPSSSGELNYY
DLVRTEIAEERQYLRELNMIMKVFREAFLSDKKLFKPSDIEKIFSNISDIHELTVKLLGL
IEDTVEMTDESSPHPLAGSCFEDLAEEQAFDPYETLSQDILSPKFNEHFSKLMARPAVAQ
YFQSIADGFREAVRYVLPRLMLVPVYHCWHYFELLKQLKACSEEQEDRECLNQAITALMN
LQGSMDRIYKQYSPRRRPGDPVCPFYNRQLRSKHLAIKKMNEIQKNIDGWEGKDIGQCCN
EFIMEGPLTRIGAKHERHIFLFDGLMISCKPNHSQSRLPGYSSAEYRLKEKFIMRKTQIC
DKEDSCEYKHAFELISKDENSIIFAAKSAEEKTNWMAALISLHYRSTLDRMLDSVLLKEE
NEQPLRLPSPEVYRFVVKDSEENIVFEDNLQSRSGIPIIKGGTVVKLIERLTYHMYADPN
FVRTFLTTYRSFCKPQELLSLLIERFEIPEPEPTEADKLAAEKGEQPISADLKRFRKEYV
QPVQLRILNVFRHWVEHHFYDFERDLELLEKLESFISSVRGKAMKKWVESIAKIIKRKKQ
AQANGISHNITFESPPPPIEWHISKPGQFETFDLMTLHPIEIARQLTLLESDLYRKVQPS
ELVGSVWTKEDKEINSPNLLKMIRHTTNLTLWFEKCIVEAENFEERVAVLSRIIEILQVF
QDLNNFNGVLEIVSAVNSVSVYRLDHTFEALQERKRRILDEAVELSQDHFKKYLAKLKSI
NPPCVPFFGIYLTNILKTEEGNNDFLKKKGKDLINFSKRRKVAEITGEIQQYQNQPYCLR
IEPEIRRFFENLNPMGSASEKEFTDYLFNKSLEIEPRNCKQPPRFPRKSTFSLKSPGIRP
NTGRHGSTSGTLRGHPTPLEREPCKISFSRIAETELESTVSAPTSPNTPSTPPVSASSDL
SVFLDVDLNSSCGSNSIFAPVLLPHSKSFFSSCGSLHKLSEEPLIPPPLPPRKKFDHDAS
NSKGNMKSDDDPPAIPPRQPPPPKVKPRVPAPTGAFDGPLHSPPPPPPRDPLPDSPPPVP
LRPPEHFINCPFNLQPPPLGHLHRDPDWFRDISTCPNSPNTPPSTPSPRVPRRCSVLSSS
HSNVVHPQAPPVPPRQNSSPHLPKLPPKTYKRELSHTPLYRLPLLENAETPQ
NT seq
3939 nt
atgggagtgccagaacaagaggagagtgttcaggaacaagtacatcccaatctctcagct
aatgaagaatctctctattatatcgaagagctgatttttcagctacttaataaattatgc
atggcccaaccaaggactgttcaagatgtggaggaacgagtccaaaagacctttcctcat
ccaattgataaatgggctatagctgatgcacagtcggccatagagaaacgaaaacgaaga
aatcctctcttactgcctgtggacaaaatccatccttcactgaaggaagttttagggtac
aaagtggactaccatgtgtccttatatattgtggctgtactagagtatatctcagctgat
attttgaaattagctggtaattatgtttttaatattcgacattatgaaatatctcagcag
gacattaaagtgtcaatgtgtgcagataaggttttgatggatatgtttgatcaggatgac
ataggtttggtttctctctgtgaagatgaacctagttcttcaggtgaattaaactattac
gaccttgtcagaactgaaattgcagaagaaagacagtatctgcgggaactaaatatgatc
atgaaagtgtttcgagaagcctttctttctgacaaaaagctgtttaaaccttctgacatt
gaaaagattttcagtaatatttcagatatacatgagttgacagtaaaacttttagggttg
attgaagacacagttgaaatgactgatgaaagtagtcctcatcccttagctggcagctgt
tttgaagatttggcagaagagcaagcatttgatccttacgaaacattatcacaggacatt
ctttcaccaaaatttaatgaacatttcagtaagttgatggccagacctgcagtggctcaa
tactttcagtccattgctgatggttttagagaggcagttcgctatgtccttccacgcctt
atgctggtgccagtgtatcactgttggcactattttgaattgttaaagcaattgaaagca
tgtagtgaagaacaagaagacagagaatgtttgaatcaagcaattactgctctcatgaat
ctccaaggtagtatggaccgaatttacaagcaatattcacctcgacgccgacctggggat
cctgtttgccctttttataaccgtcaattaagaagcaagcacttggctattaaaaaaatg
aatgaaattcagaaaaacatagatggatgggaaggcaaagatattggacagtgttgtaat
gaatttattatggaaggtccattgacaagaattggtgctaaacacgaacggcatattttc
ctctttgatggcttaatgattagctgcaaacccaatcatagccagtcacgccttccaggt
tatagtagtgcagaatacagattaaaagaaaaatttatcatgagaaaaacacaaatatgt
gataaagaagatagttgtgagtacaaacatgcatttgaattgatatccaaagatgaaaac
agcataatatttgctgctaagtctgcagaagagaaaactaattggatggcagcacttatt
tctcttcattatcgtagcactctggatcgaatgctagattcagtattattgaaagaagaa
aatgagcaaccactgagattaccaagtcctgaagtgtatcgttttgtggtaaaagactct
gaggaaaacattgtttttgaagacaacttgcaaagtagaagtggaatccctattattaaa
ggaggaacggtggtgaagttaattgaaaggttgacatatcatatgtatgcagatcccaat
tttgttcgtactttccttactacatatcgttcattttgtaaaccacaggaactgctaagc
ttactgattgaacgatttgaaattccagagccagaacctactgaagcagataaattggca
gcagagaaaggcgaacaaccaattagtgcagaccttaaaagatttcgcaaggaatatgtc
caaccagtacaacttaggatcctaaatgtatttcggcactgggttgaacaccatttctat
gactttgaaagagatttggagttgcttgaaaagctagaatccttcatttcaagtgtaaga
gggaaagccatgaagaaatgggtagagtcaattgctaagatcatcaagagaaagaaacaa
gctcaggcaaatggaataagccataatattacctttgaaagtccacctccaccaattgaa
tggcatatcagcaaaccaggacagtttgaaacatttgatctcatgacacttcatccaata
gaaattgcacgtcagctgacacttttggaatctgatctctacaggaaagttcaaccttct
gaacttgtagggagtgtatggaccaaagaagataaagaaataaattctccaaatttatta
aaaatgattcgtcatactacaaatcttacactctggtttgaaaagtgcattgtggaagca
gaaaattttgaggaacgggtggcagtactaagtagaattatagaaattctgcaagttttt
caagatttgaataatttcaatggtgtattggagatagtcagtgcagtaaattcagtgtca
gtatacagactggatcatacctttgaggcattgcaggaaagaaaaaggagaattttggat
gaagctgtggaattaagtcaggatcactttaaaaaatacctagcaaaacttaagtcaatc
aatccaccttgtgtgcctttttttggaatatatttaacaaatattctgaagactgaagaa
gggaataatgattttttaaaaaagaaagggaaggatttaatcaatttcagtaagaggagg
aaagtggctgaaattactggagaaattcagcagtatcagaatcaaccttactgtttacgg
atagaaccagaaataaggaggttctttgaaaaccttaaccccatgggaagtgcttctgaa
aaagagtttacagattatttgttcaacaagtcactagaaattgaaccccgaaactgcaaa
cagccacctcgatttcccaggaaatcaactttctccttaaaatctcctggaataaggcct
aatacaggccgacatggctctacctcaggcactttaagaggtcatccaacaccattagaa
agagaaccatgtaaaataagctttagtcggattgctgaaactgagcttgaatcaacagtg
tcggcaccaacctctccaaatacaccatctactccaccagtatctgcttcttcagacctt
agcgtgtttttagatgtggatctcaacagttcttgtggcagcaatagcatctttgctcca
gttctgttgccacattcaaaatctttcttcagttcgtgtggtagtttacataaactaagt
gaagagccactgattcctcctccacttcctcctcgaaaaaagtttgatcatgatgcttca
aattccaagggaaatatgaaatctgatgatgacccccctgctattccaccaagacaacct
cctcctccaaaggtaaaacccagagttcctgctcctactggtgcatttgatgggccactg
catagtccacctccaccgccaccaagagatcctcttcctgatagccctccaccggttccc
cttcggcctccggaacactttataaactgtccatttaatcttcagccgcctccactggga
catcttcacagagatccagactggttcagagacattagcacttgtccaaattcgccaaac
actcctcctagcacaccctctccgagggtaccacgtcgatgctctgtgctcagttctagt
cacagtaatgttgttcatccccaagctccccctgttccaccaaggcagaattcaagccct
cacctaccaaaactgccaccaaagacttacaaacgggagctttcacacaccccattgtat
agactgcctttgctagaaaatgcagaaactcctcaatga
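Consistency check (illustrative, not part of the KEGG record): the lengths reported for the two entries fit the usual CDS relationship, in which the nucleotide count is three times the number of residues plus one terminal stop codon. Both NT sequences above end in tga, which supports that reading. The sketch below only redoes this arithmetic; the function name `cds_length_nt` and the `ENTRIES` table are hypothetical helpers introduced for this example, and the assumption that the stop codon is counted in the reported NT length is marked in the code.

```python
def cds_length_nt(aa_length: int, has_stop: bool = True) -> int:
    """Expected CDS length in nucleotides for a protein of aa_length residues.

    Assumption: the reported NT length includes one terminal stop codon.
    """
    codons = aa_length + (1 if has_stop else 0)
    return 3 * codons


# (AA length, NT length) as reported in the two entries above.
ENTRIES = {
    "SOS1 (112297971)": (1333, 4002),
    "SOS2 (112310995)": (1312, 3939),
}

for name, (aa_len, nt_len) in ENTRIES.items():
    consistent = cds_length_nt(aa_len) == nt_len
    print(f"{name}: {aa_len} aa, {nt_len} nt, consistent={consistent}")
```

A mismatch here would usually point to a truncated sequence or a partial CDS rather than to an error in the formula.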