Callithrix jacchus (white-tufted-ear marmoset): 100410124
Entry
100410124 CDS
T03264
Gene name
SOS2
Definition
(RefSeq) SOS Ras/Rho guanine nucleotide exchange factor 2
KO
K03099
son of sevenless
Organism
cjc
Callithrix jacchus (white-tufted-ear marmoset)
Pathway
cjc01521  EGFR tyrosine kinase inhibitor resistance
cjc01522  Endocrine resistance
cjc04010  MAPK signaling pathway
cjc04012  ErbB signaling pathway
cjc04014  Ras signaling pathway
cjc04062  Chemokine signaling pathway
cjc04068  FoxO signaling pathway
cjc04072  Phospholipase D signaling pathway
cjc04150  mTOR signaling pathway
cjc04151  PI3K-Akt signaling pathway
cjc04510  Focal adhesion
cjc04540  Gap junction
cjc04630  JAK-STAT signaling pathway
cjc04650  Natural killer cell mediated cytotoxicity
cjc04660  T cell receptor signaling pathway
cjc04662  B cell receptor signaling pathway
cjc04664  Fc epsilon RI signaling pathway
cjc04714  Thermogenesis
cjc04722  Neurotrophin signaling pathway
cjc04810  Regulation of actin cytoskeleton
cjc04910  Insulin signaling pathway
cjc04912  GnRH signaling pathway
cjc04915  Estrogen signaling pathway
cjc04917  Prolactin signaling pathway
cjc04926  Relaxin signaling pathway
cjc04935  Growth hormone synthesis, secretion and action
cjc05034  Alcoholism
cjc05160  Hepatitis C
cjc05161  Hepatitis B
cjc05163  Human cytomegalovirus infection
cjc05165  Human papillomavirus infection
cjc05200  Pathways in cancer
cjc05205  Proteoglycans in cancer
cjc05206  MicroRNAs in cancer
cjc05210  Colorectal cancer
cjc05211  Renal cell carcinoma
cjc05213  Endometrial cancer
cjc05214  Glioma
cjc05215  Prostate cancer
cjc05220  Chronic myeloid leukemia
cjc05221  Acute myeloid leukemia
cjc05223  Non-small cell lung cancer
cjc05224  Breast cancer
cjc05225  Hepatocellular carcinoma
cjc05226  Gastric cancer
cjc05231  Choline metabolism in cancer
Brite
KEGG Orthology (KO) [BR:cjc00001]
09130 Environmental Information Processing
09132 Signal transduction
04010 MAPK signaling pathway
100410124 (SOS2)
04012 ErbB signaling pathway
100410124 (SOS2)
04014 Ras signaling pathway
100410124 (SOS2)
04630 JAK-STAT signaling pathway
100410124 (SOS2)
04068 FoxO signaling pathway
100410124 (SOS2)
04072 Phospholipase D signaling pathway
100410124 (SOS2)
04151 PI3K-Akt signaling pathway
100410124 (SOS2)
04150 mTOR signaling pathway
100410124 (SOS2)
09140 Cellular Processes
09144 Cellular community - eukaryotes
04510 Focal adhesion
100410124 (SOS2)
04540 Gap junction
100410124 (SOS2)
09142 Cell motility
04810 Regulation of actin cytoskeleton
100410124 (SOS2)
09150 Organismal Systems
09151 Immune system
04650 Natural killer cell mediated cytotoxicity
100410124 (SOS2)
04660 T cell receptor signaling pathway
100410124 (SOS2)
04662 B cell receptor signaling pathway
100410124 (SOS2)
04664 Fc epsilon RI signaling pathway
100410124 (SOS2)
04062 Chemokine signaling pathway
100410124 (SOS2)
09152 Endocrine system
04910 Insulin signaling pathway
100410124 (SOS2)
04912 GnRH signaling pathway
100410124 (SOS2)
04915 Estrogen signaling pathway
100410124 (SOS2)
04917 Prolactin signaling pathway
100410124 (SOS2)
04926 Relaxin signaling pathway
100410124 (SOS2)
04935 Growth hormone synthesis, secretion and action
100410124 (SOS2)
09156 Nervous system
04722 Neurotrophin signaling pathway
100410124 (SOS2)
09159 Environmental adaptation
04714 Thermogenesis
100410124 (SOS2)
09160 Human Diseases
09161 Cancer: overview
05200 Pathways in cancer
100410124 (SOS2)
05206 MicroRNAs in cancer
100410124 (SOS2)
05205 Proteoglycans in cancer
100410124 (SOS2)
05231 Choline metabolism in cancer
100410124 (SOS2)
09162 Cancer: specific types
05210 Colorectal cancer
100410124 (SOS2)
05225 Hepatocellular carcinoma
100410124 (SOS2)
05226 Gastric cancer
100410124 (SOS2)
05214 Glioma
100410124 (SOS2)
05221 Acute myeloid leukemia
100410124 (SOS2)
05220 Chronic myeloid leukemia
100410124 (SOS2)
05211 Renal cell carcinoma
100410124 (SOS2)
05215 Prostate cancer
100410124 (SOS2)
05213 Endometrial cancer
100410124 (SOS2)
05224 Breast cancer
100410124 (SOS2)
05223 Non-small cell lung cancer
100410124 (SOS2)
09165 Substance dependence
05034 Alcoholism
100410124 (SOS2)
09172 Infectious disease: viral
05161 Hepatitis B
100410124 (SOS2)
05160 Hepatitis C
100410124 (SOS2)
05163 Human cytomegalovirus infection
100410124 (SOS2)
05165 Human papillomavirus infection
100410124 (SOS2)
09176 Drug resistance: antineoplastic
01521 EGFR tyrosine kinase inhibitor resistance
100410124 (SOS2)
01522 Endocrine resistance
100410124 (SOS2)
09180 Brite Hierarchies
09183 Protein families: signaling and cellular processes
04990 Domain-containing proteins not elsewhere classified [BR:cjc04990]
100410124 (SOS2)
Domain-containing proteins not elsewhere classified [BR:cjc04990]
Pleckstrin homology (PH) domain-containing proteins
Dbl-Like RhoGEF family proteins
100410124 (SOS2)
Motif
Pfam: RasGEF RasGEF_N RhoGEF PH Histone IQ_SEC7_PH PH_19 PH_13
Other DBs
NCBI-GeneID: 100410124
NCBI-ProteinID: XP_009004262
Ensembl: ENSCJAG00000006256
Position
10
AA seq
1272 aa
MAQPRTVQDVEERVQKTFPHPIDKWAIADAQSAIEKRKRRNPLLLPVDKIHPSLKEVLGY
KVDYHVSLYIVAVLEYISADILKLAGNYVFNIRHYEISQQDIKVSMCADKVLMDMFDQDD
IGLVSLCEDEPTSSGELNYYDLVRTEIAEERQYLRELNMIIKVFREAFLSDRKLFKPSDI
EKIFSNISDIHELTVKLLGLIEDTVEMTDESSPHPLAGSCFEDLAEEQAFDPYETLSQDI
LSPEFSEHFSKLMARPAVALHFQSIADGFKEAVRYVLPRLMLVPVYHCWHYFELLKQLKA
CSEEQEDRECLNQAITALLNLQGSMDRIYKQYSPRRRPGDPVCPFYGHQLRSKHLAIKKM
NEIQKNIDGWEGKDIGQCCNEFIMEGPLTRIGAKHERHIFLFDGLMISCKPNHGQTRLPG
YSSAEYRLKEKFVMRKIQICDKEDTCEYKHAFELVSKDENSIIFAAKSAEEKNNWMAALI
SLHYRSTLDRMLDSVLLKEENEQPLRLPSPEVYRFVVKDSEENIVFEDSLQSRSGIPIIK
GGTVVKLIERLTYHMYADPNFVRTFLTTYRSFCKPQELLSLLIERFEIPEPEPTDADKLA
IEKGEQPISADLKRFRKEYVQPVQLRILNVFRHWVEHHFYDFERDLELLERLESFISSVR
GKAMKKWVESIAKIIKRKKQAQANGISHNITFESPPPPIEWHISRPGQFETFDLMTLHPI
EIARQLTLLESDLYRKVQPSELVGSVWTKEDKEINSPNLLKMIRHTTNLTLWFEKCIVEA
ENFEERVAILSRIIEILQVFQDLNNFNGVLEIVSAVNSVSVYRLDHTFEALQERKRKILD
EAVELSQDHFKKYLVKLKSINPPCVPFFGIYLTNILKTEEGNNDFLKKKGKDLINFSKRR
KVAEITGEIQQYQNQPYCLRIEPDMRRFFENLNPMGSASEKEFTDYLFNKSLEIEPRNCK
QPPRFPRKSTFSLKSPGIRPNTGRHGSTSGTLRGHPTPLEREPCKISFSRIAETELESTV
SAPTSPNTPSTPPVSASSDVSVFLDVDLNSSCGSNSIFAPVLLPHSKSFFSSCGSLHKLS
EEPLIPPPLPPRKKFDHDASNSKGNMKSDDDPPAIPPRQPPPPKVKPRVPVPTGAFDGPL
HSPPPPPPRDPLPDTPPPVPLRPPEHFINCPVNLQPPPLGHLHRDSDWLRDISTCPNSPS
TPPSTPSPRVPRRCYVLSSSQNNLVHPPAPPVPPRQNSSPHLPKLPPKTYKRELSHPPLY
RLPLLENAETPQ
NT seq
3819 nt
atggcccagccaaggactgttcaagatgtggaggaacgagttcagaagacctttcctcac
ccaattgataagtgggccattgctgatgcacaatctgccatagaaaaacgaaaacgaaga
aatcctcttttattgcctgtggacaaaatccatccttcattgaaggaggttttagggtac
aaagtggactaccatgtgtccctgtatattgtggctgtactagagtatatctcagctgat
attttaaaattggctggtaattatgttttcaatatccggcattatgaaatatctcagcag
gacattaaagtgtcaatgtgtgcagataaggttttgatggacatgtttgatcaagatgac
ataggtttggtttctctctgtgaagatgaacctacttcttctggtgaattaaactactat
gatctcgtcagaactgaaattgcggaagaaagacagtatctacgggaattaaatatgatc
ataaaagtgtttcgagaagcctttctttctgatagaaagctgtttaaaccttctgatatt
gaaaagatttttagtaacatttcagatatacatgaattgactgtgaaacttttaggtttg
attgaagacacagttgaaatgactgatgaaagcagtcctcatcccttagctggcagctgt
tttgaagatttggcagaagagcaagcatttgatccttatgaaacattgtcacaggatatt
ctttcgccagagtttagtgaacatttcagtaaactgatggccagacctgcagttgctcta
cacttccagtccatagccgatggttttaaagaagcagttcgttatgtccttccgcgtctt
atgctggtgccagtgtatcactgttggcactattttgaattactaaagcaattgaaagca
tgtagtgaagaacaagaagacagagaatgtttgaaccaagctattaccgctctcctgaat
ctccaaggtagcatggaccgaatttacaagcagtattcacctagacgccgacctggagat
cctgtttgccctttttatggtcaccaattaagaagcaaacacctggctatcaaaaaaatg
aatgaaattcagaaaaatattgatggatgggaaggcaaagatattggacagtgttgtaat
gaattcattatggagggaccattgacaagaatcggtgctaaacatgaacgtcatattttt
ctatttgatggcttaatgatcagttgtaagcctaatcatggccagactcggcttccaggt
tatagtagtgcagaatacaggttaaaagaaaaatttgtcatgaggaaaatacagatttgt
gacaaagaagatacttgtgagtacaagcatgcttttgagttagtatccaaagatgagaac
agcataatatttgccgcaaagtctgctgaagaaaaaaataattggatggcagcccttatt
tctcttcattatcgtagtactctagatcgaatgttagattcagtattattgaaagaagaa
aatgaacaaccactgagattaccaagtcctgaagtctatcgttttgtagtaaaagactct
gaggaaaacattgtttttgaagacagtttgcaaagtaggagtggaatccccattattaaa
ggaggaactgtggtgaaattaattgaaaggctgacatatcatatgtatgcagatcccaat
tttgttcgtacttttcttaccacatatcgttcattttgtaaaccacaggaattgctgagc
ttactgattgaacgctttgaaattccagagccagaacctactgatgcagataaattggca
atagagaaaggcgagcagccaatcagtgcagaccttaaaagatttcgcaaggaatacgtc
caaccagtacaacttaggatcttaaatgtatttcggcattgggttgaacatcatttttat
gactttgaaagagatttggaattacttgaaagactagaatccttcatttcaagtgtaaga
gggaaagctatgaagaaatgggtagagtcaattgctaagatcatcaagaggaagaagcaa
gctcaggcaaatggaataagccataatattacctttgaaagtccacctccaccaattgaa
tggcatatcagcagaccaggacagtttgaaacatttgatctcatgacacttcatccaata
gaaattgcacgtcagcttacacttttggagtctgatctctacaggaaagttcaaccttct
gaacttgtagggagtgtgtggaccaaagaagataaagaaataaattctccaaatttatta
aaaatgattcgccataccacaaatctcaccctctggtttgaaaaatgcattgtggaagca
gaaaattttgaagaacgggtggcaatactaagtagaattatagaaattctgcaagttttt
caagatttgaataatttcaatggcgtattggagatagtcagtgcagtaaattcagtgtca
gtatacagactagaccatacctttgaggcattgcaggaaagaaaacggaaaattttggat
gaagccgtggaattaagtcaagatcactttaaaaaatatctggtaaaacttaagtcaatc
aatccaccttgtgtgcctttttttggaatatatttaacaaatattctgaagaccgaagaa
gggaataatgattttttaaaaaagaaagggaaagatttaatcaatttcagtaagaggaga
aaagtagctgaaattactggagaaattcagcagtatcagaatcagccttattgtttacgg
atagaaccagatatgaggagattctttgaaaaccttaaccccatgggaagtgcatctgaa
aaagagtttacagattatttgttcaacaagtcactagaaattgaacctcgaaactgcaaa
cagccacctcgatttcctaggaaatcaactttctccttaaaatctcctggaataaggcct
aacacaggccgacatggctctacctcaggtactttacgaggtcatccaacaccattagaa
agagaaccatgtaaaataagctttagtcggattgctgaaactgagctggaatcaacagtg
tcagcaccaacctctcctaatacaccatctactccaccagtatctgcttcttcagacgtt
agtgtatttttagatgtggatctcaacagctcctgtggcagcaatagcatctttgctcca
gtgcttttgccacattcaaagtctttctttagttcatgtggtagtttacataaactaagt
gaagagccactaattcctcctccacttcctcctcgaaaaaagtttgatcatgatgcttca
aattccaagggaaatatgaaatctgatgatgatcctcctgctattccaccaagacagcct
cctcctccaaaggtaaaacccagagttcctgttcctactggtgcatttgatgggcctcta
catagtccacctccaccaccaccaagagatcctcttcctgatacccctccaccggttccc
cttcggcctccagaacactttataaactgtccagttaatcttcagccacctccactgggg
catcttcacagagattcagactggctgagagacattagtacgtgtccaaattcaccaagc
actcctcctagcacaccctctccaagggtaccacgtcgatgctatgtgctcagttctagt
cagaataatcttgttcatcctccagctccccctgttccaccaaggcagaattcaagccct
catctgccaaaactgccaccaaagacttacaaacgggagctttcgcaccctccattgtac
agactgcctttgctagaaaatgcagaaactccccaatga
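As a quick consistency check on the SOS2 record above, the stated lengths (1272 aa and 3819 nt) agree if the nucleotide sequence is a complete coding sequence that includes the terminal stop codon; the listed sequence does end in tga. The following is a minimal sketch of that arithmetic. The function name `expected_cds_length` is a hypothetical helper introduced here for illustration and is not part of KEGG or DBGET.

```python
def expected_cds_length(aa_length: int, include_stop: bool = True) -> int:
    """Return the nucleotide length of a CDS that encodes aa_length residues.

    Assumption: a complete coding sequence with three nucleotides per codon,
    plus one terminal stop codon when include_stop is True.
    """
    codons = aa_length + (1 if include_stop else 0)
    return 3 * codons


# Values copied from the SOS2 entry (100410124).
sos2_aa = 1272
sos2_nt = 3819

# True when the two stated lengths are consistent with each other.
print(expected_cds_length(sos2_aa) == sos2_nt)
```

A mismatch here would suggest a partial CDS or a sequence listed without its stop codon, rather than an error in the record itself.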
Callithrix jacchus (white-tufted-ear marmoset): 100414931
Entry
100414931 CDS
T03264
Gene name
SOS1
Definition
(RefSeq) SOS Ras/Rac guanine nucleotide exchange factor 1
KO
K03099
son of sevenless
Organism
cjc
Callithrix jacchus (white-tufted-ear marmoset)
Pathway
cjc01521  EGFR tyrosine kinase inhibitor resistance
cjc01522  Endocrine resistance
cjc04010  MAPK signaling pathway
cjc04012  ErbB signaling pathway
cjc04014  Ras signaling pathway
cjc04062  Chemokine signaling pathway
cjc04068  FoxO signaling pathway
cjc04072  Phospholipase D signaling pathway
cjc04150  mTOR signaling pathway
cjc04151  PI3K-Akt signaling pathway
cjc04510  Focal adhesion
cjc04540  Gap junction
cjc04630  JAK-STAT signaling pathway
cjc04650  Natural killer cell mediated cytotoxicity
cjc04660  T cell receptor signaling pathway
cjc04662  B cell receptor signaling pathway
cjc04664  Fc epsilon RI signaling pathway
cjc04714  Thermogenesis
cjc04722  Neurotrophin signaling pathway
cjc04810  Regulation of actin cytoskeleton
cjc04910  Insulin signaling pathway
cjc04912  GnRH signaling pathway
cjc04915  Estrogen signaling pathway
cjc04917  Prolactin signaling pathway
cjc04926  Relaxin signaling pathway
cjc04935  Growth hormone synthesis, secretion and action
cjc05034  Alcoholism
cjc05160  Hepatitis C
cjc05161  Hepatitis B
cjc05163  Human cytomegalovirus infection
cjc05165  Human papillomavirus infection
cjc05200  Pathways in cancer
cjc05205  Proteoglycans in cancer
cjc05206  MicroRNAs in cancer
cjc05210  Colorectal cancer
cjc05211  Renal cell carcinoma
cjc05213  Endometrial cancer
cjc05214  Glioma
cjc05215  Prostate cancer
cjc05220  Chronic myeloid leukemia
cjc05221  Acute myeloid leukemia
cjc05223  Non-small cell lung cancer
cjc05224  Breast cancer
cjc05225  Hepatocellular carcinoma
cjc05226  Gastric cancer
cjc05231  Choline metabolism in cancer
Brite
KEGG Orthology (KO) [BR:cjc00001]
09130 Environmental Information Processing
09132 Signal transduction
04010 MAPK signaling pathway
100414931 (SOS1)
04012 ErbB signaling pathway
100414931 (SOS1)
04014 Ras signaling pathway
100414931 (SOS1)
04630 JAK-STAT signaling pathway
100414931 (SOS1)
04068 FoxO signaling pathway
100414931 (SOS1)
04072 Phospholipase D signaling pathway
100414931 (SOS1)
04151 PI3K-Akt signaling pathway
100414931 (SOS1)
04150 mTOR signaling pathway
100414931 (SOS1)
09140 Cellular Processes
09144 Cellular community - eukaryotes
04510 Focal adhesion
100414931 (SOS1)
04540 Gap junction
100414931 (SOS1)
09142 Cell motility
04810 Regulation of actin cytoskeleton
100414931 (SOS1)
09150 Organismal Systems
09151 Immune system
04650 Natural killer cell mediated cytotoxicity
100414931 (SOS1)
04660 T cell receptor signaling pathway
100414931 (SOS1)
04662 B cell receptor signaling pathway
100414931 (SOS1)
04664 Fc epsilon RI signaling pathway
100414931 (SOS1)
04062 Chemokine signaling pathway
100414931 (SOS1)
09152 Endocrine system
04910 Insulin signaling pathway
100414931 (SOS1)
04912 GnRH signaling pathway
100414931 (SOS1)
04915 Estrogen signaling pathway
100414931 (SOS1)
04917 Prolactin signaling pathway
100414931 (SOS1)
04926 Relaxin signaling pathway
100414931 (SOS1)
04935 Growth hormone synthesis, secretion and action
100414931 (SOS1)
09156 Nervous system
04722 Neurotrophin signaling pathway
100414931 (SOS1)
09159 Environmental adaptation
04714 Thermogenesis
100414931 (SOS1)
09160 Human Diseases
09161 Cancer: overview
05200 Pathways in cancer
100414931 (SOS1)
05206 MicroRNAs in cancer
100414931 (SOS1)
05205 Proteoglycans in cancer
100414931 (SOS1)
05231 Choline metabolism in cancer
100414931 (SOS1)
09162 Cancer: specific types
05210 Colorectal cancer
100414931 (SOS1)
05225 Hepatocellular carcinoma
100414931 (SOS1)
05226 Gastric cancer
100414931 (SOS1)
05214 Glioma
100414931 (SOS1)
05221 Acute myeloid leukemia
100414931 (SOS1)
05220 Chronic myeloid leukemia
100414931 (SOS1)
05211 Renal cell carcinoma
100414931 (SOS1)
05215 Prostate cancer
100414931 (SOS1)
05213 Endometrial cancer
100414931 (SOS1)
05224 Breast cancer
100414931 (SOS1)
05223 Non-small cell lung cancer
100414931 (SOS1)
09165 Substance dependence
05034 Alcoholism
100414931 (SOS1)
09172 Infectious disease: viral
05161 Hepatitis B
100414931 (SOS1)
05160 Hepatitis C
100414931 (SOS1)
05163 Human cytomegalovirus infection
100414931 (SOS1)
05165 Human papillomavirus infection
100414931 (SOS1)
09176 Drug resistance: antineoplastic
01521 EGFR tyrosine kinase inhibitor resistance
100414931 (SOS1)
01522 Endocrine resistance
100414931 (SOS1)
09180 Brite Hierarchies
09183 Protein families: signaling and cellular processes
04990 Domain-containing proteins not elsewhere classified [BR:cjc04990]
100414931 (SOS1)
Domain-containing proteins not elsewhere classified [BR:cjc04990]
Pleckstrin homology (PH) domain-containing proteins
Dbl-Like RhoGEF family proteins
100414931 (SOS1)
Motif
Pfam: RasGEF RasGEF_N RhoGEF Histone PH PH_19 PH_10 IQ_SEC7_PH PH_13
Other DBs
NCBI-GeneID: 100414931
NCBI-ProteinID: XP_008979072
Ensembl: ENSCJAG00000015029
Position
14
AA seq
1333 aa
MQAQQLPYEFFSEENAPKWRGLLVPALKKVQGQVHPTLESNDDALQYVEELILQLLNMLC
QAQPRSASDVEERVQKSFPHPIDKWAIADAQSAIEKRKRRNPLSLPVEKIHPLLKEVLGY
KIDHQVSVYIVAVLEYISADILKLVGNYVRNIRHYEITKQDIKVAMCADKVLMDMFHQDV
EDINILSLTDEEPSTSGEQTYYDLVKAFMAEIRQYIRELNLIIKVFREPFVSNSKLFSAN
DVENIFSRIVDIHELSVKLLGHIEDTVEMTDEGSPHPLVGSCFEDLAEELAFDPYESYAR
DILRPGFHDRFLSQLSKPGAALYLQSIGEGFKEAVQYVLPRLLLAPVYHCLHYFELLKQL
EEKSEDQEDKECLKQAITALLNVQSGMEKICSKSLAKRRLSESACRFYSQQMKGKQLAIK
KMNEIQKNIDGWEGKDIGQCCNEFIMEGTLTRVGAKHERHIFLFDGLMICCKSNHGQPRL
PGASNAEYRLKEKFFMRKVQINDKDDTNEYKHAFEIILKDENSVIFSAKSAEEKNNWMAA
LISLQYRSTLERMLDVTMLQEEKEEQMRLPSADVYRFAEPDSEENIIFEENMQPKAGIPI
IKAGTVIKLIERLTYHMYADPNFVRTFLTTYRSFCKPQELLSLIIERFEIPEPEPTEADR
IAIENGDQPLSAELKRFRKEYIQPVQLRVLNVCRHWVEHHFYDFERDADLLQRMEEFIGT
VRGKAMKKWVESITKIIQRKKIARDNGPGHNITFQSSPPTVEWHISRPGHIETFDLLTLH
PIEIARQLTLLESDLYRAVQPSELVGSVWTKEDKEINSPNLLKMIRHTTNLTLWFEKCIV
ETENLEERVAVVSRIIEILQVFQELNNFNGVLEVVSAMNSSPVYRLDHTFEQIPSRQKKI
LEEAHELSEDHYKKYLAKLRSINPPCVPFFGIYLTNILKTEEGNPEVLKRHGKELINFSK
RRKVAEITGEIQQYQNQPYCLRVESDIKRFFENLNPMGNSMEKEFTDYLFNKSLEIEPRN
PKPLPRFPKKYSYPLKSPGVRPSNPRPGTMRHPTPLQQEPRKISYSRIPESETESTASAP
NSPRTPLTPPPASGASSTTDVCSVFDSDHSSPFHSSNDTVFIQVTLPYGPRSASVSSISL
TKGTDEMPVPPPVPPRRRPESAPAESSPSKIMSKHLDSPPAIPPRQPTSKAYSPRYSISD
RTSISDPPDSPPLLPPREPVRTPDVFSSSPLHLQPPPLGKKSDHGNAFFPNSPSPFTPPP
PQTPSPHGTRRHLPSPPLTQEVDLHSIAGPPVPPRQSTSQHIPKLPPKTYKREHTHPSMH
RDGPPLLENAHSS
NT seq
4002 nt
atgcaggcgcagcagctgccctacgagtttttcagcgaagagaacgcgcccaagtggcga
ggactgctggtgcctgccctgaaaaaggtccaggggcaagttcatcctactcttgagtct
aatgatgatgctcttcagtatgttgaagaattaattttgcagttattaaatatgctatgc
caagctcagccccgaagtgcttcagatgtagaggaacgtgttcaaaaaagtttccctcat
ccaattgataagtgggcaatagctgatgcccaatcagctattgaaaagaggaagcgaaga
aaccctttatctctcccagtagaaaaaattcatcctttattaaaggaggtcctaggttat
aaaattgaccaccaggtttctgtttacatagtagcagtcttagaatacatttctgcagac
attttaaagctggttgggaattatgtaagaaatatacggcattatgaaattacaaaacaa
gatattaaagtggcaatgtgtgctgacaaggtattgatggatatgtttcatcaagatgta
gaagatattaatatattatctttaactgatgaagaaccttccacttcaggagaacaaact
tactatgatttggtaaaagcatttatggcagaaattcgacaatatataagggaactaaat
ctaattataaaagtttttagagagccttttgtctccaattcaaaattgttttcagctaat
gatgtagaaaatatatttagtcgtatagtagatatacatgaacttagtgtaaagttactg
ggccatatagaagatacagtagaaatgacagatgaagggagtccccatccactagtagga
agctgctttgaagacttagcagaggaactggcatttgatccatatgaatcgtatgctcga
gatattttgcggcctggttttcatgatcgtttccttagtcagttatcgaagcctggggca
gcactttatttgcagtcaataggtgaaggtttcaaagaagctgttcaatatgttttaccc
aggctgcttctggctcctgtttaccactgtctccattattttgaacttttgaagcagtta
gaagaaaaaagtgaagatcaagaagacaaggaatgtttaaaacaagcaataacagctttg
cttaatgttcagagtggtatggaaaaaatttgttctaaaagtcttgcaaaacgaagactg
agtgaatctgcatgtcggttttatagtcaacaaatgaaggggaaacaactagcaatcaag
aaaatgaacgagattcagaagaatattgatggttgggagggaaaagacattggacagtgt
tgcaatgaatttataatggaaggaactcttacacgtgtaggagccaaacatgagagacac
atatttctctttgatggcttaatgatttgctgtaaatcaaatcatgggcagccaagactt
cctggtgctagcaatgcagaatatcgtcttaaagaaaagttttttatgcgaaaggtacaa
attaatgataaagatgacaccaatgaatacaagcatgcttttgaaataattttaaaagat
gaaaatagtgttatattttctgccaagtcagctgaagaaaaaaacaactggatggcagca
ttgatatctttacagtaccggagtacattggaaaggatgcttgatgtaacaatgctacag
gaagagaaagaggagcagatgaggctgcctagtgctgatgtttatagatttgcagagcct
gactctgaagagaatattatatttgaagagaacatgcagcccaaggctggaattccaatt
atcaaagcaggaactgttattaaacttatagagaggcttacataccatatgtatgcagat
cccaattttgttcggacatttcttacaacatacagatccttttgcaaacctcaagaacta
ctgagtcttataatagaaaggtttgaaattccagagcctgagccaacagaagctgatcgc
atagctatagagaatggagatcaacctttgagtgcagaattaaaaagatttagaaaagaa
tatatacagcctgtgcagcttcgagtattaaatgtatgtcggcactgggtagagcaccac
ttctatgattttgaaagagatgcagatcttttacaacgaatggaagaatttattggaaca
gtaagaggtaaagcaatgaaaaaatgggttgaatccatcactaaaataatccaaaggaaa
aaaattgcaagagacaatggaccaggtcataatattacatttcagagttcacctcccaca
gttgagtggcatataagcagacctgggcacatagagacttttgacctgctcaccttacac
ccaatagaaattgctcgacaactcactttacttgaatcagatctataccgagctgtacag
ccatcagaattagttggaagtgtgtggacaaaagaagacaaagaaattaactctcctaat
cttctgaagatgattcgacataccaccaacctcactctatggtttgagaaatgtattgta
gaaactgaaaatttggaagaaagagtagctgtggtgagtcgaataattgagattctacaa
gtctttcaagagctgaacaactttaatggtgtccttgaagttgtcagtgccatgaattca
tcacctgtttacagattagaccacacatttgagcaaataccaagtcgccagaagaaaatt
ttagaagaagctcatgaattgagtgaagatcactataagaaatatttggcaaaactcagg
tctattaatccaccatgtgtgcctttctttggaatttatctcactaatatcttgaaaaca
gaagaaggcaaccctgaggtcctaaaaagacatggaaaagagcttataaactttagcaaa
aggaggaaagtagcagaaataacaggagagatccagcagtaccaaaatcagccttactgt
ttacgagtagaatcagatatcaaaaggttctttgaaaacttgaatccaatgggaaatagc
atggaaaaagaatttacagattatcttttcaacaaatccctagaaatagaaccacgaaac
cctaagcctctcccaagatttccaaaaaaatatagctatcccctaaaatctcctggtgtt
cgtccatcaaacccaagaccaggtaccatgaggcatcccacacctcttcagcaggagcca
aggaaaattagttatagtaggatccctgaaagtgaaacagaaagtacggcatctgcacca
aattctccaagaacaccgttaacacctccgcctgcttctggtgcttccagtaccacagat
gtttgcagtgtatttgattccgatcattcgagcccttttcactcaagcaatgataccgtc
tttatccaagtcacactgccctatggcccaagatctgcttctgtatcatctataagttta
accaaaggcacggatgaaatgcctgtccctcctcctgttcctccacgaagacgaccagaa
tctgccccagcagaatcttcaccatctaagattatgtctaagcatttggacagcccccca
gccattcctcctaggcaacccacatcaaaagcctattcaccacggtattcaatatcagac
cggacttctatatcagatcctcctgatagccctcccttattaccaccacgagaacctgtg
aggacacctgatgttttctcaagctcaccactacatctccaacctccccctttgggcaaa
aaaagtgatcatggcaatgccttcttcccaaatagcccctccccctttacaccacctcct
cctcaaacaccttctcctcacggcacaagaaggcatctgccatcaccaccattgacacaa
gaagtggaccttcattccattgctgggccgcctgttcctccacgacaaagcacttctcaa
cacatccctaaactccctccaaaaacttacaaaagggagcacacacacccatccatgcac
agagatggaccaccactgttggagaatgcccattcttcctga
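The SOS1 amino acid and nucleotide sequences above can be cross-checked by translating the start of the coding sequence with the standard genetic code. The sketch below assumes the standard code applies to this nuclear gene and translates only the first 60 nt line of the record; `translate`, `CODON_TABLE`, and the other names are hypothetical helpers introduced for illustration, not KEGG functions.

```python
from itertools import product

# Standard genetic code, with codons enumerated in t, c, a, g order.
BASES = "tcag"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(nt: str) -> str:
    """Translate a nucleotide string in frame 1; '*' marks a stop codon.

    Trailing bases that do not fill a complete codon are ignored.
    """
    nt = nt.lower()
    usable = len(nt) - len(nt) % 3
    return "".join(CODON_TABLE[nt[i:i + 3]] for i in range(0, usable, 3))


# First nucleotide line and first 20 residues copied from the SOS1 entry.
first_nt_line = "atgcaggcgcagcagctgccctacgagtttttcagcgaagagaacgcgcccaagtggcga"
first_aa = "MQAQQLPYEFFSEENAPKWR"

# True when the translated prefix matches the listed protein prefix.
print(translate(first_nt_line) == first_aa)
```

Applying the same function to the full nucleotide sequence (with line breaks removed) should reproduce the listed protein followed by a single `*` for the terminal stop codon, under the same assumption about the genetic code.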