Sus scrofa (pig): 100156602
Entry: 100156602 CDS T01009
Gene name: SOS2
Definition: (RefSeq) son of sevenless homolog 2
KO: K03099 son of sevenless
Organism: ssc Sus scrofa (pig)
Pathway
ssc01521  EGFR tyrosine kinase inhibitor resistance
ssc01522  Endocrine resistance
ssc04010  MAPK signaling pathway
ssc04012  ErbB signaling pathway
ssc04014  Ras signaling pathway
ssc04062  Chemokine signaling pathway
ssc04068  FoxO signaling pathway
ssc04072  Phospholipase D signaling pathway
ssc04150  mTOR signaling pathway
ssc04151  PI3K-Akt signaling pathway
ssc04510  Focal adhesion
ssc04540  Gap junction
ssc04630  JAK-STAT signaling pathway
ssc04650  Natural killer cell mediated cytotoxicity
ssc04660  T cell receptor signaling pathway
ssc04662  B cell receptor signaling pathway
ssc04664  Fc epsilon RI signaling pathway
ssc04714  Thermogenesis
ssc04722  Neurotrophin signaling pathway
ssc04810  Regulation of actin cytoskeleton
ssc04910  Insulin signaling pathway
ssc04912  GnRH signaling pathway
ssc04915  Estrogen signaling pathway
ssc04917  Prolactin signaling pathway
ssc04926  Relaxin signaling pathway
ssc04935  Growth hormone synthesis, secretion and action
ssc05034  Alcoholism
ssc05160  Hepatitis C
ssc05161  Hepatitis B
ssc05163  Human cytomegalovirus infection
ssc05165  Human papillomavirus infection
ssc05200  Pathways in cancer
ssc05205  Proteoglycans in cancer
ssc05206  MicroRNAs in cancer
ssc05210  Colorectal cancer
ssc05211  Renal cell carcinoma
ssc05213  Endometrial cancer
ssc05214  Glioma
ssc05215  Prostate cancer
ssc05220  Chronic myeloid leukemia
ssc05221  Acute myeloid leukemia
ssc05223  Non-small cell lung cancer
ssc05224  Breast cancer
ssc05225  Hepatocellular carcinoma
ssc05226  Gastric cancer
ssc05231  Choline metabolism in cancer
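The pathway identifiers in this list combine the organism code (`ssc`) with a five-digit map number, and the same numbers reappear without the prefix in the Brite hierarchy of this entry. The following is a minimal sketch of that decomposition; the helper names `split_pathway_id` and `reference_map_id` are hypothetical and not part of any KEGG tool, and the `map` prefix for reference pathways is the usual KEGG convention rather than something stated in this record.

```python
import re
from typing import Tuple

# Assumption: an organism-specific pathway ID is a short lowercase
# organism code followed by exactly five digits (e.g. "ssc04010").
_PATHWAY_ID = re.compile(r"^([a-z]{2,4})(\d{5})$")


def split_pathway_id(pathway_id: str) -> Tuple[str, str]:
    """Split an ID such as 'ssc04010' into ('ssc', '04010')."""
    match = _PATHWAY_ID.match(pathway_id)
    if match is None:
        raise ValueError(f"not a pathway ID: {pathway_id!r}")
    return match.group(1), match.group(2)


def reference_map_id(pathway_id: str) -> str:
    """Build the organism-independent reference ID ('map' + number)."""
    _, number = split_pathway_id(pathway_id)
    return "map" + number


print(split_pathway_id("ssc04010"))   # ('ssc', '04010')
print(reference_map_id("ssc04010"))   # map04010
```

The bare number returned here (`04010`) is the form used in the Brite listing, which makes it a convenient join key between the two sections.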
Brite
KEGG Orthology (KO) [BR:ssc00001]
  09130 Environmental Information Processing
    09132 Signal transduction
      04010 MAPK signaling pathway
        100156602 (SOS2)
      04012 ErbB signaling pathway
        100156602 (SOS2)
      04014 Ras signaling pathway
        100156602 (SOS2)
      04630 JAK-STAT signaling pathway
        100156602 (SOS2)
      04068 FoxO signaling pathway
        100156602 (SOS2)
      04072 Phospholipase D signaling pathway
        100156602 (SOS2)
      04151 PI3K-Akt signaling pathway
        100156602 (SOS2)
      04150 mTOR signaling pathway
        100156602 (SOS2)
  09140 Cellular Processes
    09144 Cellular community - eukaryotes
      04510 Focal adhesion
        100156602 (SOS2)
      04540 Gap junction
        100156602 (SOS2)
    09142 Cell motility
      04810 Regulation of actin cytoskeleton
        100156602 (SOS2)
  09150 Organismal Systems
    09151 Immune system
      04650 Natural killer cell mediated cytotoxicity
        100156602 (SOS2)
      04660 T cell receptor signaling pathway
        100156602 (SOS2)
      04662 B cell receptor signaling pathway
        100156602 (SOS2)
      04664 Fc epsilon RI signaling pathway
        100156602 (SOS2)
      04062 Chemokine signaling pathway
        100156602 (SOS2)
    09152 Endocrine system
      04910 Insulin signaling pathway
        100156602 (SOS2)
      04912 GnRH signaling pathway
        100156602 (SOS2)
      04915 Estrogen signaling pathway
        100156602 (SOS2)
      04917 Prolactin signaling pathway
        100156602 (SOS2)
      04926 Relaxin signaling pathway
        100156602 (SOS2)
      04935 Growth hormone synthesis, secretion and action
        100156602 (SOS2)
    09156 Nervous system
      04722 Neurotrophin signaling pathway
        100156602 (SOS2)
    09159 Environmental adaptation
      04714 Thermogenesis
        100156602 (SOS2)
  09160 Human Diseases
    09161 Cancer: overview
      05200 Pathways in cancer
        100156602 (SOS2)
      05206 MicroRNAs in cancer
        100156602 (SOS2)
      05205 Proteoglycans in cancer
        100156602 (SOS2)
      05231 Choline metabolism in cancer
        100156602 (SOS2)
    09162 Cancer: specific types
      05210 Colorectal cancer
        100156602 (SOS2)
      05225 Hepatocellular carcinoma
        100156602 (SOS2)
      05226 Gastric cancer
        100156602 (SOS2)
      05214 Glioma
        100156602 (SOS2)
      05221 Acute myeloid leukemia
        100156602 (SOS2)
      05220 Chronic myeloid leukemia
        100156602 (SOS2)
      05211 Renal cell carcinoma
        100156602 (SOS2)
      05215 Prostate cancer
        100156602 (SOS2)
      05213 Endometrial cancer
        100156602 (SOS2)
      05224 Breast cancer
        100156602 (SOS2)
      05223 Non-small cell lung cancer
        100156602 (SOS2)
    09165 Substance dependence
      05034 Alcoholism
        100156602 (SOS2)
    09172 Infectious disease: viral
      05161 Hepatitis B
        100156602 (SOS2)
      05160 Hepatitis C
        100156602 (SOS2)
      05163 Human cytomegalovirus infection
        100156602 (SOS2)
      05165 Human papillomavirus infection
        100156602 (SOS2)
    09176 Drug resistance: antineoplastic
      01521 EGFR tyrosine kinase inhibitor resistance
        100156602 (SOS2)
      01522 Endocrine resistance
        100156602 (SOS2)
  09180 Brite Hierarchies
    09183 Protein families: signaling and cellular processes
      04990 Domain-containing proteins not elsewhere classified [BR:ssc04990]
        100156602 (SOS2)
Domain-containing proteins not elsewhere classified [BR:ssc04990]
  Pleckstrin homology (PH) domain-containing proteins
    Dbl-Like RhoGEF family proteins
      100156602 (SOS2)
Motif
Pfam: RasGEF RasGEF_N RhoGEF PH Histone IQ_SEC7_PH PH_19 PH_13
Other DBs
NCBI-GeneID: 100156602
NCBI-ProteinID: XP_020957098
Ensembl: ENSSSCG00000005015
Position: 1
AA seq: 1332 aa
MQQAPQPYEFFSEENSPKWRGLLVSALRKVQEQVHPNLSANEESLYYIEELIFQLLNKLC
LAQPRTVQDVEERVQKTFPHPIDKWAIADAQSAIEKRKRRNPLLLPVDKIHPSLKEVLGY
KVDYHVSLYIVAVLEYISADILKLAGNYVFNIRHYEISQQDIKVSMCADKVLMDMFDQDD
IGLVSLCEDEPSSSGELNYYDLVRTEIAEERQYLRELNMIIKVFREAFLSDRKLFKPSDI
EKIFSNILDIHELTVKLLGLVEDTVEMTDESSPHPLAGSCFEDLAEEQAFDPYETLSQDI
LSPEFNEHFSKLMARPAVALHFQSIADGFKEAVRYVLPRLMLVPVYHCWHYFELLKQLKA
CSEEQEDRECLNQAITALLNLQGSMDRIYKQHSPRRRPGDPVCPFYNRQLRSKHLAIKKM
NEIQKNIDGWEGKDIGQCCNEFIMEGPLTRVGAKHERHIFLFDGLMISCKPNHSQSRLPG
YSNAEYRLKEKFVMRKIQICDKEDTCEYKHAFELVSKDENSIIFAAKSAEEKSNWMAALI
SLHYRSTLDRMLDSVLLKEENEQPLRLPSPEVYRFVVKDSEENIVFEDNLQSRSGIPIIK
GGTVVKLIERLTYHMYADPNFVRTFLTTYRSFCKPQELLSLLIERFEIPEPEPTEADKLA
VEKGEQPISADLKRFRKEYVQPVQLRILNVFRHWVEHHFYDFERDLELLERLESFISSVR
GKAMKKWVESIAKIIKRKKQAQANGISHNITFESPPPPIEWHISRPGQFETFDLMTLHPI
EIARQLTLLESDLYRKVQPSELVGSVWTKEDKEINSPNLLKMIRHTTNLTLWFEKCIVEA
ENFEERVAVLSRIIEILQVFQDLNNFNGVLEIVSAVNSVSVYRLDHTFEALQERKRKILD
EAVELSQDHFKKYLVKLKSINPPCVPFFGIYLTNILKTEEGNNDFLKKKGKDLINFSKRR
KVAEITGEIQQYQNQPYCLRIEPEMRRFFENLNPMGNSSEKEFTDYLFNKSLEIEPRNCK
QPPRFPRKSTFSLKSPGIRPNTGRHGSTSGTLRGHPTPLEREPCKISFSRIAETDLESTV
SAPTSPNTPSTPPVSASSDLSVFLDVDLNSSCGSNSIFAPVLLPHSKSFFSSCGSLHKLS
EEPLIPPPLPPRKKFDHDASNSKGNMKSDDDPPAIPPRQPPPPKVKPRVPAPTGVFDGPL
PSPPLPPPRDPLPDTPPPVPLRPPEHFINCPFTLQPPPLGHLHRDPDWFRDVSMCPNSPN
TPPSTPSPRVPRRCYVLSSSQNNLAHPQAPPIPPRQNSSPHLPKLPPKTYKRELSHPPLY
RLPLLENAETPQ
NT seq: 3999 nt
atgcagcaggcgccgcagccctacgagttcttcagcgaagagaacagtccgaaatggcgg
ggactcttagtctcggccctgcggaaggttcaggaacaagtacatcccaatctctcagca
aatgaagagtctctctattatattgaagagctgatttttcagctgcttaataaattatgc
ctggcccaaccaaggactgttcaagatgtggaggaacgagttcagaagacttttcctcat
ccaattgataaatgggctattgctgatgcacagtctgccatagagaaacgaaaacgaaga
aatcctctcttgctgcctgtggataaaatccatccttcattgaaggaagttttagggtac
aaggtggactaccatgtatccctgtatattgtggctgtactagagtatatctcagctgat
attttgaaattggctggtaattatgtttttaatatccgacattatgaaatatcccaacaa
gacattaaagtgtcaatgtgtgcagataaggtattgatggacatgtttgatcaggatgac
ataggcttggtttctctctgtgaagatgaacctagttcttcaggtgaattaaactactac
gaccttgtcagaactgaaattgcagaagaaaggcagtatctacgggaactaaatatgatc
ataaaagtgtttcgagaagcttttctttctgacagaaagctttttaaaccttctgatatt
gaaaagattttcagtaacattttagatatacatgaattgacggtgaaacttttaggtttg
gttgaagacacagttgaaatgactgatgaaagcagccctcatcccttagctggcagctgt
tttgaagatttggcagaagagcaagcatttgatccttatgaaacgttatcacaggatatt
ctttcaccagaatttaatgaacatttcagtaagttgatggccagacctgcagtggctcta
cactttcagtccattgctgatggctttaaagaggcagttcgctatgttcttccccgccta
atgttggtgccagtgtatcattgttggcactattttgaattattaaagcaattgaaagca
tgtagtgaagagcaggaagacagagagtgtttgaaccaagctataacagctctcttgaac
ctccaaggtagtatggaccgaatttacaagcagcattcacctagacgccgacctggggat
cctgtttgccctttttataatcgtcaattaagaagcaaacacctggctattaaaaaaatg
aatgaaattcagaaaaacatagatggctgggaaggcaaagatattggacagtgttgtaat
gaatttattatggaaggcccattgacaagagttggtgctaaacatgaacgacatattttt
ctttttgatggcttaatgattagttgcaaacccaatcatagccagtcacgccttccagga
tacagtaatgcagaatacagattaaaagaaaaatttgtcatgaggaaaatacaaatatgt
gataaagaagatacttgtgagtacaaacatgcttttgaattagtatccaaagatgaaaac
agcataatattcgctgctaagtctgctgaagagaaaagtaattggatggcagcacttatt
tctcttcattatcgtagtactctggatcgaatgctagattcagtattgttgaaggaagaa
aatgaacaaccactgagattaccaagtcctgaagtgtatcgttttgtggtaaaggactct
gaagaaaacattgtttttgaagacaacttgcaaagtagaagtggaatccccattattaaa
ggaggaactgtagtgaaattaattgaaaggttaacttatcatatgtatgcagatcccaat
tttgttcgtacttttcttactacgtatcgttcattttgtaaaccacaggaattgctaagc
ttactgattgaacgatttgaaattccagagccagaacctactgaagcagataaattggca
gtagagaaaggcgagcagcctatcagtgcagaccttaaaagatttcgcaaggaatacgtc
caaccagtacaacttaggatcttaaatgtgtttcggcactgggttgaacatcatttttat
gactttgaaagagatttggagttgcttgaaagactagaatccttcatttcaagtgtaaga
gggaaagctatgaagaaatgggtagagtcaattgctaagattatcaagaggaagaaacaa
gctcaggcaaatggaataagccataatattacctttgaaagcccacctcccccaattgaa
tggcatatcagtagaccaggacagtttgaaacatttgatctcatgacacttcatccaata
gaaattgcacgtcagctaacacttttggaatctgatctctacaggaaagtccaaccttct
gaacttgtagggagtgtatggaccaaagaagataaagaaataaattctccaaatttatta
aaaatgattcgtcacaccacaaatctcaccctctggtttgaaaagtgcattgtggaagca
gaaaattttgaggaacgggtggcagtactgagtagaattatagaaattctgcaagttttt
caagatttgaataatttcaatggtgtattggagatcgtcagtgcagtaaattcagtgtca
gtttatagactagatcatacctttgaggcattgcaggaaagaaaaaggaaaattttggat
gaagctgtggaattaagtcaagatcattttaaaaaatatctagtgaaacttaagtcaatc
aatccaccttgtgtgcctttttttggaatatatttaacaaatattctgaagactgaagaa
gggaataatgattttttaaaaaagaaagggaaagatttaatcaatttcagtaagaggagg
aaagtagctgaaattactggagaaattcagcagtatcagaatcaaccttactgtttacgg
atagaaccagaaatgaggaggttctttgaaaaccttaaccccatgggaaattcttctgaa
aaagagtttacagattatttgttcaacaagtcactagaaattgaaccccgaaattgcaaa
cagccacctcgatttcctaggaaatcaactttctctttaaaatctcctggaataaggcct
aatacaggccgacatggctctacctcaggtactttacgaggccatccaacaccattagaa
agagaaccatgtaaaataagctttagtcggattgctgaaacagaccttgaatcaacagtg
tcagcaccaacctctccaaacacaccatctactccaccagtatctgcttcttcagacctt
agtgtgtttttagatgtggatctcaacagttcctgtggaagcaatagcatctttgctcca
gtcctcttgccacattcaaagtctttcttcagttcatgtggtagtttacataaactaagt
gaagagccactgattcctcctccgcttcctcctcgaaaaaagtttgatcatgatgcttca
aattccaagggaaatatgaaatccgatgatgacccccctgctattccgccaagacaacct
cctcctccaaaggtaaaacccagagttcctgctcctactggtgtgtttgacggacctctg
cctagtccacctctaccaccgccaagagatcctcttcctgatacccctccaccggttccc
cttcggcctccagaacactttataaactgtccattcactctccagccacctccactggga
catcttcacagagatccagactggttcagagatgttagtatgtgtccaaattctccaaac
actcctccaagcacaccatctccaagagtaccacgtcgatgctatgtgctcagttctagt
caaaataatctcgctcatcctcaagccccccccattccaccaaggcagaattcaagccct
cacctaccaaaactgccaccaaagacttataaacgggagctttcgcaccccccattgtat
agactgcctttgctagaaaatgcagaaactcctcaatga
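The stated lengths of this entry are consistent with each other: 1332 residues plus one stop codon give 3 × 1333 = 3999 nt. The sketch below checks that arithmetic and translates the first 30 and last 15 nucleotides of the NT seq above with the standard genetic code. It is an illustration only; the function names `translate` and `expected_cds_length` are hypothetical, and the use of the standard nuclear code is an assumption, since the record does not name a translation table.

```python
# Standard genetic code, codons ordered with bases T, C, A, G
# at each of the three positions (assumed; not stated in the record).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate an in-frame DNA string; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


def expected_cds_length(protein_length: int) -> int:
    """CDS length in nt for a protein of the given length, stop codon included."""
    return 3 * (protein_length + 1)


first_30_nt = "atgcagcaggcgccgcagccctacgagttc"  # start of the NT seq above
last_15_nt = "gaaactcctcaatga"                  # end of the NT seq above

print(translate(first_30_nt))     # MQQAPQPYEF
print(translate(last_15_nt))      # ETPQ*
print(expected_cds_length(1332))  # 3999
```

Both translated fragments agree with the start (`MQQAPQPYEF`) and end (`ETPQ`) of the AA seq listed for this entry.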
Sus scrofa (pig): 100520187
Entry: 100520187 CDS T01009
Gene name: SOS1
Definition: (RefSeq) son of sevenless homolog 1 isoform X1
KO: K03099 son of sevenless
Organism: ssc Sus scrofa (pig)
Pathway
ssc01521  EGFR tyrosine kinase inhibitor resistance
ssc01522  Endocrine resistance
ssc04010  MAPK signaling pathway
ssc04012  ErbB signaling pathway
ssc04014  Ras signaling pathway
ssc04062  Chemokine signaling pathway
ssc04068  FoxO signaling pathway
ssc04072  Phospholipase D signaling pathway
ssc04150  mTOR signaling pathway
ssc04151  PI3K-Akt signaling pathway
ssc04510  Focal adhesion
ssc04540  Gap junction
ssc04630  JAK-STAT signaling pathway
ssc04650  Natural killer cell mediated cytotoxicity
ssc04660  T cell receptor signaling pathway
ssc04662  B cell receptor signaling pathway
ssc04664  Fc epsilon RI signaling pathway
ssc04714  Thermogenesis
ssc04722  Neurotrophin signaling pathway
ssc04810  Regulation of actin cytoskeleton
ssc04910  Insulin signaling pathway
ssc04912  GnRH signaling pathway
ssc04915  Estrogen signaling pathway
ssc04917  Prolactin signaling pathway
ssc04926  Relaxin signaling pathway
ssc04935  Growth hormone synthesis, secretion and action
ssc05034  Alcoholism
ssc05160  Hepatitis C
ssc05161  Hepatitis B
ssc05163  Human cytomegalovirus infection
ssc05165  Human papillomavirus infection
ssc05200  Pathways in cancer
ssc05205  Proteoglycans in cancer
ssc05206  MicroRNAs in cancer
ssc05210  Colorectal cancer
ssc05211  Renal cell carcinoma
ssc05213  Endometrial cancer
ssc05214  Glioma
ssc05215  Prostate cancer
ssc05220  Chronic myeloid leukemia
ssc05221  Acute myeloid leukemia
ssc05223  Non-small cell lung cancer
ssc05224  Breast cancer
ssc05225  Hepatocellular carcinoma
ssc05226  Gastric cancer
ssc05231  Choline metabolism in cancer
Brite
KEGG Orthology (KO) [BR:ssc00001]
  09130 Environmental Information Processing
    09132 Signal transduction
      04010 MAPK signaling pathway
        100520187 (SOS1)
      04012 ErbB signaling pathway
        100520187 (SOS1)
      04014 Ras signaling pathway
        100520187 (SOS1)
      04630 JAK-STAT signaling pathway
        100520187 (SOS1)
      04068 FoxO signaling pathway
        100520187 (SOS1)
      04072 Phospholipase D signaling pathway
        100520187 (SOS1)
      04151 PI3K-Akt signaling pathway
        100520187 (SOS1)
      04150 mTOR signaling pathway
        100520187 (SOS1)
  09140 Cellular Processes
    09144 Cellular community - eukaryotes
      04510 Focal adhesion
        100520187 (SOS1)
      04540 Gap junction
        100520187 (SOS1)
    09142 Cell motility
      04810 Regulation of actin cytoskeleton
        100520187 (SOS1)
  09150 Organismal Systems
    09151 Immune system
      04650 Natural killer cell mediated cytotoxicity
        100520187 (SOS1)
      04660 T cell receptor signaling pathway
        100520187 (SOS1)
      04662 B cell receptor signaling pathway
        100520187 (SOS1)
      04664 Fc epsilon RI signaling pathway
        100520187 (SOS1)
      04062 Chemokine signaling pathway
        100520187 (SOS1)
    09152 Endocrine system
      04910 Insulin signaling pathway
        100520187 (SOS1)
      04912 GnRH signaling pathway
        100520187 (SOS1)
      04915 Estrogen signaling pathway
        100520187 (SOS1)
      04917 Prolactin signaling pathway
        100520187 (SOS1)
      04926 Relaxin signaling pathway
        100520187 (SOS1)
      04935 Growth hormone synthesis, secretion and action
        100520187 (SOS1)
    09156 Nervous system
      04722 Neurotrophin signaling pathway
        100520187 (SOS1)
    09159 Environmental adaptation
      04714 Thermogenesis
        100520187 (SOS1)
  09160 Human Diseases
    09161 Cancer: overview
      05200 Pathways in cancer
        100520187 (SOS1)
      05206 MicroRNAs in cancer
        100520187 (SOS1)
      05205 Proteoglycans in cancer
        100520187 (SOS1)
      05231 Choline metabolism in cancer
        100520187 (SOS1)
    09162 Cancer: specific types
      05210 Colorectal cancer
        100520187 (SOS1)
      05225 Hepatocellular carcinoma
        100520187 (SOS1)
      05226 Gastric cancer
        100520187 (SOS1)
      05214 Glioma
        100520187 (SOS1)
      05221 Acute myeloid leukemia
        100520187 (SOS1)
      05220 Chronic myeloid leukemia
        100520187 (SOS1)
      05211 Renal cell carcinoma
        100520187 (SOS1)
      05215 Prostate cancer
        100520187 (SOS1)
      05213 Endometrial cancer
        100520187 (SOS1)
      05224 Breast cancer
        100520187 (SOS1)
      05223 Non-small cell lung cancer
        100520187 (SOS1)
    09165 Substance dependence
      05034 Alcoholism
        100520187 (SOS1)
    09172 Infectious disease: viral
      05161 Hepatitis B
        100520187 (SOS1)
      05160 Hepatitis C
        100520187 (SOS1)
      05163 Human cytomegalovirus infection
        100520187 (SOS1)
      05165 Human papillomavirus infection
        100520187 (SOS1)
    09176 Drug resistance: antineoplastic
      01521 EGFR tyrosine kinase inhibitor resistance
        100520187 (SOS1)
      01522 Endocrine resistance
        100520187 (SOS1)
  09180 Brite Hierarchies
    09183 Protein families: signaling and cellular processes
      04990 Domain-containing proteins not elsewhere classified [BR:ssc04990]
        100520187 (SOS1)
Domain-containing proteins not elsewhere classified [BR:ssc04990]
  Pleckstrin homology (PH) domain-containing proteins
    Dbl-Like RhoGEF family proteins
      100520187 (SOS1)
Motif
Pfam: RasGEF RasGEF_N RhoGEF Histone PH PH_19 PH_10 IQ_SEC7_PH PH_13
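The Pfam hits listed for SOS1 here and for SOS2 in the previous entry overlap almost completely. A minimal sketch of that comparison with plain Python sets follows; the variable names are illustrative, and the domain names are copied from the two Motif sections of this document rather than fetched from Pfam.

```python
# Pfam hits as listed in the two entries of this document.
sos2_pfam = {
    "RasGEF", "RasGEF_N", "RhoGEF", "PH",
    "Histone", "IQ_SEC7_PH", "PH_19", "PH_13",
}
sos1_pfam = {
    "RasGEF", "RasGEF_N", "RhoGEF", "Histone", "PH",
    "PH_19", "PH_10", "IQ_SEC7_PH", "PH_13",
}

shared = sos1_pfam & sos2_pfam     # hits reported for both paralogs
only_sos1 = sos1_pfam - sos2_pfam  # hits reported only for SOS1
only_sos2 = sos2_pfam - sos1_pfam  # hits reported only for SOS2

print(sorted(shared))
print(sorted(only_sos1))  # ['PH_10']
print(sorted(only_sos2))  # []
```

In these listings the only difference is the extra `PH_10` hit for SOS1; whether that reflects a real domain difference or a score threshold is not something the record itself says.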
Other DBs
NCBI-GeneID: 100520187
NCBI-ProteinID: XP_020943245
Ensembl: ENSSSCG00000030405
Position: 3
AA seq: 1333 aa
MQAQQLPYEFFSEENAPKWRGLLVPALKKVQGQVHPTLESSDDALQYVEELILQLLNMLC
QAQPRSASDVEERVQKSFPHPIDKWAIADAQSAIEKRKRRNPLSLPVEKIHPLLKEVLGY
KIDHQVSVYIVAVLEYISADILKLVGNYVRNIRHYEITKQDIKVAMCADKVLMDMFHQDV
EDINILSLTDEEPSTSGEQTYYDLVKAFMAEIRQYIRELNLIIKVFREPFVSNSKLFSAN
DVENIFSRIVDIHELSVKLLGHIEDTVEMTDEGSPHPLVGSCFEDLAEELAFDPYESYAR
DILRPGFHDHFLSQLSKPGAALYLQSIGEGFKEAVQYVLPRLLLAPVYHCLHYFELLKQL
EEKSEDQEDKECLKQAITALLNVQSGMEKICSKSLAKRRLSESACRFYSQQMKGKQLAIK
KMNEIQKNIDGWEGKDIGQCCNEFIMEGTLTRVGAKHERHIFLFDGLMICCKSNHGQPRL
PGASNAEYRLKEKFFMRKVQINDKDDTSEYKHAFEIILKDENSVIFSAKSAEEKNNWMAA
LISLQYRSTLERMLDVTMLQEEKEEQMRLPSADVYRFAEPDSEENIIFEENMQPKAGIPI
IKAGTVIKLIERLTYHMYADPNFVRTFLTTYRSFCKPQELLSLIIERFEIPEPEPTEADR
IAIENGDQPLSAELKRFRKEYIQPVQLRVLNVCRHWVEHHFYDFERDADLLQRMEEFIGT
VRGKAMKKWVESITKIIQRKKIARDNGPGHNITFQSSPPTVEWHISRPGHIESFDLLTLH
PIEIARQLTLLESDLYRAVQPSELVGSVWTKEDKEINSPNLLKMIRHTTNLTLWFEKCIV
ETENLEERVAVVSRIIEILQVFQELNNFNGVLEVVSAMNSSPVYRLDHTFEQIPSRQKKI
LEEAHELSEDHYKKYLAKLRSINPPCVPFFGIYLTNILKTEEGNPEVLKRHGKELINFSK
RRKVAEITGEIQQYQNQPYCLRVESDIKRFFENLNPMGSSMEKEFTDYLFNKSLEIEPRN
PKPLPRFPKKYNYPLKSPGVRPSNPRPGTMRHPTPLQQEPRKISYSRIPESETESTASAP
NSPRTPLTPPPASGASSTTDVCSVFDSDHSSPFHSSSDTVFIQVTLPHGPRSASVSSISL
TKSTDEVPVPPPVPPRRRPESAPAESSPSKIMSKHLDSPPAIPPRQPTSKAYSPRYSISD
RTSISDPPESPPLLPPREPVRTPDVFSSSPLHLQPPPLGKKSDHSNAFFPNSPSPFTPPP
PQTPSPHGTRRHLPSPPLTQEVDLHSIAGPPVPPRQSTSQHIPKLPPKTYKREHTHPSMH
RDGPPLLENAHSS
NT seq: 4002 nt
atgcaggcgcagcagctgccgtacgagtttttcagcgaggagaacgcgcccaagtggcgg
gggctgctggttcctgcgctgaaaaaggttcaggggcaagttcatcctactcttgagtct
agtgatgatgctcttcagtatgttgaagaattaattttgcaattattaaatatgctgtgc
caagctcagccccgaagtgcttcagatgtagaggaacgagttcaaaaaagtttccctcat
ccaattgataagtgggcaatagctgatgcccagtcggctattgaaaagaggaagcgaaga
aaccctttatctctcccagtagaaaaaattcatcctttgttaaaggaggtcctaggttat
aaaattgaccaccaggtttctgtttacatagtagcagtattagaatacatttctgcagac
attttaaagctggtggggaattatgtgcgaaatatacggcactatgaaattacaaaacaa
gatattaaagtggcaatgtgtgctgataaggtattgatggatatgtttcatcaagatgta
gaagatataaatatattgtctttaactgatgaagagccttccacctcaggagagcaaact
tattatgatttggtaaaagcatttatggcagaaattcgacaatacataagggaactaaat
ttaattataaaagtttttagagaaccctttgtctccaactcaaaattgttttcagctaat
gatgtagaaaatatatttagtcgtatagtagatatacatgaacttagtgtaaagttactg
ggccatatagaagatactgtagaaatgacagatgaaggtagtccccacccattagtagga
agctgctttgaagacttagcagaggaactggcatttgacccatatgaatcatatgctcga
gatattttacgacctggttttcatgatcatttccttagtcagttatcaaagcctggagcg
gcactttatttacagtcaataggcgaaggtttcaaagaagctgttcagtatgttttaccc
cggctgcttctagctcctgtttaccactgtctacattacttcgaacttttgaagcagtta
gaagagaagagtgaagatcaagaagacaaggaatgtttgaaacaagcaataacagctttg
cttaatgttcagagtggtatggaaaaaatatgttctaaaagtcttgcaaaacgaagactg
agtgaatctgcatgtcggttttatagtcaacaaatgaaggggaaacaactagcaatcaag
aaaatgaacgagattcagaagaatattgatggttgggagggaaaagacattggacagtgt
tgcaatgagtttataatggaaggaactcttacacgtgtaggagccaaacatgagagacac
atatttctctttgatggcttaatgatttgctgtaaatcaaatcatgggcagccaagactt
cctggtgctagcaatgcagaatatcgtcttaaagaaaagttttttatgagaaaggtacaa
attaatgacaaagatgacaccagtgagtacaagcatgcttttgaaataattttaaaagat
gaaaatagtgtaatattttctgccaagtcagctgaagagaaaaacaattggatggcagcg
ttgatatctttacagtaccggagtacactggaaaggatgcttgatgtgacaatgctacag
gaagaaaaggaggagcagatgaggcttcctagtgctgatgtttatagatttgcagagcct
gactctgaagaaaatatcatatttgaagaaaacatgcagcccaaagctggaattccaatt
atcaaagcaggaactgttattaaacttatagagaggctcacataccatatgtacgcagat
cccaattttgttcggacatttcttacaacatacagatccttttgtaaacctcaagaacta
ctgagtcttataatagaaaggtttgaaattccagagcctgagccaacagaagctgatcgc
atagctatagaaaatggagatcagcccttgagtgcagaactaaaaaggtttagaaaagaa
tatatacagcccgtacaactgcgagtattaaatgtatgtcggcactgggtagagcaccac
ttctatgattttgaaagagatgcagatcttttgcagcgaatggaggaatttattggaaca
gtaaggggtaaagcaatgaagaaatgggttgaatcaatcactaaaataattcaaaggaaa
aaaattgcaagagacaatggaccaggtcataatattacatttcagagttcacctcccaca
gttgagtggcatataagcagacctgggcacatagagtcttttgacctgctcaccttacac
ccaatagaaattgctcgacaactcactttacttgaatcagatctgtatcgagctgtgcag
ccatcagaattagttggaagtgtgtggacaaaagaagacaaagaaattaattctcctaat
cttctgaaaatgatccggcataccactaatctcactctgtggtttgagaaatgtattgta
gaaactgaaaacttagaagaaagggtagctgtggtgagtcgaataattgagattctgcaa
gtctttcaagagctgaacaatttcaatggtgtacttgaggttgtcagtgctatgaactca
tcacctgtttacagactagaccacaccttcgagcaaataccaagtcgccaaaagaaaatt
ttagaagaagctcatgaactaagtgaagatcactataagaaatatttggcaaaactcagg
tctattaatccaccatgtgtgcctttctttggaatttatctaactaatatcttgaaaaca
gaagaaggcaaccctgaggtcctaaaaaggcatggaaaagagcttataaactttagcaaa
aggaggaaagtggcagaaataacaggcgagatccagcagtaccaaaatcagccttattgt
ttacgagtagaatcagatatcaaaaggttctttgaaaacttgaatccaatgggaagtagc
atggaaaaagaatttacagattatcttttcaacaaatccctagaaatagaaccacgaaac
cctaaacctctcccaagatttccaaaaaaatacaactatcccctaaaatctcctggcgtt
cgtccatcaaacccaagaccaggtaccatgagacatcccacacctctgcagcaggagcca
aggaaaattagttatagtaggatccctgaaagtgaaacagaaagtacagcatctgcacca
aattctccaagaacaccgttaacacctcctcctgcttctggtgcttctagtaccacagat
gtttgcagcgtatttgattctgatcattcaagcccttttcactcaagcagcgataccgtc
tttatccaagttacactgccccatggcccaagatctgcttcagtatcatctataagctta
accaagagcactgatgaagtgcctgtcccccctcctgttcctccacgaagacgaccagaa
tctgccccagcggaatcttcgccatctaagattatgtctaagcatttggacagcccccca
gcaattcctcctaggcaacccacatcaaaagcctattcaccacgatattccatatcagac
cggacctctatatcagaccctcctgaaagccctcccttactaccaccacgagaacctgtg
aggacacctgatgttttctcaagctcaccactacatctccaacctccccctttgggcaaa
aaaagtgaccatagtaatgccttcttcccaaatagcccttccccctttacaccacctcct
cctcaaacaccttctcctcacggcaccagaaggcatctaccatcaccaccattgacacag
gaagtagaccttcattccattgctgggccacctgttcctccacgacaaagcacttctcaa
catatccccaaactccctccaaaaacttacaaaagggagcacacacacccatccatgcac
agagatggaccaccactgttggagaacgcccattcttcctga