KEGG   Equus caballus (horse): 100053889
Entry
100053889         CDS       T01058                                 

Gene name
SOS1
Definition
(RefSeq) son of sevenless homolog 1 isoform X1
  KO
K03099  son of sevenless
Organism
ecb  Equus caballus (horse)
Pathway
ecb01521  EGFR tyrosine kinase inhibitor resistance
ecb01522  Endocrine resistance
ecb04010  MAPK signaling pathway
ecb04012  ErbB signaling pathway
ecb04014  Ras signaling pathway
ecb04062  Chemokine signaling pathway
ecb04068  FoxO signaling pathway
ecb04072  Phospholipase D signaling pathway
ecb04150  mTOR signaling pathway
ecb04151  PI3K-Akt signaling pathway
ecb04510  Focal adhesion
ecb04540  Gap junction
ecb04630  JAK-STAT signaling pathway
ecb04650  Natural killer cell mediated cytotoxicity
ecb04660  T cell receptor signaling pathway
ecb04662  B cell receptor signaling pathway
ecb04664  Fc epsilon RI signaling pathway
ecb04714  Thermogenesis
ecb04722  Neurotrophin signaling pathway
ecb04810  Regulation of actin cytoskeleton
ecb04910  Insulin signaling pathway
ecb04912  GnRH signaling pathway
ecb04915  Estrogen signaling pathway
ecb04917  Prolactin signaling pathway
ecb04926  Relaxin signaling pathway
ecb04935  Growth hormone synthesis, secretion and action
ecb05034  Alcoholism
ecb05160  Hepatitis C
ecb05161  Hepatitis B
ecb05163  Human cytomegalovirus infection
ecb05165  Human papillomavirus infection
ecb05200  Pathways in cancer
ecb05205  Proteoglycans in cancer
ecb05206  MicroRNAs in cancer
ecb05210  Colorectal cancer
ecb05211  Renal cell carcinoma
ecb05213  Endometrial cancer
ecb05214  Glioma
ecb05215  Prostate cancer
ecb05220  Chronic myeloid leukemia
ecb05221  Acute myeloid leukemia
ecb05223  Non-small cell lung cancer
ecb05224  Breast cancer
ecb05225  Hepatocellular carcinoma
ecb05226  Gastric cancer
ecb05231  Choline metabolism in cancer
Brite
KEGG Orthology (KO) [BR:ecb00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04010 MAPK signaling pathway
    100053889 (SOS1)
   04012 ErbB signaling pathway
    100053889 (SOS1)
   04014 Ras signaling pathway
    100053889 (SOS1)
   04630 JAK-STAT signaling pathway
    100053889 (SOS1)
   04068 FoxO signaling pathway
    100053889 (SOS1)
   04072 Phospholipase D signaling pathway
    100053889 (SOS1)
   04151 PI3K-Akt signaling pathway
    100053889 (SOS1)
   04150 mTOR signaling pathway
    100053889 (SOS1)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    100053889 (SOS1)
   04540 Gap junction
    100053889 (SOS1)
  09142 Cell motility
   04810 Regulation of actin cytoskeleton
    100053889 (SOS1)
 09150 Organismal Systems
  09151 Immune system
   04650 Natural killer cell mediated cytotoxicity
    100053889 (SOS1)
   04660 T cell receptor signaling pathway
    100053889 (SOS1)
   04662 B cell receptor signaling pathway
    100053889 (SOS1)
   04664 Fc epsilon RI signaling pathway
    100053889 (SOS1)
   04062 Chemokine signaling pathway
    100053889 (SOS1)
  09152 Endocrine system
   04910 Insulin signaling pathway
    100053889 (SOS1)
   04912 GnRH signaling pathway
    100053889 (SOS1)
   04915 Estrogen signaling pathway
    100053889 (SOS1)
   04917 Prolactin signaling pathway
    100053889 (SOS1)
   04926 Relaxin signaling pathway
    100053889 (SOS1)
   04935 Growth hormone synthesis, secretion and action
    100053889 (SOS1)
  09156 Nervous system
   04722 Neurotrophin signaling pathway
    100053889 (SOS1)
  09159 Environmental adaptation
   04714 Thermogenesis
    100053889 (SOS1)
 09160 Human Diseases
  09161 Cancer: overview
   05200 Pathways in cancer
    100053889 (SOS1)
   05206 MicroRNAs in cancer
    100053889 (SOS1)
   05205 Proteoglycans in cancer
    100053889 (SOS1)
   05231 Choline metabolism in cancer
    100053889 (SOS1)
  09162 Cancer: specific types
   05210 Colorectal cancer
    100053889 (SOS1)
   05225 Hepatocellular carcinoma
    100053889 (SOS1)
   05226 Gastric cancer
    100053889 (SOS1)
   05214 Glioma
    100053889 (SOS1)
   05221 Acute myeloid leukemia
    100053889 (SOS1)
   05220 Chronic myeloid leukemia
    100053889 (SOS1)
   05211 Renal cell carcinoma
    100053889 (SOS1)
   05215 Prostate cancer
    100053889 (SOS1)
   05213 Endometrial cancer
    100053889 (SOS1)
   05224 Breast cancer
    100053889 (SOS1)
   05223 Non-small cell lung cancer
    100053889 (SOS1)
  09165 Substance dependence
   05034 Alcoholism
    100053889 (SOS1)
  09172 Infectious disease: viral
   05161 Hepatitis B
    100053889 (SOS1)
   05160 Hepatitis C
    100053889 (SOS1)
   05163 Human cytomegalovirus infection
    100053889 (SOS1)
   05165 Human papillomavirus infection
    100053889 (SOS1)
  09176 Drug resistance: antineoplastic
   01521 EGFR tyrosine kinase inhibitor resistance
    100053889 (SOS1)
   01522 Endocrine resistance
    100053889 (SOS1)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   04990 Domain-containing proteins not elsewhere classified [BR:ecb04990]
    100053889 (SOS1)
Domain-containing proteins not elsewhere classified [BR:ecb04990]
 Pleckstrin homology (PH) domain-containing proteins
  Dbl-Like RhoGEF family proteins
   100053889 (SOS1)
SSDB
Motif
Pfam: RasGEF RasGEF_N RhoGEF Histone PH PH_19 PH_10 IQ_SEC7_PH PH_13
Other DBs
NCBI-GeneID: 100053889
NCBI-ProteinID: XP_005600089
Ensembl: ENSECAG00000017604
VGNC: 23454
UniProt: F6RQQ4
LinkDB
Position
15
AA seq 1333 aa
MQAQQLPYEFFSEENAPKWRGLLVPALKKVQGQVHPTLESSDDALQYVEELILQLLNMLC
QAQPRSASDVEERVQKSFPHPIDKWAIADAQSAIEKRKRRNPLSLPVEKIHPLLKEVLGY
KIDHQVSVYIVAVLEYISADILKLVGNYVRNIRHYEITKQDIKVAMCADKVLMDMFHQDV
EDINILSLTDEEPSTSGEQTYYDLVKAFMAEIRQYIRELNLIIKVFREPFVSNSKLFSAN
DVENIFSRIVDIHELSVKLLGHIEDTVEMTDEGSPHPLVGSCFEDLAEELAFDPYESYAR
DILRPGFHDHFLSQLSKPGAALYLQSIGEGFKEAVQYVLPRLLLAPVYHCLHYFELLKQL
EEKSEDQEDKECLKQAITALLNVQSGMEKICSKSLAKRRLSESACRFYSQQMKGKQLAIK
KMNEIQKNIDGWEGKDIGQCCNEFIMEGTLTRVGAKHERHIFLFDGLMICCKSNHGQPRL
PGASNAEYRLKEKFFMRKVQINDKDDTSEYKHAFEIILKDENSVIFSAKSAEEKNNWMAA
LISLQYRSTLERMLDVTMLQEEKEEQMRLPSADVYRFAEPDSEENIIFEENMQPKAGIPI
IKAGTVIKLIERLTYHMYADPNFVRTFLTTYRSFCKPQELLSLIIERFEIPEPEPTEADR
IAIENGDQPLSAELKRFRKEYIQPVQLRVLNVCRHWVEHHFYDFERDADLLQRMEEFIGT
VRGKAMKKWVESITKIIQRKKIARDNGPGHNITFQSSPPTVEWHISRPGHIETFDLLTLH
PIEIARQLTLLESDLYRAVQPSELVGSVWTKEDKEINSPNLLKMIRHTTNLTLWFEKCIV
ETENLEERVAVVNRIIEILQVFQELNNFNGVLEVVSAMNSSPVYRLDHTFEQIPSRQKKI
LEEAHELSEDHYKKYLAKLRSINPPCVPFFGIYLTNILKTEEGNPEVLKRHGKDLINFSK
RRKVAEITGEIQQYQNQPYCLRVESDIKRFFENLNPMGNSMEKEFTDYLFNKSLEIEPRN
PKPLPRFPKKYSYPLKSPGVRPSNPRPGTMRHPTPLQQEPRKISYSRIPESETESTASAP
NSPRTPLTPPPASGASSTTDVCSVFDSDHSSPFHSSSDTVFIQVTLPHGPRSASVSSISL
TKGTDEVPVPPPVPPRRRPESAPAESSPSKIMSKHLDSPPAIPPRQPTSKAYSPRYSISE
RTSISDPPESPPLLPPREPVRTPDVFSSSPLHLQPPPLGKKSDHGNAFFPNSPSPFTPPP
PQTPSPHGTRRHLPSPPLTQEVDLHSIAGPPVPPRQSTSQHIPKLPPKTYKREHTHPSMH
RDGPPLLENAHSS
NT seq 4002 nt   +upstreamnt  +downstreamnt
atgcaggcgcagcagctgccgtacgagtttttcagcgaggagaacgcgcccaagtggcgg
gggctgctggtgccggcgctgaaaaaggtccaggggcaagttcatcctactcttgagtct
agtgatgatgctcttcagtatgttgaggaattaattttgcagttattaaatatgctatgc
caagctcagcctcgaagtgcttcagatgtagaggaacgtgttcaaaaaagtttccctcat
ccgattgataagtgggcaatagctgatgcccagtcagctattgaaaagaggaagcgaaga
aaccctttatctctcccagtagaaaaaattcatcctttattaaaggaggtcctaggttat
aaaattgaccaccaggtttctgtttacatagtagcagtattagagtatatttctgcagac
attttaaagctggtagggaattatgtacggaatatacggcattatgaaattacaaaacaa
gatattaaagtggcaatgtgtgctgataaggtattgatggatatgtttcatcaagatgta
gaagatataaatatattatctttaactgatgaagaaccttccacctcaggagagcaaact
tattatgatttggtaaaagcatttatggcagaaattcgacaatacataagggaactaaat
ttaattataaaagtttttagagagccctttgtctccaattcaaaattgttttcagctaat
gatgtagaaaatatatttagtcgtatagtagatatacacgaacttagtgtaaagttactg
ggccacatagaagatactgtagaaatgacagatgaaggcagtccccatccattagtagga
agctgctttgaagacttagcagaggaactggcatttgatccatatgaatcatatgctcga
gatattttacgacctggctttcatgatcatttccttagtcagctatcaaagcctggagca
gcactctatttgcagtcaataggcgaaggtttcaaagaagctgttcagtatgttttaccc
aggctacttctagcccctgtttaccactgtctacattactttgaacttttgaagcagtta
gaagaaaagagtgaagatcaagaagacaaggaatgtttgaaacaagcaataacagctttg
cttaatgttcagagtggtatggaaaaaatatgttctaaaagtcttgcaaaacgaagactg
agtgaatctgcatgtcggttttatagtcagcaaatgaaggggaaacaactagcaatcaag
aaaatgaacgagattcagaagaatattgatggatgggagggaaaagacattggacagtgt
tgcaatgagtttataatggaaggaactcttacacgtgtaggagccaaacatgagagacac
atatttctctttgatggcttaatgatttgctgtaaatcgaatcatgggcaaccaagactt
cctggtgctagcaatgcagaatatcgtcttaaagaaaagttttttatgagaaaggtacaa
attaatgacaaagatgacaccagtgagtacaagcatgcttttgaaataattttaaaagat
gaaaatagtgttatattttctgccaagtcagctgaagagaaaaacaattggatggcagca
ttgatatctttacagtaccggagtacactggaaaggatgcttgatgtgacaatgctacag
gaggagaaggaggagcagatgaggctccctagtgctgatgtttatagatttgcagagccc
gactctgaagagaatattatatttgaagaaaacatgcagcccaaggctggaattccaatt
atcaaagcaggaactgttattaaacttatagagaggctcacataccacatgtatgcagat
cccaattttgttcggacatttcttacaacatacagatccttttgtaaacctcaagaactg
ctgagtcttataatagaaaggtttgaaattccagagcctgagccaacagaagctgatcgc
atagctatagagaatggagatcaacccttgagtgcagaactaaaaaggtttagaaaagag
tatatacagcctgtacaactgcgagtattaaatgtatgtcggcactgggtagaacaccac
ttctatgattttgaaagagatgcagatcttttgcagcgaatggaagaatttattggaaca
gtaagaggtaaagcaatgaaaaaatgggtcgaatccatcactaaaataatccaaaggaaa
aaaattgcaagagacaatggaccaggtcataatattacatttcagagttcacctcccaca
gttgagtggcatataagcagacctgggcacatagagacttttgacctgctcaccttacac
ccaatagaaattgctcgacaacttactttacttgaatcagatctgtatcgagctgtacag
ccatcagaattagttggaagtgtgtggacaaaagaagacaaagaaattaattctcctaat
cttctgaaaatgatccggcacaccactaatctcactttgtggtttgagaaatgtattgta
gaaactgaaaacttagaagaaagagtagctgtggtgaatcgaataattgagattctgcaa
gtctttcaagagctgaacaacttcaatggcgtccttgaggttgttagtgctatgaactca
tcacctgtttacagactagaccacacgtttgagcaaataccaagtcgccaaaagaaaatt
ttagaagaagctcatgaattaagtgaagatcactataagaaatatttggcaaaactcagg
tctattaatccaccatgtgtgcctttctttggaatttatctaactaatatcttgaaaaca
gaagaaggcaaccctgaggtcctgaaaaggcatggaaaagaccttataaactttagcaaa
aggaggaaggtagcagaaattacaggagagatccagcagtaccaaaatcagccttattgt
ttacgagtagaatcggatatcaaaaggttctttgaaaacttgaatccaatgggaaatagc
atggagaaagaatttacagattatcttttcaacaaatccctagaaatagaaccacgaaat
cctaagcctctcccaagatttccaaaaaaatatagctatcccctaaaatctcctggtgtt
cgtccatcaaacccaagaccaggtaccatgagacatcccacacctctgcagcaggagcca
aggaaaattagttatagtaggatccctgaaagtgaaacagagagtacagcatctgcacca
aattctccaagaacaccgttaacacctcctcctgcttctggtgcttccagtaccacagat
gtttgcagcgtatttgattctgatcattcgagcccttttcactcaagcagcgataccgtc
tttatccaagttacactgccccatggcccaagatctgcttcagtatcatctataagttta
accaaaggcactgatgaagtgcctgtcccccctcctgttcctccacgaagacgaccagaa
tctgccccagcggaatcttcgccatctaagattatgtctaagcatttggacagcccccca
gcaattcctcctaggcaacccacatcaaaagcctattcaccacgctattcaatatcagag
cggacctctatatcagaccctcctgagagccctcccttattaccaccacgagaacctgtg
aggacacctgatgttttctcaagctcaccactacatctccaacctccccctttgggcaaa
aaaagtgaccatggcaatgccttcttcccaaacagcccttctccctttacaccacctcct
cctcaaacaccttctcctcacggcacgagaaggcatctgccgtcaccaccattgacacaa
gaagtggaccttcattccattgctgggccgcccgttcctcctcgacaaagcacttctcag
catatccctaaactccctcccaaaacttacaaaagggagcacacacacccatccatgcac
agagatggaccaccactgttggagaatgcccattcttcctga

KEGG   Equus caballus (horse): 100066282
Entry
100066282         CDS       T01058                                 

Gene name
SOS2
Definition
(RefSeq) son of sevenless homolog 2 isoform X1
  KO
K03099  son of sevenless
Organism
ecb  Equus caballus (horse)
Pathway
ecb01521  EGFR tyrosine kinase inhibitor resistance
ecb01522  Endocrine resistance
ecb04010  MAPK signaling pathway
ecb04012  ErbB signaling pathway
ecb04014  Ras signaling pathway
ecb04062  Chemokine signaling pathway
ecb04068  FoxO signaling pathway
ecb04072  Phospholipase D signaling pathway
ecb04150  mTOR signaling pathway
ecb04151  PI3K-Akt signaling pathway
ecb04510  Focal adhesion
ecb04540  Gap junction
ecb04630  JAK-STAT signaling pathway
ecb04650  Natural killer cell mediated cytotoxicity
ecb04660  T cell receptor signaling pathway
ecb04662  B cell receptor signaling pathway
ecb04664  Fc epsilon RI signaling pathway
ecb04714  Thermogenesis
ecb04722  Neurotrophin signaling pathway
ecb04810  Regulation of actin cytoskeleton
ecb04910  Insulin signaling pathway
ecb04912  GnRH signaling pathway
ecb04915  Estrogen signaling pathway
ecb04917  Prolactin signaling pathway
ecb04926  Relaxin signaling pathway
ecb04935  Growth hormone synthesis, secretion and action
ecb05034  Alcoholism
ecb05160  Hepatitis C
ecb05161  Hepatitis B
ecb05163  Human cytomegalovirus infection
ecb05165  Human papillomavirus infection
ecb05200  Pathways in cancer
ecb05205  Proteoglycans in cancer
ecb05206  MicroRNAs in cancer
ecb05210  Colorectal cancer
ecb05211  Renal cell carcinoma
ecb05213  Endometrial cancer
ecb05214  Glioma
ecb05215  Prostate cancer
ecb05220  Chronic myeloid leukemia
ecb05221  Acute myeloid leukemia
ecb05223  Non-small cell lung cancer
ecb05224  Breast cancer
ecb05225  Hepatocellular carcinoma
ecb05226  Gastric cancer
ecb05231  Choline metabolism in cancer
Brite
KEGG Orthology (KO) [BR:ecb00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04010 MAPK signaling pathway
    100066282 (SOS2)
   04012 ErbB signaling pathway
    100066282 (SOS2)
   04014 Ras signaling pathway
    100066282 (SOS2)
   04630 JAK-STAT signaling pathway
    100066282 (SOS2)
   04068 FoxO signaling pathway
    100066282 (SOS2)
   04072 Phospholipase D signaling pathway
    100066282 (SOS2)
   04151 PI3K-Akt signaling pathway
    100066282 (SOS2)
   04150 mTOR signaling pathway
    100066282 (SOS2)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    100066282 (SOS2)
   04540 Gap junction
    100066282 (SOS2)
  09142 Cell motility
   04810 Regulation of actin cytoskeleton
    100066282 (SOS2)
 09150 Organismal Systems
  09151 Immune system
   04650 Natural killer cell mediated cytotoxicity
    100066282 (SOS2)
   04660 T cell receptor signaling pathway
    100066282 (SOS2)
   04662 B cell receptor signaling pathway
    100066282 (SOS2)
   04664 Fc epsilon RI signaling pathway
    100066282 (SOS2)
   04062 Chemokine signaling pathway
    100066282 (SOS2)
  09152 Endocrine system
   04910 Insulin signaling pathway
    100066282 (SOS2)
   04912 GnRH signaling pathway
    100066282 (SOS2)
   04915 Estrogen signaling pathway
    100066282 (SOS2)
   04917 Prolactin signaling pathway
    100066282 (SOS2)
   04926 Relaxin signaling pathway
    100066282 (SOS2)
   04935 Growth hormone synthesis, secretion and action
    100066282 (SOS2)
  09156 Nervous system
   04722 Neurotrophin signaling pathway
    100066282 (SOS2)
  09159 Environmental adaptation
   04714 Thermogenesis
    100066282 (SOS2)
 09160 Human Diseases
  09161 Cancer: overview
   05200 Pathways in cancer
    100066282 (SOS2)
   05206 MicroRNAs in cancer
    100066282 (SOS2)
   05205 Proteoglycans in cancer
    100066282 (SOS2)
   05231 Choline metabolism in cancer
    100066282 (SOS2)
  09162 Cancer: specific types
   05210 Colorectal cancer
    100066282 (SOS2)
   05225 Hepatocellular carcinoma
    100066282 (SOS2)
   05226 Gastric cancer
    100066282 (SOS2)
   05214 Glioma
    100066282 (SOS2)
   05221 Acute myeloid leukemia
    100066282 (SOS2)
   05220 Chronic myeloid leukemia
    100066282 (SOS2)
   05211 Renal cell carcinoma
    100066282 (SOS2)
   05215 Prostate cancer
    100066282 (SOS2)
   05213 Endometrial cancer
    100066282 (SOS2)
   05224 Breast cancer
    100066282 (SOS2)
   05223 Non-small cell lung cancer
    100066282 (SOS2)
  09165 Substance dependence
   05034 Alcoholism
    100066282 (SOS2)
  09172 Infectious disease: viral
   05161 Hepatitis B
    100066282 (SOS2)
   05160 Hepatitis C
    100066282 (SOS2)
   05163 Human cytomegalovirus infection
    100066282 (SOS2)
   05165 Human papillomavirus infection
    100066282 (SOS2)
  09176 Drug resistance: antineoplastic
   01521 EGFR tyrosine kinase inhibitor resistance
    100066282 (SOS2)
   01522 Endocrine resistance
    100066282 (SOS2)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   04990 Domain-containing proteins not elsewhere classified [BR:ecb04990]
    100066282 (SOS2)
Domain-containing proteins not elsewhere classified [BR:ecb04990]
 Pleckstrin homology (PH) domain-containing proteins
  Dbl-Like RhoGEF family proteins
   100066282 (SOS2)
SSDB
Motif
Pfam: RasGEF RasGEF_N RhoGEF PH Histone PH_19 IQ_SEC7_PH PH_13
Other DBs
NCBI-GeneID: 100066282
NCBI-ProteinID: XP_023480621
Ensembl: ENSECAG00000004420
VGNC: 23455
LinkDB
Position
1
AA seq 1332 aa
MQQAPQPYEFFSEENSPRWRGLLVPALRKVQEQVHPNLSANEESLYYIEELIFQLLNKLC
MAQPRTVQDVEERVQKTFPHPIDKWAIADAQSAIEKRKRRHPLLLPVDKIHPSLKEVLGY
KVDYHVSLYIVAVLEYISADILKLAGNYVFNIRHYEISQQDIKVSMCADKVLMDMFDQDD
IGLVSLCEDEPSSSGELNYYDLVRTEIAEERQYLRELNMIIKVFREAFLSDRKLFKPSDI
EKIFSNILDIHELTVKLLGLIEDTVEMTDESSPHPLAGSCFEDLAEEQAFDPYETLSQDI
LSPKFNEHFSKLMARPAVALHFQSIADGFREAVRYVLPRLMLVPVYHCWHYFELLKQLKA
CSEEQEDRECLNQAITALMNLQGSMDRIYKQYSPRRRPGDPVCPFYNRQLRSKHLAIKKM
NEIQKNIDGWEGKDIGQCCNEFIMEGPLTKIGAKHERHIFLFDGLMISCKPNHSQSRLPG
YSSAEYRLKEKFVMRKIQICDKEDTCECRHAFELVSKDENSITFAAKSAEEKNNWMAALI
SLHYRSTLDRMLDSVLLKEENEQPLRLPSPEMYRFVVKDSEENIVFEDSLQSRSGIPIIK
GGTVVKLIERLTYHMYADPNFVRTFLTTYRSFCKPQELLSLLIERFEIPEPEPTEADKLA
VEKGEQPISADLKRFRKEYVQPVQLRILNVFRHWVEHHFYDFERDLELLERLESFISSVR
GKAMKKWVESIAKIIKRKKQTQANGISHNITFESPPPPIEWHISRPGQFETFDLMTLHPI
EIARQLTLLESDLYRKVQPSELVGSVWTKEDKEINSPNLLKMIRHTTNLTLWFEKCIVEA
ENFEERVAVLSRVIEILQVFQDLNNFNGVLEIVSAVNSVSVYRLDHTFEALQERKRRILD
EAVELSQDHFKKYLVKLKSINPPCVPFFGIYLTNILKTEEGNNDFLKKKGKDLINFSKRR
KVAEITGEIQQYQNQPYCLRIEPEMRRFFENLNPMGSSSEKEFTDYLFNKSLEIEPRNCK
QPPRFPRKSTFSLKSPGIRPNTGRHGSTSGTLRGHPTPLEREPCKISFSRIAEAELGSAV
SAPTSPNTPSTPPASAASDLSVFPDVDLSASCGSNSIFAPVLLPHSKSFFSSCGSLHKLS
DEPLIPPPLPPRKKFDHDASNPKGNMKSDDDPPAIPPRQPPPPKVKPRVPAPSGAFEGSL
HSPPPPPPREPLPDTPPPVPLRPPEHFINCPFNLQPPPLGHLHRDPDWFRDVSTRPDSPN
TPPSTPSPRVPRRCCVLSSSHSNLTYPQAPPVPPRQNSSPHLPKLPPKTYKRELSHPPMY
RLSLLENAETPQ
NT seq 3999 nt   +upstreamnt  +downstreamnt
atgcagcaggcgccgcagccgtacgagttcttcagcgaggagaacagcccgagatggcgg
gggctgctggtgccggccctgcggaaggttcaggagcaggtacatcccaatctctcagct
aatgaagaatctctctattacattgaagagctgatttttcaactgcttaataaattatgc
atggctcagccaaggactgttcaagatgtggaggaacgagttcaaaagacctttccgcat
ccaattgataaatgggctattgctgatgcacaatccgccatagagaaacgaaaacgaaga
caccctctcttactgcctgtggacaaaatccatccttcattgaaggaggttttagggtac
aaagtggactaccatgtgtccctgtatattgtggctgtactggagtatatctcagctgat
attttaaaattggctggtaattatgtttttaatattcgacattatgaaatatctcagcag
gacattaaagtgtcaatgtgtgcagataaggttttgatggacatgtttgatcaggatgac
ataggtttggtttctctctgtgaagatgagccgagttcttcaggtgaattgaattactat
gaccttgtcagaactgaaattgcagaagaaagacagtatctccgggaactaaatatgatc
ataaaagtatttcgagaagcctttctttctgacagaaagctgtttaaaccttctgatatt
gagaagattttcagtaacattttagatatacatgaattgactgtgaaacttctaggtttg
attgaggacacagttgagatgactgatgaaagcagtcctcatcccttagctggcagctgt
ttcgaagatttggcagaagagcaggcttttgatccttatgaaacattatcacaggacatc
ctttcaccaaaattcaatgaacatttcagtaagttgatggccagacctgcagtggctcta
cactttcagtccattgctgatgggttcagagaggcggttcgttatgtccttccacgcctg
atgcttgtgccagtgtatcattgttggcactacttcgaattattaaagcaattgaaagca
tgtagtgaagaacaagaagacagagaatgtttgaaccaagctattactgctctcatgaat
ctccagggtagtatggaccgaatttataagcagtattcgcctagacgtcgacctggggat
cctgtttgccctttttataatcgtcaattaagaagcaagcacctggctattaaaaaaatg
aatgaaattcagaaaaacatagatggatgggaaggcaaagacatcggacagtgttgtaat
gaattcattatggaaggtcctttgacaaaaattggtgctaaacatgaacggcatattttt
ctctttgatggcttaatgattagctgcaaacccaatcatagccagtcgcggcttccaggg
tacagtagtgcagaatacagactgaaagagaagtttgtcatgaggaaaatacaaatatgt
gataaagaagacacttgcgagtgcagacatgcttttgaattagtctccaaagatgaaaac
agcatcacatttgctgccaagtctgccgaagagaagaataactggatggcagcactcatt
tctcttcattaccgtagcacactggaccgaatgctggactcagtgttactgaaggaagaa
aatgagcagccactgaggttaccaagtcctgagatgtatcgtttcgtggtgaaagactct
gaggaaaacatagtttttgaagacagcttgcaaagtagaagtggaatccccattattaaa
ggaggaactgtggtgaaattaattgaaaggttgacatatcatatgtatgcagatcccaat
tttgttcgtacttttcttactacatatcgttcattttgtaaaccacaggaattgctaagc
ttactaattgaacgatttgaaattccagagccagaacctactgaagcagataaattggca
gtagagaaaggcgagcagcccatcagtgcagaccttaaaaggtttcgcaaggaatacgtc
caaccagtacaacttaggatcttaaatgtgtttcggcactgggttgaacaccatttttat
gattttgaaagagatttggagttgcttgaacggctagaatccttcatttcaagtgtaaga
gggaaagctatgaagaaatgggtagagtcaattgctaagatcatcaagaggaagaaacag
actcaggcaaacggaataagccataatatcacctttgaaagcccaccccccccgattgaa
tggcacatcagcagacccgggcagttcgagacctttgatctcatgacacttcatccaata
gaaattgcacgccagctgacgcttttggaatctgatctctataggaaagtccagccttct
gaacttgtagggagtgtgtggaccaaagaagataaagaaataaattctccaaatttatta
aaaatgattcgccacaccacaaatctcaccctctggtttgaaaagtgcattgtggaagca
gaaaactttgaggaacgagtggcagtactaagtagagttatagaaattctgcaagttttt
caagatttgaataatttcaatggcgtactggagatagtcagtgcagtgaattcagtgtca
gtgtacagactagaccataccttcgaggcattgcaggaaagaaaaaggagaattttggat
gaagctgtggaattaagtcaagatcactttaaaaaatacctagtaaaacttaagtcaatc
aatccaccttgtgtgcctttttttggaatatatttaacaaatattctgaagactgaagaa
gggaataatgattttttaaaaaagaaagggaaagatttaatcaatttcagtaagaggagg
aaagtagctgaaattactggagaaattcagcagtatcagaatcaaccttactgtttacgg
atagaaccagagatgaggcggttctttgaaaaccttaaccccatgggaagttcttctgaa
aaagagtttacggattatttgttcaacaaatcactagaaattgaaccccgcaactgcaaa
cagccacctcgatttcctaggaaatcaactttctctttaaaatctcctggaataaggcct
aacacgggccgacatggctctacctcaggcactttacgaggtcatcccaccccgttagaa
agagaaccgtgtaaaatcagctttagtcggattgctgaggctgagctgggatcagcagtg
tcggcaccaacctctcccaacacgccgtccaccccgccggcgtctgctgcttcagacctc
agtgtgttcccagacgtggacctcagcgcttcctgtggcagcaatagcatctttgctcca
gtcctcttgccacactcaaagtccttcttcagttcatgcggtagtttacataaactaagt
gacgagcccctgattcctcctccgcttcctcctcggaagaagtttgatcacgatgcttcg
aatcccaagggaaatatgaaatctgatgatgaccctcctgctattccaccaagacaacct
cctcctccaaaggtgaaacccagagttcccgctccttctggtgcatttgaggggtctctg
cacagcccacctccaccgccgcccagagagcctcttcctgacacgcctccaccggttccc
cttcggcctccagaacactttataaactgtccgtttaatcttcagccacctcccctggga
catcttcacagagatccagactggttcagagacgttagcacacgaccagattcgcccaac
actcctcccagcacaccgtctccacgggtgccacgtcgatgctgtgtgctcagttctagt
cacagtaatctcacttatcctcaagctccccctgttccaccaaggcagaattcaagccct
cacctaccaaaactgccaccaaagacttacaaacgggagctttcgcaccccccaatgtac
agactgtctttgctagaaaacgcggaaactcctcaatga

DBGET integrated database retrieval system