KEGG   Gorilla gorilla gorilla (western lowland gorilla): 101130824
Entry
101130824         CDS       T02442                                 

Gene name
SOS1
Definition
(RefSeq) son of sevenless homolog 1 isoform X1
  KO
K03099  son of sevenless
Organism
ggo  Gorilla gorilla gorilla (western lowland gorilla)
Pathway
ggo01521  EGFR tyrosine kinase inhibitor resistance
ggo01522  Endocrine resistance
ggo04010  MAPK signaling pathway
ggo04012  ErbB signaling pathway
ggo04014  Ras signaling pathway
ggo04062  Chemokine signaling pathway
ggo04068  FoxO signaling pathway
ggo04072  Phospholipase D signaling pathway
ggo04150  mTOR signaling pathway
ggo04151  PI3K-Akt signaling pathway
ggo04510  Focal adhesion
ggo04540  Gap junction
ggo04630  JAK-STAT signaling pathway
ggo04650  Natural killer cell mediated cytotoxicity
ggo04660  T cell receptor signaling pathway
ggo04662  B cell receptor signaling pathway
ggo04664  Fc epsilon RI signaling pathway
ggo04714  Thermogenesis
ggo04722  Neurotrophin signaling pathway
ggo04810  Regulation of actin cytoskeleton
ggo04910  Insulin signaling pathway
ggo04912  GnRH signaling pathway
ggo04915  Estrogen signaling pathway
ggo04917  Prolactin signaling pathway
ggo04926  Relaxin signaling pathway
ggo04935  Growth hormone synthesis, secretion and action
ggo05034  Alcoholism
ggo05160  Hepatitis C
ggo05161  Hepatitis B
ggo05163  Human cytomegalovirus infection
ggo05165  Human papillomavirus infection
ggo05200  Pathways in cancer
ggo05205  Proteoglycans in cancer
ggo05206  MicroRNAs in cancer
ggo05210  Colorectal cancer
ggo05211  Renal cell carcinoma
ggo05213  Endometrial cancer
ggo05214  Glioma
ggo05215  Prostate cancer
ggo05220  Chronic myeloid leukemia
ggo05221  Acute myeloid leukemia
ggo05223  Non-small cell lung cancer
ggo05224  Breast cancer
ggo05225  Hepatocellular carcinoma
ggo05226  Gastric cancer
ggo05231  Choline metabolism in cancer
Brite
KEGG Orthology (KO) [BR:ggo00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04010 MAPK signaling pathway
    101130824 (SOS1)
   04012 ErbB signaling pathway
    101130824 (SOS1)
   04014 Ras signaling pathway
    101130824 (SOS1)
   04630 JAK-STAT signaling pathway
    101130824 (SOS1)
   04068 FoxO signaling pathway
    101130824 (SOS1)
   04072 Phospholipase D signaling pathway
    101130824 (SOS1)
   04151 PI3K-Akt signaling pathway
    101130824 (SOS1)
   04150 mTOR signaling pathway
    101130824 (SOS1)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    101130824 (SOS1)
   04540 Gap junction
    101130824 (SOS1)
  09142 Cell motility
   04810 Regulation of actin cytoskeleton
    101130824 (SOS1)
 09150 Organismal Systems
  09151 Immune system
   04650 Natural killer cell mediated cytotoxicity
    101130824 (SOS1)
   04660 T cell receptor signaling pathway
    101130824 (SOS1)
   04662 B cell receptor signaling pathway
    101130824 (SOS1)
   04664 Fc epsilon RI signaling pathway
    101130824 (SOS1)
   04062 Chemokine signaling pathway
    101130824 (SOS1)
  09152 Endocrine system
   04910 Insulin signaling pathway
    101130824 (SOS1)
   04912 GnRH signaling pathway
    101130824 (SOS1)
   04915 Estrogen signaling pathway
    101130824 (SOS1)
   04917 Prolactin signaling pathway
    101130824 (SOS1)
   04926 Relaxin signaling pathway
    101130824 (SOS1)
   04935 Growth hormone synthesis, secretion and action
    101130824 (SOS1)
  09156 Nervous system
   04722 Neurotrophin signaling pathway
    101130824 (SOS1)
  09159 Environmental adaptation
   04714 Thermogenesis
    101130824 (SOS1)
 09160 Human Diseases
  09161 Cancer: overview
   05200 Pathways in cancer
    101130824 (SOS1)
   05206 MicroRNAs in cancer
    101130824 (SOS1)
   05205 Proteoglycans in cancer
    101130824 (SOS1)
   05231 Choline metabolism in cancer
    101130824 (SOS1)
  09162 Cancer: specific types
   05210 Colorectal cancer
    101130824 (SOS1)
   05225 Hepatocellular carcinoma
    101130824 (SOS1)
   05226 Gastric cancer
    101130824 (SOS1)
   05214 Glioma
    101130824 (SOS1)
   05221 Acute myeloid leukemia
    101130824 (SOS1)
   05220 Chronic myeloid leukemia
    101130824 (SOS1)
   05211 Renal cell carcinoma
    101130824 (SOS1)
   05215 Prostate cancer
    101130824 (SOS1)
   05213 Endometrial cancer
    101130824 (SOS1)
   05224 Breast cancer
    101130824 (SOS1)
   05223 Non-small cell lung cancer
    101130824 (SOS1)
  09165 Substance dependence
   05034 Alcoholism
    101130824 (SOS1)
  09172 Infectious disease: viral
   05161 Hepatitis B
    101130824 (SOS1)
   05160 Hepatitis C
    101130824 (SOS1)
   05163 Human cytomegalovirus infection
    101130824 (SOS1)
   05165 Human papillomavirus infection
    101130824 (SOS1)
  09176 Drug resistance: antineoplastic
   01521 EGFR tyrosine kinase inhibitor resistance
    101130824 (SOS1)
   01522 Endocrine resistance
    101130824 (SOS1)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   04990 Domain-containing proteins not elsewhere classified [BR:ggo04990]
    101130824 (SOS1)
Domain-containing proteins not elsewhere classified [BR:ggo04990]
 Pleckstrin homology (PH) domain-containing proteins
  Dbl-Like RhoGEF family proteins
   101130824 (SOS1)
SSDB
Motif
Pfam: RasGEF RasGEF_N RhoGEF Histone PH PH_19 PH_10 IQ_SEC7_PH PH_13
Other DBs
NCBI-GeneID: 101130824
NCBI-ProteinID: XP_004029160
Ensembl: ENSGGOG00000026950
UniProt: G3S477
LinkDB
Position
2A
AA seq 1333 aa
MQAQQLPYEFFSEENAPKWRGLLVPALKKVQGQVHPTLESNDDALQYVEELILQLLNMLC
QAQPRSASDVEERVQKSFPHPIDKWAIADAQSAIEKRKRRNPLSLPVEKIHPLLKEVLGY
KIDHQVSVYIVAVLEYISADILKLVGNYVRNIRHYEITKQDIKVAMCADKVLMDMFHQDV
EDINILSLTDEEPSTSGEQTYYDLVKAFMAEIRQYIRELNLIIKVFREPFVSNSKLFSAN
DVENIFSRIVDIHELSVKLLGHIEDTVEMTDEGSPHPLVGSCFEDLAEELAFDPYESYAR
DILRPGFHDRFLSQLSKPGAALYLQSIGEGFKEAVQYVLPRLLLAPVYHCLHYFELLKQL
EEKSEDQEDKECLKQAITALLNVQSGMEKICSKSLAKRRLSESACRFYSQQMKGKQLAIK
KMNEIQKNIDGWEGKDIGQCCNEFIMEGTLTRVGAKHERHIFLFDGLMICCKSNHGQPRL
PGASNAEYRLKEKFFMRKVQINDKDDTNEYKHAFEIILKDENSVIFSAKSAEEKNNWMAA
LISLQYRSTLERMLDVTMLQEEKEEQMRLPSADVYRFAEPDSEENIIFEENMQPKAGIPI
IKAGTVIKLIERLTYHMYADPNFVRTFLTTYRSFCKPQELLSLIIERFEIPEPEPTEADR
IAIENGDQPLSAELKRFRKEYIQPVQLRVLNVCRHWVEHHFYDFERDAYLLQRMEEFIGT
VRGKAMKKWVESITKIIQRKKIARDNGPGHNITFQSSPPTVEWHISRPGHIETFDLLTLH
PIEIARQLTLLESDLYRAVQPSELVGSVWTKEDKEINSPNLLKMIRHTTNLTLWFEKCIV
ETENLEERVAVVSRIIEILQVFQELNNFNGVLEVVSAMNSSPVYRLDHTFEQIPSRQKKI
LEEAHELSEDHYKKYLAKLRSINPPCVPFFGIYLTNILKTEEGNPEVLKRHGKELINFSK
RRKVAEITGEIQQYQNQPYCLRVESDIKRFFENLNPMGNSMEKEFTDYLFNKSLEIEPRN
PKPLPRFPKKYSYPLKSPGVRPSNPRPGTMRHPTPLQQEPRKISYSRIPESETESTASAP
NSPRTPLTPPPASGASSTTDVCSVFDSDHSSPFHSSNDTVFIQVTLPHGPRSASVSSISL
TKGTDEVPVPPPVPPRRRPESAPAESSPSKIMSKHLDSPPAIPPRQPTSKAYSPRYSISD
RTSISDPPESPPLLPPREPVRTPDVFSSSPLHLQPPPLGKKSDHGNAFFPNSPSPFTPPP
PQTPSPHGTRRHLPSPPLTQEVDLHSIAGPPVPPRQSTSQHIPKLPPKTYKREHTHPSMH
RDGPPLLENAHSS
NT seq 4002 nt   +upstreamnt  +downstreamnt
atgcaggcgcagcagctgccctacgaatttttcagcgaagagaacgcgcccaagtggcgg
ggactgctggtgcctgcgctgaaaaaggtccaggggcaagttcatcctactctcgagtct
aatgatgatgctcttcagtatgtagaagaattaattttgcaattattaaatatgctatgc
caagctcagccccgaagtgcttcagatgtagaggaacgtgttcaaaaaagtttccctcat
ccaattgataaatgggcaatagctgatgcccaatcagctattgaaaagaggaagcgaaga
aaccctttatctctcccagtagaaaaaattcatcctttattaaaggaggtcctaggttat
aaaattgaccaccaggtttctgtttacatagtagcagtcttagaatacatttctgccgac
attttaaagctggttgggaattatgtaagaaatatacggcattatgaaattacaaaacaa
gatattaaagtggcaatgtgtgctgacaaggtattgatggatatgtttcatcaagatgta
gaagatattaatatattatctttaactgatgaagagccttccacctcaggagaacaaact
tactatgatttggtaaaagcatttatggcagaaattcgacaatatataagggaactaaat
ctaattataaaagtttttagagagccctttgtctccaattcaaaattgttttcagctaat
gatgtagaaaatatatttagtcgcatagtagatatacatgaacttagtgtaaagttactg
ggccatatagaagatacagtagaaatgacagatgaaggcagtccccatccactagtagga
agctgctttgaagacttagcagaggaactggcatttgatccatatgaatcgtatgctcga
gatattttgcgacctggttttcatgatcgtttccttagtcagttatcaaagcctggggca
gcactttatttgcagtcaataggcgaaggtttcaaagaagctgttcaatatgttttaccc
aggctgcttctggcccctgtttaccactgtctccattactttgaacttttgaagcagtta
gaagaaaaaagtgaagatcaagaagacaaggaatgtttaaaacaagcaataacagctttg
cttaatgttcagagtggtatggaaaaaatatgttctaaaagtcttgcaaaacgaagactg
agtgaatctgcatgtcggttttatagtcagcaaatgaaggggaaacaactagcaatcaag
aaaatgaacgagattcagaagaatattgatggttgggagggaaaagacattggacagtgt
tgtaatgaatttataatggaaggaactcttacacgtgtaggagccaaacatgagagacac
atatttctctttgatggcttaatgatttgctgtaaatcaaatcatgggcagccaagactt
cctggtgctagcaatgcagaatatcgtcttaaagaaaagttttttatgcgaaaggtacaa
attaatgataaagatgacaccaatgaatacaagcatgcttttgaaataattttaaaagat
gaaaatagtgttatattttctgccaagtcagctgaagagaaaaacaattggatggcagca
ttgatatctttacagtaccggagtacactggaaaggatgcttgatgtaacaatgctacag
gaagagaaagaggagcagatgaggctgcctagtgctgatgtttatagatttgcagagcct
gactctgaagagaatattatatttgaagagaacatgcagcccaaggctggaattccaatt
atcaaagcaggaactgttattaaacttatagagaggcttacgtaccatatgtacgcagat
cccaattttgttcggacatttcttacaacatacagatccttttgcaaacctcaagaacta
ctgagtcttataatagaaaggtttgaaattccagagcctgagccgacagaagctgatcgc
atagctatagagaatggagatcaacccttgagtgcagaactaaaaagatttagaaaagaa
tatatacagcctgtgcaactgcgagtattaaatgtatgtcggcactgggtagagcaccac
ttctatgattttgaaagagatgcatatcttttgcaacgaatggaagaatttattggaaca
gtaagaggtaaagcaatgaaaaaatgggttgaatccatcactaaaataatccaaaggaaa
aaaattgcaagagacaatggaccaggtcataatattacatttcagagttcacctcccaca
gttgagtggcatataagcagacctgggcacatagagacttttgacctgctcaccttacac
ccaatagaaattgctcgacaactcactttacttgaatcagatctataccgagctgtacag
ccatcagaattagttggaagtgtgtggacaaaagaagacaaagaaattaactctcctaat
cttctgaaaatgattcgacataccaccaacctcactctgtggtttgagaaatgtattgta
gaaactgaaaatttagaagaaagagtagctgtggtgagtcgaattattgagattctacaa
gtctttcaagagttgaacaactttaatggtgtccttgaggttgtcagtgctatgaattca
tcacctgtttacagactagaccacacatttgagcaaataccaagtcgccagaagaaaatt
ttagaagaagctcatgaattgagtgaagatcactataagaaatatttggcaaaactcagg
tctattaatccaccatgtgtgcctttctttggaatttatctcactaatatcttgaaaaca
gaagaaggcaaccctgaggtcctaaaaagacatggaaaagagcttataaactttagcaaa
aggaggaaagtagcagaaataacaggagagatccagcagtaccaaaatcagccttactgt
ttacgagtagaatcagatatcaaaaggttctttgaaaacttgaatccgatgggaaatagc
atggagaaggaatttacagattatcttttcaacaaatccctagaaatagaaccacgaaac
cctaagcctctcccaagatttccaaaaaaatatagctatcccctaaaatctcctggtgtt
cgtccatcaaacccaagaccaggtaccatgaggcatcccacacctctgcagcaggagcca
aggaaaattagttatagtaggatccctgaaagtgaaacagaaagtacagcatctgcacca
aattctccaagaacaccgttaacacctccgcctgcttctggtgcttccagtaccacagat
gtttgcagtgtatttgattccgatcattcgagcccttttcactcaagcaatgataccgtc
tttatccaagttactctgccccatggcccaagatctgcttctgtatcatctataagttta
accaaaggcactgatgaagtgcctgtccctcctcctgttcctccacgaagacgaccagaa
tctgccccagcagaatcttcaccatctaagattatgtctaagcatttggacagtccccca
gccattcctcctaggcaacccacatcaaaagcctattcaccacgatattcaatatcagac
cggacctctatctcagaccctcctgaaagccctcccttattaccaccacgagaacctgtg
aggacacctgatgttttctcaagctcaccactacatctccaacctccccctttgggcaaa
aaaagtgaccatggcaatgccttcttcccaaacagcccttccccctttacaccacctcct
cctcaaacaccttctcctcatggcacaagaaggcatctgccatcaccaccattgacacaa
gaagtggaccttcattccattgctgggccgcctgttcctccacgacaaagcacttctcaa
catatccctaaactccctccaaaaacttacaaaagggagcacacacacccatccatgcac
agagatggaccaccactgttggagaatgcccattcttcctga

KEGG   Gorilla gorilla gorilla (western lowland gorilla): 101137539
Entry
101137539         CDS       T02442                                 

Gene name
SOS2
Definition
(RefSeq) son of sevenless homolog 2
  KO
K03099  son of sevenless
Organism
ggo  Gorilla gorilla gorilla (western lowland gorilla)
Pathway
ggo01521  EGFR tyrosine kinase inhibitor resistance
ggo01522  Endocrine resistance
ggo04010  MAPK signaling pathway
ggo04012  ErbB signaling pathway
ggo04014  Ras signaling pathway
ggo04062  Chemokine signaling pathway
ggo04068  FoxO signaling pathway
ggo04072  Phospholipase D signaling pathway
ggo04150  mTOR signaling pathway
ggo04151  PI3K-Akt signaling pathway
ggo04510  Focal adhesion
ggo04540  Gap junction
ggo04630  JAK-STAT signaling pathway
ggo04650  Natural killer cell mediated cytotoxicity
ggo04660  T cell receptor signaling pathway
ggo04662  B cell receptor signaling pathway
ggo04664  Fc epsilon RI signaling pathway
ggo04714  Thermogenesis
ggo04722  Neurotrophin signaling pathway
ggo04810  Regulation of actin cytoskeleton
ggo04910  Insulin signaling pathway
ggo04912  GnRH signaling pathway
ggo04915  Estrogen signaling pathway
ggo04917  Prolactin signaling pathway
ggo04926  Relaxin signaling pathway
ggo04935  Growth hormone synthesis, secretion and action
ggo05034  Alcoholism
ggo05160  Hepatitis C
ggo05161  Hepatitis B
ggo05163  Human cytomegalovirus infection
ggo05165  Human papillomavirus infection
ggo05200  Pathways in cancer
ggo05205  Proteoglycans in cancer
ggo05206  MicroRNAs in cancer
ggo05210  Colorectal cancer
ggo05211  Renal cell carcinoma
ggo05213  Endometrial cancer
ggo05214  Glioma
ggo05215  Prostate cancer
ggo05220  Chronic myeloid leukemia
ggo05221  Acute myeloid leukemia
ggo05223  Non-small cell lung cancer
ggo05224  Breast cancer
ggo05225  Hepatocellular carcinoma
ggo05226  Gastric cancer
ggo05231  Choline metabolism in cancer
Brite
KEGG Orthology (KO) [BR:ggo00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04010 MAPK signaling pathway
    101137539 (SOS2)
   04012 ErbB signaling pathway
    101137539 (SOS2)
   04014 Ras signaling pathway
    101137539 (SOS2)
   04630 JAK-STAT signaling pathway
    101137539 (SOS2)
   04068 FoxO signaling pathway
    101137539 (SOS2)
   04072 Phospholipase D signaling pathway
    101137539 (SOS2)
   04151 PI3K-Akt signaling pathway
    101137539 (SOS2)
   04150 mTOR signaling pathway
    101137539 (SOS2)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    101137539 (SOS2)
   04540 Gap junction
    101137539 (SOS2)
  09142 Cell motility
   04810 Regulation of actin cytoskeleton
    101137539 (SOS2)
 09150 Organismal Systems
  09151 Immune system
   04650 Natural killer cell mediated cytotoxicity
    101137539 (SOS2)
   04660 T cell receptor signaling pathway
    101137539 (SOS2)
   04662 B cell receptor signaling pathway
    101137539 (SOS2)
   04664 Fc epsilon RI signaling pathway
    101137539 (SOS2)
   04062 Chemokine signaling pathway
    101137539 (SOS2)
  09152 Endocrine system
   04910 Insulin signaling pathway
    101137539 (SOS2)
   04912 GnRH signaling pathway
    101137539 (SOS2)
   04915 Estrogen signaling pathway
    101137539 (SOS2)
   04917 Prolactin signaling pathway
    101137539 (SOS2)
   04926 Relaxin signaling pathway
    101137539 (SOS2)
   04935 Growth hormone synthesis, secretion and action
    101137539 (SOS2)
  09156 Nervous system
   04722 Neurotrophin signaling pathway
    101137539 (SOS2)
  09159 Environmental adaptation
   04714 Thermogenesis
    101137539 (SOS2)
 09160 Human Diseases
  09161 Cancer: overview
   05200 Pathways in cancer
    101137539 (SOS2)
   05206 MicroRNAs in cancer
    101137539 (SOS2)
   05205 Proteoglycans in cancer
    101137539 (SOS2)
   05231 Choline metabolism in cancer
    101137539 (SOS2)
  09162 Cancer: specific types
   05210 Colorectal cancer
    101137539 (SOS2)
   05225 Hepatocellular carcinoma
    101137539 (SOS2)
   05226 Gastric cancer
    101137539 (SOS2)
   05214 Glioma
    101137539 (SOS2)
   05221 Acute myeloid leukemia
    101137539 (SOS2)
   05220 Chronic myeloid leukemia
    101137539 (SOS2)
   05211 Renal cell carcinoma
    101137539 (SOS2)
   05215 Prostate cancer
    101137539 (SOS2)
   05213 Endometrial cancer
    101137539 (SOS2)
   05224 Breast cancer
    101137539 (SOS2)
   05223 Non-small cell lung cancer
    101137539 (SOS2)
  09165 Substance dependence
   05034 Alcoholism
    101137539 (SOS2)
  09172 Infectious disease: viral
   05161 Hepatitis B
    101137539 (SOS2)
   05160 Hepatitis C
    101137539 (SOS2)
   05163 Human cytomegalovirus infection
    101137539 (SOS2)
   05165 Human papillomavirus infection
    101137539 (SOS2)
  09176 Drug resistance: antineoplastic
   01521 EGFR tyrosine kinase inhibitor resistance
    101137539 (SOS2)
   01522 Endocrine resistance
    101137539 (SOS2)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   04990 Domain-containing proteins not elsewhere classified [BR:ggo04990]
    101137539 (SOS2)
Domain-containing proteins not elsewhere classified [BR:ggo04990]
 Pleckstrin homology (PH) domain-containing proteins
  Dbl-Like RhoGEF family proteins
   101137539 (SOS2)
SSDB
Motif
Pfam: RasGEF RasGEF_N RhoGEF PH Histone IQ_SEC7_PH PH_19 PH_13
Other DBs
NCBI-GeneID: 101137539
NCBI-ProteinID: XP_030857684
Ensembl: ENSGGOG00000015477
LinkDB
Position
14
AA seq 1332 aa
MQQAPQPYEFFSEENSPKWRGLLVSALRKVQEQVHPTLSANEESLYYIEELIFQLLNKLC
MAQPRTVQDVEERVQKTFPHPIDKWAIADAQSAIEKRKRRNPLLLPVDKIHPSLKEVLGY
KVDYHVSLYIVAVLEYISADILKLAGNYVFNIRHYEISQQDIKVSMCADKVLMDMFDQDD
IGLVSLCEDEPSSSGELNYYDLVRTEIAEERQYLRELNMIIKVFREAFLSDRKLFKPSDI
EKIFSNISDIHELTVKLLGLIEDTVEMTDESSPHPLAGSCFEDLAEEQAFDPYETLSQDI
LSPEFHEHFNKLMARPAVALHFQSIADGFKEAVRYVLPRLMLVPVYHCWHYFELLKQLKA
CSEEQEDRECLNQAITALMNLQGSMDRIYKQYSPRRRPGDPVCPFYSHQLRSKHLAIKKM
NEIQKNIDGWEGKDIGQCCNEFIMEGPLTRIGAKHERHIFLFDGLMISCKPNHGQTRLPG
YSSAEYRLKEKFVMRKIQICDKEDTCEYKHAFELVSKDENSIIFAAKSAEEKNNWMAALI
SLHYRSTLDRMLDSVLLKEENEQPLRLPSPEVYRFVVKDSEENIVFEDNLQSRSGIPIIK
GGTVVKLIERLTYHMYADPNFVRTFLTTYRSFCKPQELLSLLIERFEIPEPEPTDADKLA
IEKGEQPISADLKRFRKEYVQPVQLRILNVFRHWVEHHFYDFERDLELLERLESFISSVR
GKAMKKWVESIAKIIRRKKQAQANGISHNITFESPPPPIEWHISKPGQFETFDLMTLHPI
EIARQLTLLESDLYRKVQPSELVGSVWTKEDKEINSPNLLKMIRHTTNLTLWFEKCIVEA
ENFEERVAVLSRIIEILQVFQDLNNFNGVLEIVSAVNSVSVYRLDHTFEALQERKRKILD
EAVELSQDHFKKYLVKLKSINPPCVPFFGIYLTNILKTEEGNNDFLKKKGKDLINFSKRR
KVAEITGEIQQYQNQPYCLRIEPDMRRFFENLNPMGSASEKEFTDYLFNKSLEIEPRNCK
QPPRFPRKSTFSLKSPGIRPNTGRHGSTSGTLRGHPTPLEREPCKISFSRIAETELESTV
SAPTSPNTPSTPPVSASSDLSVFLDVDLNSSCGSNSIFAPVLLPHSKSFFSSCGSLHKLS
EEPLIPPPLPPRKKFDHDASNSKGNMKSDDDPPAIPPRQPPPPKVKPRVPVPTGAFDGPL
HSPPPPPPRDPLPDTPPPVPLRPPEHFINCPFNLQPPPLGHLHRDSDWLRDISTCPNSPS
TPPSTPSPRVPRRCYVLSSSQNNLAHPPAPPVPPRQNSSPHLPKLPPKTYKRELSHPPLY
RLPLLENAETPQ
NT seq 3999 nt   +upstreamnt  +downstreamnt
atgcagcaggcgccgcagccttacgagttcttcagcgaggagaacagtccgaaatggcgg
ggactgttggtctcggccctgcggaaggttcaggaacaagtgcatcccactctctcagct
aatgaagagtctctctattatattgaagagctgatttttcagctgcttaataaattatgc
atggcccagccaaggactgttcaagatgtagaggagcgagttcagaagacctttcctcac
ccaattgataaatgggccattgctgatgcacaatctgctatagaaaaacgaaaacgaaga
aatcctcttttactgcctgtggacaaaatccatccttcgttgaaggaagtattagggtac
aaagtggactaccatgtgtccctatatattgtggctgtactagagtatatctcagctgat
attttaaaattggctggtaattatgtttttaatatccggcattatgaaatatctcagcag
gacattaaagtgtcaatgtgtgcggataaggttttgatggacatgtttgatcaggatgac
ataggtttggtttctctctgtgaagatgaacctagttcttctggtgaattaaactactat
gatcttgtcagaactgaaatcgcagaagaaagacagtatctacgggaattaaatatgatc
ataaaagtgtttcgagaagcctttctttctgatagaaagctgtttaaaccttctgatatc
gaaaagatttttagtaacatttcagatatacatgaattgactgtgaaacttttaggtttg
attgaagacacagttgaaatgactgatgaaagcagtcctcatcccttagctggcagctgt
tttgaagatttggcagaagagcaagcatttgatccttatgaaacattatcacaggacatt
ctttcaccagagtttcatgaacatttcaataaattgatggccagacctgcagttgctcta
cactttcagtccattgctgatggttttaaagaggcagttcgttatgtccttccacgtctt
atgctggtgccagtgtatcactgttggcactactttgagttactaaagcaattgaaagca
tgtagtgaagaacaagaagacagagaatgtttgaaccaagctattactgctctcatgaat
ctccaaggtagcatggaccgaatttacaagcagtattcacctagacgtcgacctggagat
cctgtttgccctttttatagtcaccaattaagaagcaaacacctggctatcaaaaaaatg
aatgaaattcagaaaaatatcgatggatgggaaggcaaagatattggacagtgttgtaat
gaattcattatggagggaccattgacaagaatcggtgccaaacatgaacggcatattttt
ctgtttgatggcttaatgatcagttgtaaacctaatcatggccagactcggcttccaggt
tacagtagtgcagaatacaggttaaaagaaaaatttgtcatgaggaaaatacaaatttgt
gataaagaagatacttgtgagtacaagcatgcatttgaattagtatccaaagatgagaat
agcataatatttgctgctaagtctgctgaagaaaaaaacaactggatggcagcccttatt
tctcttcattatcgtagtactctagatcgaatgttagattcagtattattgaaagaagaa
aatgagcaaccactgagattaccaagtcctgaagtatatcgttttgtagtaaaagactct
gaggaaaacattgtttttgaagacaacttgcaaagtagaagtggcatccccattattaaa
ggaggaactgtagtgaaattaattgaaaggttaacatatcatatgtatgcagatcccaat
tttgttcgtacttttcttaccacatatcgttcattttgtaaaccacaggaattgctgagc
ttactgattgaacggtttgaaattccagagccagaacctactgacgcagacaaattggca
atagagaaaggcgaacagccaatcagtgcagaccttaaaagatttcgcaaggaatatgtc
caaccagtacaacttaggatcttaaatgtatttcggcattgggttgaacatcatttttat
gactttgaaagagacttggaattgcttgaaagactagaatccttcatttcaagtgtaaga
gggaaagctatgaaaaaatgggtagagtcaatcgctaagatcatcaggaggaagaagcaa
gctcaggcaaacggaataagccataatattacctttgaaagtccacctccaccaattgaa
tggcatatcagcaaaccaggacagtttgaaacatttgatctcatgacacttcatccaata
gaaattgcacgtcagctgacacttttggagtctgatctctacaggaaagttcaaccgtct
gaacttgtagggagtgtgtggaccaaagaagataaagaaataaattctccaaatttatta
aaaatgattcgccataccacaaatctcaccctctggtttgaaaaatgcattgtggaagca
gaaaattttgaagaacgggtggcagtactaagtagaattatagaaattctgcaagttttt
caagatttgaataatttcaatggcgtattggagatagtcagtgcagtaaattcagtgtca
gtatacagactagaccatacctttgaggcattgcaggaaagaaaaaggaaaattttggac
gaagctgtggaattaagtcaagatcactttaaaaaatacctagtaaaacttaagtcaatc
aatccaccttgtgtgcctttttttggaatatatttaacaaatattctgaagaccgaagaa
gggaataatgattttttaaaaaagaaagggaaagatttaatcaatttcagtaagaggagg
aaagtagctgaaattactggagaaattcagcagtatcagaatcagccttactgtttacgg
atagaaccagatatgaggagattctttgaaaaccttaaccccatgggaagtgcatctgaa
aaagagtttacagattatttgttcaacaagtcactagaaattgaacctcgaaactgcaaa
cagccacctcgatttcctaggaaatcaactttttccttaaaatctcctggaataaggcct
aacacaggccgacatggctctacctcaggtactttacgaggtcacccaacaccattagaa
agagaaccatgtaaaataagctttagtcggattgctgaaactgagctggaatcaacagtg
tcagcaccaacctctccaaatacaccatctactccaccagtatctgcttcttcagacctt
agtgtatttttagatgtggatctcaacagctcctgtggcagcaatagcatctttgctcca
gtgcttttgccacattcaaagtctttctttagttcatgtggtagtttacataaactaagt
gaagagcccctgattcctcctcctcttcctcctcgaaaaaagtttgatcatgatgcttca
aattccaagggaaatatgaaatctgatgatgatcctcctgctattccaccgagacagcct
cctcctccaaaggtaaaacccagagttcctgttcctactggtgcatttgatgggcctctg
catagtccacctccgccaccaccaagagatcctcttcctgatacccctccaccagttccc
cttcggcctccagaacactttataaactgtccatttaatcttcagccacctccactgggg
catcttcacagagattcagactggctcagagacattagtacctgtccaaattcgccaagc
actcctcctagcacaccctctccaagggtaccgcgtcgatgctatgtgctcagttctagt
cagaataatcttgctcatcctccagctccccctgttccaccaaggcagaattcaagccct
catctgccaaaactgccaccaaagacttacaaacgggagctttcgcaccccccattgtac
agactgcctttgctagaaaatgcagaaactccccaatga

DBGET integrated database retrieval system