KEGG   Odobenus rosmarus divergens (Pacific walrus): 101369947
Entry
101369947         CDS       T05148                                 

Gene name
SOS1
Definition
(RefSeq) son of sevenless homolog 1 isoform X1
  KO
K03099  son of sevenless
Organism
oro  Odobenus rosmarus divergens (Pacific walrus)
Pathway
oro01521  EGFR tyrosine kinase inhibitor resistance
oro01522  Endocrine resistance
oro04010  MAPK signaling pathway
oro04012  ErbB signaling pathway
oro04014  Ras signaling pathway
oro04062  Chemokine signaling pathway
oro04068  FoxO signaling pathway
oro04072  Phospholipase D signaling pathway
oro04150  mTOR signaling pathway
oro04151  PI3K-Akt signaling pathway
oro04510  Focal adhesion
oro04540  Gap junction
oro04630  JAK-STAT signaling pathway
oro04650  Natural killer cell mediated cytotoxicity
oro04660  T cell receptor signaling pathway
oro04662  B cell receptor signaling pathway
oro04664  Fc epsilon RI signaling pathway
oro04714  Thermogenesis
oro04722  Neurotrophin signaling pathway
oro04810  Regulation of actin cytoskeleton
oro04910  Insulin signaling pathway
oro04912  GnRH signaling pathway
oro04915  Estrogen signaling pathway
oro04917  Prolactin signaling pathway
oro04926  Relaxin signaling pathway
oro04935  Growth hormone synthesis, secretion and action
oro05034  Alcoholism
oro05160  Hepatitis C
oro05161  Hepatitis B
oro05163  Human cytomegalovirus infection
oro05165  Human papillomavirus infection
oro05200  Pathways in cancer
oro05205  Proteoglycans in cancer
oro05206  MicroRNAs in cancer
oro05210  Colorectal cancer
oro05211  Renal cell carcinoma
oro05213  Endometrial cancer
oro05214  Glioma
oro05215  Prostate cancer
oro05220  Chronic myeloid leukemia
oro05221  Acute myeloid leukemia
oro05223  Non-small cell lung cancer
oro05224  Breast cancer
oro05225  Hepatocellular carcinoma
oro05226  Gastric cancer
oro05231  Choline metabolism in cancer
Brite
KEGG Orthology (KO) [BR:oro00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04010 MAPK signaling pathway
    101369947 (SOS1)
   04012 ErbB signaling pathway
    101369947 (SOS1)
   04014 Ras signaling pathway
    101369947 (SOS1)
   04630 JAK-STAT signaling pathway
    101369947 (SOS1)
   04068 FoxO signaling pathway
    101369947 (SOS1)
   04072 Phospholipase D signaling pathway
    101369947 (SOS1)
   04151 PI3K-Akt signaling pathway
    101369947 (SOS1)
   04150 mTOR signaling pathway
    101369947 (SOS1)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    101369947 (SOS1)
   04540 Gap junction
    101369947 (SOS1)
  09142 Cell motility
   04810 Regulation of actin cytoskeleton
    101369947 (SOS1)
 09150 Organismal Systems
  09151 Immune system
   04650 Natural killer cell mediated cytotoxicity
    101369947 (SOS1)
   04660 T cell receptor signaling pathway
    101369947 (SOS1)
   04662 B cell receptor signaling pathway
    101369947 (SOS1)
   04664 Fc epsilon RI signaling pathway
    101369947 (SOS1)
   04062 Chemokine signaling pathway
    101369947 (SOS1)
  09152 Endocrine system
   04910 Insulin signaling pathway
    101369947 (SOS1)
   04912 GnRH signaling pathway
    101369947 (SOS1)
   04915 Estrogen signaling pathway
    101369947 (SOS1)
   04917 Prolactin signaling pathway
    101369947 (SOS1)
   04926 Relaxin signaling pathway
    101369947 (SOS1)
   04935 Growth hormone synthesis, secretion and action
    101369947 (SOS1)
  09156 Nervous system
   04722 Neurotrophin signaling pathway
    101369947 (SOS1)
  09159 Environmental adaptation
   04714 Thermogenesis
    101369947 (SOS1)
 09160 Human Diseases
  09161 Cancer: overview
   05200 Pathways in cancer
    101369947 (SOS1)
   05206 MicroRNAs in cancer
    101369947 (SOS1)
   05205 Proteoglycans in cancer
    101369947 (SOS1)
   05231 Choline metabolism in cancer
    101369947 (SOS1)
  09162 Cancer: specific types
   05210 Colorectal cancer
    101369947 (SOS1)
   05225 Hepatocellular carcinoma
    101369947 (SOS1)
   05226 Gastric cancer
    101369947 (SOS1)
   05214 Glioma
    101369947 (SOS1)
   05221 Acute myeloid leukemia
    101369947 (SOS1)
   05220 Chronic myeloid leukemia
    101369947 (SOS1)
   05211 Renal cell carcinoma
    101369947 (SOS1)
   05215 Prostate cancer
    101369947 (SOS1)
   05213 Endometrial cancer
    101369947 (SOS1)
   05224 Breast cancer
    101369947 (SOS1)
   05223 Non-small cell lung cancer
    101369947 (SOS1)
  09165 Substance dependence
   05034 Alcoholism
    101369947 (SOS1)
  09172 Infectious disease: viral
   05161 Hepatitis B
    101369947 (SOS1)
   05160 Hepatitis C
    101369947 (SOS1)
   05163 Human cytomegalovirus infection
    101369947 (SOS1)
   05165 Human papillomavirus infection
    101369947 (SOS1)
  09176 Drug resistance: antineoplastic
   01521 EGFR tyrosine kinase inhibitor resistance
    101369947 (SOS1)
   01522 Endocrine resistance
    101369947 (SOS1)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   04990 Domain-containing proteins not elsewhere classified [BR:oro04990]
    101369947 (SOS1)
Domain-containing proteins not elsewhere classified [BR:oro04990]
 Pleckstrin homology (PH) domain-containing proteins
  Dbl-Like RhoGEF family proteins
   101369947 (SOS1)
Motif
Pfam: RasGEF RasGEF_N RhoGEF Histone PH PH_19 PH_10 IQ_SEC7_PH PH_13
Other DBs
NCBI-GeneID: 101369947
NCBI-ProteinID: XP_004410061
UniProt: A0A2U3WM57
Position
Unknown
AA seq 1333 aa
MQAQQLPYEFFSEENAPKWRGLLVPALKKVQGQVHPTLESSDDALQYVEELILQLLNMLC
QAQPRSASDVEERVQKSFPHPIDKWAIADAQSAIEKRKRRNPLSLPVEKIHPLLKEVLGY
KIDHQVSVYIVAVLEYISADILKLVGNYVRNIRHYEITKQDIKVAMCADKVLMDMFHQDV
EDINILSLTDEEPSTSGEQTYYDLVKAFMAEIRQYIRELNLIIKVFREPFVSNSKLFSAN
DVENIFSRIVDIHELSVKLLGHIEDTVEMTDEGSPHPLVGSCFEDLAEELAFDPYESYAR
DILRPGFHDRFLSQLSKPGAALYLQSIGEGFKEAVQYVLPRLLLAPVYHCLHYFELLKQL
EEKSEDQEDKECLKQAITALLNVQSGMEKICSKSLAKRRLSESACRFYSQQMKGKQLAIK
KMNEIQKNIDGWEGKDIGQCCNEFIMEGTLTRVGAKHERHIFLFDGLMICCKSNHGQPRL
PGASNAEYRLKEKFFMRKVQINDKDDTSEYKHAFEIILKDENSVIFSAKSAEEKNNWMAA
LISLQYRSTLERMLDVTMLQEEKEEQMRLPSADVYRFAEPDSEENIIFEENMQPKAGIPI
IKAGTVIKLIERLTYHMYADPNFVRTFLTTYRSFCKPQELLSLIIERFEIPEPEPTEADR
IAIENGDQPLSAELKRFRKEYIQPVQLRVLNVCRHWVEHHFYDFERDADLLQRMEEFIGT
VRGKAMKKWVESITKIIQRKKIARDNGPGHNITFQSSPPTVEWHISRPGHIETFDLLTLH
PIEIARQLTLLESDLYRAVQPSELVGSVWTKEDKEINSPNLLKMIRHTTNLTLWFEKCIV
ETENLEERVAVVSRIIEILQVFQELNNFNGVLEVVSAMNSSPVYRLDHTFEQIPSRQKKI
LEEAHELSEDHYKKYLAKLRSINPPCVPFFGIYLTNILKTEEGNPEVLKRHGKELINFSK
RRKVAEITGEIQQYQNQPYCLRVESDIKRFFENLNPMGNSMEKEFTDYLFNKSLEIEPRN
PKPLPRFPKKYSYPLKSPGVRPSNPRPGTMRHPTPLQQEPRKISYSRIPESETESTASAP
NSPRTPLTPPPASGASSTTDVCSVFDSDHSSPFHSSSDTIFIQVSLPHGPRSASVSSISL
TKGTDEVPLPPPVPPRRRPESAPAESSPSKIMSKHLDSPPAIPPRQPTSKAYSPRYSISD
RTSISDPPESPPLLPPREPVRTPDVFSSSPLHLQPPPLGKKSDHGNAFFPNSPSPFTPPP
PQTPSPHGTRRHLPSPPLTQEVDLPSITGPPVPPRQSTSQHIPKLPPKTYKREHTHPSMH
RDGPPLLENAHSS
NT seq 4002 nt
atgcaggcgcagcagctgccgtacgagtttttcagcgaggagaacgcacccaagtggcgg
gggctgctggtgccggcgctgaaaaaggtccaggggcaagttcatccgactcttgagtct
agtgatgatgctcttcaatatgttgaagaattaattttgcagttattaaatatgctatgc
caagctcagccccgaagtgcttcagatgtagaggaacgtgttcaaaaaagtttccctcat
ccaattgataagtgggcaatagctgatgcccaatcggctattgaaaagaggaagcgaaga
aaccctttatctcttccagtagaaaaaattcatcctttactaaaggaggtcctaggttat
aaaattgaccaccaggtttctgtttacatagtagctgtattagaatacatttctgcagac
attttaaagctggtggggaattatgtacgaaatatacggcattatgaaattacaaaacaa
gatattaaagtggcaatgtgtgctgataaggtattaatggacatgtttcatcaagatgta
gaagatataaatatattatctttaactgatgaagaaccttccacctcaggagagcaaact
tattatgatttggtaaaagcatttatggcagaaattcgacagtacataagggaactaaat
ttaattataaaagtttttagagagccctttgtctccaactcaaaattgttttcagctaat
gatgtagaaaatatatttagtcgtatagtagatatacatgaacttagtgtaaagttactg
ggccatatagaagatactgtggaaatgactgatgaaggtagcccccatcccttagtagga
agctgctttgaagacttagcagaggaactggcatttgatccatatgaatcatatgctcga
gatattttacggcctggttttcatgatcgcttccttagtcagttatcaaagcctggagcg
gcactctatttacagtcaataggcgaaggtttcaaagaagctgttcagtatgttttaccc
aggctacttctagcccctgtttaccactgtctacattactttgaacttttgaagcagttg
gaagaaaagagtgaagatcaagaagacaaggaatgtttgaaacaagcaataacagctttg
cttaatgttcagagtggtatggaaaaaatatgttctaaaagtcttgcaaaacgaagactg
agtgaatctgcatgtcggttttatagtcagcaaatgaaggggaaacaactagcaatcaag
aaaatgaacgagattcaaaagaatattgatggttgggagggaaaggacattggacagtgt
tgcaatgaatttataatggaaggaactcttacacgtgtaggagccaaacatgagagacac
atatttctctttgatggcttaatgatttgctgtaaatcaaatcatgggcagccaagactt
cctggtgctagcaatgcagaatatcgtcttaaagaaaagtttttcatgcgaaaggtacaa
attaatgacaaagatgacaccagtgagtacaagcatgcttttgaaataattttaaaagat
gaaaatagtgttatattttctgccaagtcagctgaagagaaaaacaactggatggcagcg
ttgatatctttacagtaccggagtacactggaaaggatgcttgatgtgacgatgctgcag
gaagagaaggaggagcagatgaggctccctagtgctgatgtttatagatttgcagagcct
gactctgaagagaatataatatttgaagaaaacatgcagcccaaggctggaattccaatt
atcaaagcgggaactgttattaaacttatagagaggctcacataccatatgtatgcagat
cccaattttgttcggacatttcttacaacatacagatccttttgtaaacctcaagaacta
ctgagtcttataatagaaaggtttgaaattccagagcctgagccaacagaagctgaccgc
atagctatagagaacggagatcagcccttgagtgcagaactaaaaaggtttagaaaagag
tacatacagcccgtgcagctgcgagtattaaatgtgtgtcggcattgggtagagcaccac
ttctatgattttgaaagggatgcagatcttttgcagcgaatggaagaatttattggaaca
gtgagaggtaaagcaatgaaaaaatgggttgaatccatcactaaaataatccaaaggaaa
aaaattgcaagagacaatggaccaggtcataatattacatttcagagttcacctcccaca
gttgagtggcatataagcaggcctgggcacatagagacttttgacctgctcaccttacac
ccaatagaaattgctcgacaactcactttacttgaatcagatctatatcgagctgtacag
ccatcagaattagttggaagtgtgtggacaaaagaagacaaagaaattaattctcctaat
cttctaaaaatgatccggcataccactaatctcactctgtggtttgagaaatgtattgta
gaaactgaaaatttagaagaaagagtagctgtggtgagtcgaataattgagattctacaa
gtctttcaagagctgaacaacttcaatggtgtccttgaggttgtcagtgctatgaactcc
tcacctgtttacagactagaccacacatttgagcaaataccaagtcgccaaaagaaaatt
ttagaagaagctcatgaattaagtgaagatcactataagaaatatttggcaaaactcagg
tctattaatccaccatgtgtgcctttctttggaatttatctaactaatatcttgaaaaca
gaagaaggcaaccctgaggtcctaaaaaggcatggaaaagagcttataaactttagcaaa
aggaggaaagtagcagaaataacaggagagatccagcagtaccaaaaccagccttattgt
ttacgagtagaatcagatatcaaaaggttctttgaaaatttgaatccaatgggaaatagt
atggaaaaagaatttacagattatcttttcaacaaatccctagaaatagaaccacgaaac
cctaagcctctcccaagatttccaaaaaaatatagctatcccctaaaatctcctggtgtt
cgtccatcaaacccaagaccagggaccatgagacatcccacacctctgcagcaggagcca
aggaaaattagttatagtaggatccctgaaagtgaaacagaaagtacagcatctgcacca
aactctccaagaacaccgttaacacctcctcctgcttctggtgcttctagtaccacagat
gtttgcagcgtttttgattccgatcattcgagcccttttcattcaagcagcgataccatc
tttatccaagtttcactgccccatggcccaagatctgcttcagtatcgtctataagttta
accaagggcactgatgaagtgcccctcccccctcccgttcctccacgaagacggccggaa
tctgcccccgccgaatcttcgccgtctaagatcatgtctaagcatttggacagcccccca
gcgattcctcctaggcagcccacatccaaagcctattcaccacgatactcaatatcagac
cggacctctatatcagaccctcctgagagccctcccttattaccaccacgagaacctgtg
aggacacctgatgttttctccagctccccgctacatctccagcctccccctttgggcaaa
aaaagcgatcatggcaatgccttcttcccaaacagcccttccccttttactccacctcct
cctcagacaccatctccccatggcacgagaaggcatctgccgtcaccgccattgacacaa
gaagtggaccttccttccatcactgggccacctgttcctccacgacaaagcacttctcaa
catatccctaaactccctccaaaaacttataaaagggagcacacacacccatccatgcac
agagacggaccaccgctgctggagaacgcccactcctcctga

KEGG   Odobenus rosmarus divergens (Pacific walrus): 101385114
Entry
101385114         CDS       T05148                                 

Gene name
SOS2
Definition
(RefSeq) son of sevenless homolog 2 isoform X1
  KO
K03099  son of sevenless
Organism
oro  Odobenus rosmarus divergens (Pacific walrus)
Pathway
oro01521  EGFR tyrosine kinase inhibitor resistance
oro01522  Endocrine resistance
oro04010  MAPK signaling pathway
oro04012  ErbB signaling pathway
oro04014  Ras signaling pathway
oro04062  Chemokine signaling pathway
oro04068  FoxO signaling pathway
oro04072  Phospholipase D signaling pathway
oro04150  mTOR signaling pathway
oro04151  PI3K-Akt signaling pathway
oro04510  Focal adhesion
oro04540  Gap junction
oro04630  JAK-STAT signaling pathway
oro04650  Natural killer cell mediated cytotoxicity
oro04660  T cell receptor signaling pathway
oro04662  B cell receptor signaling pathway
oro04664  Fc epsilon RI signaling pathway
oro04714  Thermogenesis
oro04722  Neurotrophin signaling pathway
oro04810  Regulation of actin cytoskeleton
oro04910  Insulin signaling pathway
oro04912  GnRH signaling pathway
oro04915  Estrogen signaling pathway
oro04917  Prolactin signaling pathway
oro04926  Relaxin signaling pathway
oro04935  Growth hormone synthesis, secretion and action
oro05034  Alcoholism
oro05160  Hepatitis C
oro05161  Hepatitis B
oro05163  Human cytomegalovirus infection
oro05165  Human papillomavirus infection
oro05200  Pathways in cancer
oro05205  Proteoglycans in cancer
oro05206  MicroRNAs in cancer
oro05210  Colorectal cancer
oro05211  Renal cell carcinoma
oro05213  Endometrial cancer
oro05214  Glioma
oro05215  Prostate cancer
oro05220  Chronic myeloid leukemia
oro05221  Acute myeloid leukemia
oro05223  Non-small cell lung cancer
oro05224  Breast cancer
oro05225  Hepatocellular carcinoma
oro05226  Gastric cancer
oro05231  Choline metabolism in cancer
Brite
KEGG Orthology (KO) [BR:oro00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04010 MAPK signaling pathway
    101385114 (SOS2)
   04012 ErbB signaling pathway
    101385114 (SOS2)
   04014 Ras signaling pathway
    101385114 (SOS2)
   04630 JAK-STAT signaling pathway
    101385114 (SOS2)
   04068 FoxO signaling pathway
    101385114 (SOS2)
   04072 Phospholipase D signaling pathway
    101385114 (SOS2)
   04151 PI3K-Akt signaling pathway
    101385114 (SOS2)
   04150 mTOR signaling pathway
    101385114 (SOS2)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    101385114 (SOS2)
   04540 Gap junction
    101385114 (SOS2)
  09142 Cell motility
   04810 Regulation of actin cytoskeleton
    101385114 (SOS2)
 09150 Organismal Systems
  09151 Immune system
   04650 Natural killer cell mediated cytotoxicity
    101385114 (SOS2)
   04660 T cell receptor signaling pathway
    101385114 (SOS2)
   04662 B cell receptor signaling pathway
    101385114 (SOS2)
   04664 Fc epsilon RI signaling pathway
    101385114 (SOS2)
   04062 Chemokine signaling pathway
    101385114 (SOS2)
  09152 Endocrine system
   04910 Insulin signaling pathway
    101385114 (SOS2)
   04912 GnRH signaling pathway
    101385114 (SOS2)
   04915 Estrogen signaling pathway
    101385114 (SOS2)
   04917 Prolactin signaling pathway
    101385114 (SOS2)
   04926 Relaxin signaling pathway
    101385114 (SOS2)
   04935 Growth hormone synthesis, secretion and action
    101385114 (SOS2)
  09156 Nervous system
   04722 Neurotrophin signaling pathway
    101385114 (SOS2)
  09159 Environmental adaptation
   04714 Thermogenesis
    101385114 (SOS2)
 09160 Human Diseases
  09161 Cancer: overview
   05200 Pathways in cancer
    101385114 (SOS2)
   05206 MicroRNAs in cancer
    101385114 (SOS2)
   05205 Proteoglycans in cancer
    101385114 (SOS2)
   05231 Choline metabolism in cancer
    101385114 (SOS2)
  09162 Cancer: specific types
   05210 Colorectal cancer
    101385114 (SOS2)
   05225 Hepatocellular carcinoma
    101385114 (SOS2)
   05226 Gastric cancer
    101385114 (SOS2)
   05214 Glioma
    101385114 (SOS2)
   05221 Acute myeloid leukemia
    101385114 (SOS2)
   05220 Chronic myeloid leukemia
    101385114 (SOS2)
   05211 Renal cell carcinoma
    101385114 (SOS2)
   05215 Prostate cancer
    101385114 (SOS2)
   05213 Endometrial cancer
    101385114 (SOS2)
   05224 Breast cancer
    101385114 (SOS2)
   05223 Non-small cell lung cancer
    101385114 (SOS2)
  09165 Substance dependence
   05034 Alcoholism
    101385114 (SOS2)
  09172 Infectious disease: viral
   05161 Hepatitis B
    101385114 (SOS2)
   05160 Hepatitis C
    101385114 (SOS2)
   05163 Human cytomegalovirus infection
    101385114 (SOS2)
   05165 Human papillomavirus infection
    101385114 (SOS2)
  09176 Drug resistance: antineoplastic
   01521 EGFR tyrosine kinase inhibitor resistance
    101385114 (SOS2)
   01522 Endocrine resistance
    101385114 (SOS2)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   04990 Domain-containing proteins not elsewhere classified [BR:oro04990]
    101385114 (SOS2)
Domain-containing proteins not elsewhere classified [BR:oro04990]
 Pleckstrin homology (PH) domain-containing proteins
  Dbl-Like RhoGEF family proteins
   101385114 (SOS2)
Motif
Pfam: RasGEF RasGEF_N RhoGEF Histone PH PH_19 IQ_SEC7_PH PH_13
Other DBs
NCBI-GeneID: 101385114
NCBI-ProteinID: XP_004408083
UniProt: A0A2U3WH65
Position
Unknown
AA seq 1333 aa
MQQAPQPYEFFSEENSPKWRGLLVSALRKVQEQVHPNLSANEESLYYIEELIFQLLNKLC
MAQPRTVQDVEERVQKTFPHPIDKWAIADAQSAIEKRKRRNPLLLPVDKIHPSLKEVLGY
KVDYHVSLYIVAVLEYISADILKLAGNYVFNIRHYEISQQDIKVSMCADKVLMDMFDQDD
IGLVSLCEDEPSSSGELNYYDLVRTEIAEERQYLRELNMIIKVFREAFLCDRKLFKASDI
DRIFSNISDIHELTVKLLGLIEDTVEMTDESSPHPLAGSCFEDLAEEQAFDPYETLSQDI
LSPKFNEHFSKLMARPAVALHFQSIADGFKEAVRYVLPRLMLVPVYHCWHYFELLKQLKA
CSEEQEDRECLNQAITALMNLQGSMDRIYKQYSPRRRPGDPVCPFYNRQLRSKHLAIKKM
NEIQKNIDGWEGKDIGQCCNEFIMEGPLTRIGAKHERHIFLFDGLMISCKPNHSQSRLPG
YSSAEYRLKEKFVMRKIQICDKEDTCECRHAFELVSKDENSIIFAARSAEEKNNWMAALI
SLHYRSTLDRMLDSVLLKEENEQPLRLPSPEVYRFVIKDSEENIVFEDNLQSRSGIPIIK
GGTVVKLIERLTYHMYADPNFVRTFLTTYRSFCKPQELLSLLIERFEIPEPEPTEADKLA
VEKGEQPISADLKRFRKEYVQPVQLRILNVFRHWVEHHFYDFERDLELLERLESFISSVR
GKAMKKWVESIAKIIKRKKQAQANGISHNITFESPPPPIEWHISRAGQFETFDLMTLHPI
EIARQLTLLESDLYRKVQPSELVGSVWTKEDKEINSPNLLKMIRHTTNLTLWFEKCIVEA
ENFEERVAILSRIIEILQVFQDLNNFNGVLEIVSAVNSVSVYRLDHTFEALQERKRRILD
EAVELSQDHFKKYLVKLKSINPPCVPFFGIYLTNILKTEEGNNDFLKKKGKDLINFSKRR
KVAEITGEIQQYQNQPYCLRIEPEMRRFFENLNPMGSASEKEFTDYLFNKSLEIEPRNCK
QPPRFPRKSTFSLKSPGIRPNTGRHGSTSGTLRGHPTPLEREPCKMSFSRIAETDLESTV
SAPTSPNTPSTPPVSASSDLSVFLDVDLNSSCGSNSIFAPVLLPHSKSFFSSCGSLHKLS
EEQLLIPPPLPPRKKFDHDASNSKGNTKSDDDPPAIPPRQPPPPKVKPRVPAPAGAFDGP
LHSPPPPPPRDPLPDTPPPVPLRPPEHFINCPFNLQPPPLGHLHRDPDWFRDVSTCPNSP
NTPPSTPSPRVPRRCYVLSSSHNNLAHPQAPPVPPRQNSSPHLPKLPPKTYKRELSHPPS
YRLPLLENAETPQ
NT seq 4002 nt
atgcagcaggcgccgcagccgtacgagtttttcagcgaagagaacagtccgaaatggcgg
ggactgctggtctcggccctgcggaaggttcaggagcaagtacatcccaatctctcagct
aatgaagagtctctctattatattgaagagctgatttttcagctgcttaataaattatgc
atggctcaaccacggactgttcaagatgtggaggaacgtgttcaaaagacctttcctcat
ccaattgataaatgggctattgctgatgcacaatctgccatagaaaaacgaaaacgaaga
aatcctctcttactccctgtggacaaaatccatccttcattgaaggaagttttagggtac
aaagtggactatcatgtatccctgtatattgtggctgtactagagtatatctcagctgat
attttgaagttggctggtaattatgtttttaatatccgacattatgaaatatctcagcag
gacattaaagtgtcaatgtgtgcagataaggttttgatggacatgtttgaccaggatgac
ataggtttggtttctctctgtgaagatgaacccagttcttcaggtgaattaaactactat
gaccttgtcagaactgaaattgcagaagaaaggcagtacctacgggaactaaatatgatc
ataaaagtgtttcgagaagcctttctttgtgacagaaagctgtttaaagcttctgatatt
gacaggattttcagtaacatttcagatatacatgaattaactgtgaaacttttaggttta
attgaagacacagttgaaatgactgatgaaagcagtcctcatccgttagctggcagctgt
tttgaagatttggcagaagagcaagcatttgatccttatgaaacactatcacaggacatt
ctttctccaaaatttaatgaacatttcagtaagttgatggccagacctgcagtggctcta
cactttcagtccatcgctgatggttttaaagaggcagttcgttatgtccttccacgcctt
atgctggtgcctgtttatcattgttggcactattttgaattattaaagcaattgaaagca
tgtagtgaagagcaggaagacagagaatgtttgaaccaagctattactgctcttatgaat
ctccaaggtagtatggaccgaatttacaagcagtattcacctagacgccgacctggggat
cctgtttgccctttttataatcgtcagttaagaagcaagcatctggctattaaaaaaatg
aatgagattcagaaaaacatagacggatgggaaggcaaagatattggacagtgttgtaat
gaattcattatggaaggtccattgacacgaattggtgctaaacatgaacgacatattttt
ctctttgatggcttaatgatcagctgtaagcccaatcatagtcagtcacgccttccaggt
tatagtagtgcagaatacagattaaaagaaaaatttgtcatgaggaaaatacaaatatgt
gataaagaagatacttgtgagtgcagacatgcttttgagttggtatccaaagatgaaaac
agcataatatttgctgctaggtctgctgaagagaaaaataattggatggccgcacttatt
tctcttcactatcgtagtacactagatcgaatgctagattcagtgttactgaaagaagaa
aatgagcaaccattgagattaccaagtcctgaagtgtatcgttttgtgataaaagactct
gaggaaaacattgtttttgaagacaacctgcaaagtagaagtggaatccccattattaaa
ggaggaactgtagtgaaattaattgaaaggttaacataccatatgtatgcagatcccaat
tttgttcgtacttttcttactacataccgttcattttgtaaaccacaggaattgctaagc
ttactgattgaacgatttgaaattccagagccagaacctactgaagcagataaattggct
gtagagaaaggcgaacagccaatcagtgcagaccttaaaaggtttcgcaaggaatatgtc
caaccagtacaacttaggatcttaaatgtgtttcggcactgggttgaacaccatttttat
gactttgaaagagatttggagttgcttgaaagactagaatccttcatttcaagtgtaaga
gggaaagctatgaagaaatgggtagagtcaattgctaagatcatcaagagaaagaaacaa
gctcaagcaaatggaataagccataatattacctttgaaagtccacctccaccaattgaa
tggcatatcagcagagcaggacagtttgaaacatttgatctcatgacacttcatccaata
gaaattgcacgtcagctgacacttttggaatctgatctctacaggaaagtccaaccttct
gaactagtagggagtgtatggaccaaagaagataaagaaataaattctccaaatttatta
aaaatgattcgccataccacaaatctcaccctatggtttgaaaagtgcattgtggaagcg
gaaaattttgaggaacgggtggcaatactaagtagaattatagaaattctgcaagttttt
caagatttgaataatttcaatggtgtattggagatagtcagtgcggtaaattcagtgtcg
gtatacagactagaccatacttttgaggcgttgcaggaaagaaaaaggagaattttggat
gaagctgtggaattaagtcaagatcactttaaaaaatacctagtaaaacttaagtcaatc
aatccaccttgtgtgcctttttttggaatatatttaacaaatattctgaaaactgaagaa
gggaataatgattttttaaaaaagaaagggaaagatttaatcaatttcagtaagaggagg
aaagtagctgaaattacgggagaaattcagcagtatcagaatcaaccttactgtttacgg
atagaaccagaaatgaggaggttctttgaaaaccttaaccccatgggaagtgcttctgaa
aaggagtttacagattatttgttcaacaagtcactagaaattgaaccccgaaactgcaaa
cagccacctcgatttcctaggaaatcaactttctctttaaaatctcctggaatacggcct
aatacaggccgacatggctctacctcaggcactttacgaggtcatccaacgccattagaa
agagaaccatgtaaaatgagctttagtcggattgctgaaactgatcttgaatcaactgtg
tcagcaccaacctctccaaatacaccatctactccaccagtatctgcttcttcagacctt
agtgtgtttttagatgtggatctcaacagttcctgtggcagcaatagcatctttgctcca
gtcctcttgccacactcaaagtctttcttcagttcatgtggcagtttacataaactaagt
gaagagcagctgctgattcctcctcctcttcctcctcgaaagaagtttgatcatgatgct
tcaaattccaagggaaatacgaaatctgatgatgacccccctgctattccaccaagacaa
cctcctcctccaaaggtaaaacccagagttcctgctcctgctggtgcatttgatgggcct
ctgcatagcccacctccaccaccgccaagagatcctcttcctgatacccctccaccagtt
ccccttcggcctccagaacactttataaactgtccatttaatcttcagccacctccacta
ggacatcttcacagagatccagactggttcagagacgttagtacctgtccaaattcgcca
aatactcctcctagcacaccctctccgagagtaccacgtcgatgctatgtgctcagttct
agtcacaacaaccttgctcatcctcaagctccccccgttccaccaaggcagaattcaagc
cctcacctaccaaaactgccaccaaagacttacaaacgggagctttcgcaccctccatca
tacagactgcctttgctagaaaatgcggaaactcctcaatga
