KEGG   Bos indicus (zebu cattle): 109564346
Entry
109564346         CDS       T04792                                 

Gene name
SOS2
Definition
(RefSeq) son of sevenless homolog 2
  KO
K03099  son of sevenless
Organism
biu  Bos indicus (zebu cattle)
Pathway
biu01521  EGFR tyrosine kinase inhibitor resistance
biu01522  Endocrine resistance
biu04010  MAPK signaling pathway
biu04012  ErbB signaling pathway
biu04014  Ras signaling pathway
biu04062  Chemokine signaling pathway
biu04068  FoxO signaling pathway
biu04072  Phospholipase D signaling pathway
biu04150  mTOR signaling pathway
biu04151  PI3K-Akt signaling pathway
biu04510  Focal adhesion
biu04540  Gap junction
biu04630  JAK-STAT signaling pathway
biu04650  Natural killer cell mediated cytotoxicity
biu04660  T cell receptor signaling pathway
biu04662  B cell receptor signaling pathway
biu04664  Fc epsilon RI signaling pathway
biu04714  Thermogenesis
biu04722  Neurotrophin signaling pathway
biu04810  Regulation of actin cytoskeleton
biu04910  Insulin signaling pathway
biu04912  GnRH signaling pathway
biu04915  Estrogen signaling pathway
biu04917  Prolactin signaling pathway
biu04926  Relaxin signaling pathway
biu04935  Growth hormone synthesis, secretion and action
biu05034  Alcoholism
biu05160  Hepatitis C
biu05161  Hepatitis B
biu05163  Human cytomegalovirus infection
biu05165  Human papillomavirus infection
biu05200  Pathways in cancer
biu05205  Proteoglycans in cancer
biu05206  MicroRNAs in cancer
biu05210  Colorectal cancer
biu05211  Renal cell carcinoma
biu05213  Endometrial cancer
biu05214  Glioma
biu05215  Prostate cancer
biu05220  Chronic myeloid leukemia
biu05221  Acute myeloid leukemia
biu05223  Non-small cell lung cancer
biu05224  Breast cancer
biu05225  Hepatocellular carcinoma
biu05226  Gastric cancer
biu05231  Choline metabolism in cancer
Brite
KEGG Orthology (KO) [BR:biu00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04010 MAPK signaling pathway
    109564346 (SOS2)
   04012 ErbB signaling pathway
    109564346 (SOS2)
   04014 Ras signaling pathway
    109564346 (SOS2)
   04630 JAK-STAT signaling pathway
    109564346 (SOS2)
   04068 FoxO signaling pathway
    109564346 (SOS2)
   04072 Phospholipase D signaling pathway
    109564346 (SOS2)
   04151 PI3K-Akt signaling pathway
    109564346 (SOS2)
   04150 mTOR signaling pathway
    109564346 (SOS2)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    109564346 (SOS2)
   04540 Gap junction
    109564346 (SOS2)
  09142 Cell motility
   04810 Regulation of actin cytoskeleton
    109564346 (SOS2)
 09150 Organismal Systems
  09151 Immune system
   04650 Natural killer cell mediated cytotoxicity
    109564346 (SOS2)
   04660 T cell receptor signaling pathway
    109564346 (SOS2)
   04662 B cell receptor signaling pathway
    109564346 (SOS2)
   04664 Fc epsilon RI signaling pathway
    109564346 (SOS2)
   04062 Chemokine signaling pathway
    109564346 (SOS2)
  09152 Endocrine system
   04910 Insulin signaling pathway
    109564346 (SOS2)
   04912 GnRH signaling pathway
    109564346 (SOS2)
   04915 Estrogen signaling pathway
    109564346 (SOS2)
   04917 Prolactin signaling pathway
    109564346 (SOS2)
   04926 Relaxin signaling pathway
    109564346 (SOS2)
   04935 Growth hormone synthesis, secretion and action
    109564346 (SOS2)
  09156 Nervous system
   04722 Neurotrophin signaling pathway
    109564346 (SOS2)
  09159 Environmental adaptation
   04714 Thermogenesis
    109564346 (SOS2)
 09160 Human Diseases
  09161 Cancer: overview
   05200 Pathways in cancer
    109564346 (SOS2)
   05206 MicroRNAs in cancer
    109564346 (SOS2)
   05205 Proteoglycans in cancer
    109564346 (SOS2)
   05231 Choline metabolism in cancer
    109564346 (SOS2)
  09162 Cancer: specific types
   05210 Colorectal cancer
    109564346 (SOS2)
   05225 Hepatocellular carcinoma
    109564346 (SOS2)
   05226 Gastric cancer
    109564346 (SOS2)
   05214 Glioma
    109564346 (SOS2)
   05221 Acute myeloid leukemia
    109564346 (SOS2)
   05220 Chronic myeloid leukemia
    109564346 (SOS2)
   05211 Renal cell carcinoma
    109564346 (SOS2)
   05215 Prostate cancer
    109564346 (SOS2)
   05213 Endometrial cancer
    109564346 (SOS2)
   05224 Breast cancer
    109564346 (SOS2)
   05223 Non-small cell lung cancer
    109564346 (SOS2)
  09165 Substance dependence
   05034 Alcoholism
    109564346 (SOS2)
  09172 Infectious disease: viral
   05161 Hepatitis B
    109564346 (SOS2)
   05160 Hepatitis C
    109564346 (SOS2)
   05163 Human cytomegalovirus infection
    109564346 (SOS2)
   05165 Human papillomavirus infection
    109564346 (SOS2)
  09176 Drug resistance: antineoplastic
   01521 EGFR tyrosine kinase inhibitor resistance
    109564346 (SOS2)
   01522 Endocrine resistance
    109564346 (SOS2)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   04990 Domain-containing proteins not elsewhere classified [BR:biu04990]
    109564346 (SOS2)
Domain-containing proteins not elsewhere classified [BR:biu04990]
 Pleckstrin homology (PH) domain-containing proteins
  Dbl-Like RhoGEF family proteins
   109564346 (SOS2)
SSDB
Motif
Pfam: RasGEF RasGEF_N RhoGEF Histone PH
Other DBs
NCBI-GeneID: 109564346
NCBI-ProteinID: XP_019823385
UniProt: A0A6P5CCJ3
LinkDB
Position
10
AA seq 1348 aa
MAASPLVTEIYSLRTFVRXTDIVPNRILVHNFCLISIILLILKLRTFFFKVQEQVHPNLS
ANEESLYYIEELIFQLLNKLCMAQPRTVQDVEERVQKTFPHPIDKWAIADAQSAIEKRKR
RNPLLLPVDKIHPSLKEVLGYKVDYHVSLYIVAVLEYISADILKLAGNYVFNIRHYEISQ
QDIKVSMCADKVLMDMFDQDDIGLVSLCEDEPSSSGELNYYDLVRTEIAEERQYLRELNM
IIKVFREAFLSDRKLFKPSDIEKIFSNILDIHELTVKLLGLIEDTVEMTDESSPHPLAGS
CFEDLAEEQAFDPYETLSQDILSPKFNENFSKLMARPAVALHFQSIADGFKEAVRYVLPR
LMLVPVYHCWHYFELLKQLKIRSEEHEDRECLNQAITALMNLQGSMDRIYKQYSPRRRPG
DPVCPFYNRQLRSKHLAIKKMNEIQKNIDGWEGKDIGQCCNEFIMXXXXXRGGTNLLFDG
LMISCKPNHSQSRLPGCSSAEYRLKEKFVMRKIQICDKEDTCECKHAFELVSKDENSIIY
AAKSAEEKNNWMAALISLQYRSTLDRMLDSVLLKEENEQPLRLPSPEVYRFVVKDSEENI
VFEDNLQSRSGIPIIKGGTVVKLIERLTYHMYADPNFVRTFLTTYRSFCKPQELLSLLIE
RFEIPEPEPTEADKLALEKGEQPISTDLKRFRKEYVQPVQLRVLNVFRHWVEHHFYDFER
DLELLERLESFISSVRGKAMKKWVESIAKIIKRKKQAQANGISHNITFESPPPPIEWHIS
RPGQFETFDLMTLHPIEIARQLTLLESDLYRKVQPSELVGSVWTKEDKEINSPNLLKMIR
HTTNLTLWFEKCIVEAENFEERVAILSRIIEILQVFQDLNNFNGVLEIVSAVNSVSVYRL
DHTFEALQERKRKILDDAVELSQDHFKKYLVKLKSINPPCVPFFGIYLTNILKTEEGNND
FLKKKGKDLINFSKRRKVAEITGEIQQYQNQPYCLRIEPEMRRFFENLNPMGCASEKEFT
DYLFNKSLEIEPRNCKQPPRFPRKSAFSLKSPGIRPNTGRHGSTSGTLRGHPTPLEREPC
KISFSRIAETELESTVSAPTSPNTPSTPPVSASSDLSVFLDVDLNSSCGSNSIFAPVLLP
HSKSFFSSCGSLHKLSEEPLIPPPLPPRKKFDQDASNSKGTMKSDDDPPAIPPRQPPPPK
VKPRVPAPAGPFDGPLHSPPPPPPRDPLPDTPPPVPLRPPEHFINCPFTLQPPPLGHLHR
DPDWFRDVSTCPNSPNTPPSTPSPRVPRRCYVLSSSQNNLAHPQAPPVPPRQNSSPHLPK
LPPKTYKRELSHPPVYRLPLLENAETPQ
NT seq 4047 nt   +upstreamnt  +downstreamnt
atggctgcctctcctcttgtcacagaaatttacagcctcaggacttttgtgagastgact
gatatcgtccctaataggatccttgtacacaacttttgtttaataagtatcattttgctt
atacttaaattgagaacttttttttttaaggttcaggagcaagtacatcccaatctctca
gctaatgaagagtctctctattacattgaagagctgatttttcagctgcttaataaatta
tgcatggcccaaccaaggactgttcaagatgtggaggaacgagttcaaaagacctttcct
catccaattgataaatgggctattgctgatgcacagtctgccatagagaaacgaaaaaga
agaaatcctctcttactgcctgtggacaaaatccatccttcattgaaggaagttttaggg
tacaaagtggactaccatgtatccctgtatattgtggctgtgctggagtatatctcagct
gatattttgaaattggctggtaattatgtttttaatatccgacattatgaaatatctcaa
caagacattaaagtgtcaatgtgtgcagataaggttttgatggacatgtttgatcaggat
gacataggcttggtttctctttgtgaagatgaacctagttcttcaggcgaattaaactac
tatgaccttgttagaactgaaattgcagaagaaagacagtatctacgggaactaaatatg
atcataaaagtgtttcgagaagcttttctttctgacagaaagctgtttaaaccttctgat
attgaaaagattttcagtaacattttagatatacatgaattgactgtgaaacttttaggt
ttaattgaagacacagttgaaatgactgatgaaagtagccctcatcccttagctggcagc
tgttttgaagatctggcagaggagcaagcatttgatccttatgaaacattatcccaggat
attctttcaccaaagtttaatgaaaattttagtaagttgatggccagacctgcagtggct
ctacactttcagtccatcgctgatggttttaaagaggcagttcgttatgtccttccacgc
cttatgctggtgccagtatatcattgttggcactattttgaattattaaagcaattgaaa
atacgtagtgaagagcatgaagacagagaatgtttgaaccaagctattactgctctcatg
aatctccaaggtagtatggaccgaatttacaagcagtattcacctagacgccgacctggg
gatcctgtttgccctttttataatcgtcaattaagaagcaagcacctggctattaaaaaa
atgaatgaaattcagaaaaacatagatggatgggaaggcaaagatattggacagtgttgt
aatgaatttattatgnnnnnnnnnnnnngtcgaggcgggaccaatcttctctttgatggc
ttaatgattagctgcaaacccaatcacagccagtcacgccttccaggatgcagtagtgca
gaatacagattaaaagaaaaatttgtcatgaggaaaatacaaatatgtgataaagaagat
acttgtgagtgcaaacatgcttttgaattagtatccaaagatgaaaacagcataatatat
gctgctaagtctgccgaagagaaaaataattggatggcagcacttatttcccttcagtat
cgtagtactcttgatcgaatgctagattcagtattattgaaggaagaaaatgaacaacca
ttgagattaccaagtcctgaagtgtatcgttttgtggtaaaagactctgaggaaaacatt
gtttttgaagacaacttgcagagtagaagtggaatccccattattaaaggaggaactgtg
gtgaaattaattgaaaggctaacatatcatatgtacgcagatcccaattttgttcgtact
tttcttactacatatcgttcattttgtaaaccacaggaattactaagcttactgattgaa
cgatttgaaattccagagccagaacctactgaagcagataaattggcgttagaaaaaggc
gagcagcccatcagtacagaccttaaaaggtttcgcaaggaatacgtccaaccagtacaa
cttagggtcttaaatgtgtttcggcactgggttgaacaccatttttatgactttgaaaga
gacttggagctgcttgaaagactagaatccttcatttcaagtgtaagagggaaagctatg
aagaaatgggtagagtcaattgctaagatcatcaagaggaaaaaacaagctcaggcaaat
ggtataagccataatattacctttgaaagtccacctccaccaattgaatggcatatcagc
agaccaggacagtttgaaacatttgatctcatgacacttcatccaatagaaattgcacgt
cagctgacccttttggaatctgatctctacaggaaagttcaaccttctgaacttgtaggg
agtgtatggaccaaagaagataaagaaataaattctccaaatttattaaaaatgattcgc
cacaccacaaatctcaccctctggtttgaaaaatgcattgtggaggcagaaaattttgag
gaacgagtggcaatactgagtagaattatagaaattctgcaagtttttcaagatttgaat
aatttcaatggtgtattggagatcgtcagtgcagtaaattcagtatcagtctatagacta
gatcatacctttgaggcgttgcaggaaagaaaaaggaaaattttggatgacgctgtggaa
ttaagtcaagatcattttaaaaaatatctagtgaaacttaagtcaatcaatccaccctgt
gtgcctttttttggaatatatttaacaaatatactgaagactgaagaagggaataatgat
tttttaaaaaagaaagggaaagatttaatcaatttcagtaagaggaggaaagtagctgaa
attactggagaaattcagcagtatcaaaatcaaccttactgtttacggatagaaccagaa
atgaggagattctttgaaaaccttaaccccatgggatgtgcttctgaaaaagagtttaca
gattatttgttcaacaagtcactagaaattgaaccccgaaattgcaaacagccacctcga
tttcctaggaaatcagctttctccttaaaatctcctggaataaggcctaatacaggccga
catggctctacctcaggtactttgcgaggtcacccaacaccattagaaagagaaccgtgt
aaaataagctttagtcggattgctgaaacggagcttgaatcaacagtgtcagcaccaacc
tctccaaatacaccatctaccccaccagtgtctgcttcttcagaccttagtgtgttttta
gatgtggatctcaacagttcctgtggaagcaatagcatctttgctccagtcctcttgcct
cattcaaagtctttcttcagttcgtgtggtagtttacataagctaagtgaagagccactg
attcctcctccacttccacctcgaaaaaaatttgatcaggatgcttcaaattccaaggga
actatgaaatctgatgatgacccccctgctattccaccaagacaacctcctcctccaaag
gtaaaacccagagttcctgctcccgctggtccatttgacgggcctctgcacagtccacct
ccaccrccgccgagagaccctcttcctgatacccctccaccggttccacttcggcctcca
gaacactttataaactgtccgtttacccttcagccacctccactgggacatcttcacaga
gatccagactggttcagagacgttagtacgtgtccaaattctccgaacactcctcctagc
acaccctctccaagggtacctcgtcgatgctatgtgctcagttctagtcaaaataatctt
gctcatcctcaagctccccctgttccaccaaggcagaattcaagccctcatctaccaaaa
ctgccaccaaagacttacaaacgggagctttcacaccccccagtgtatagactgcctttg
ttagaaaatgcagaaactcctcaatga

KEGG   Bos indicus (zebu cattle): 109566002
Entry
109566002         CDS       T04792                                 

Gene name
SOS1
Definition
(RefSeq) son of sevenless homolog 1 isoform X1
  KO
K03099  son of sevenless
Organism
biu  Bos indicus (zebu cattle)
Pathway
biu01521  EGFR tyrosine kinase inhibitor resistance
biu01522  Endocrine resistance
biu04010  MAPK signaling pathway
biu04012  ErbB signaling pathway
biu04014  Ras signaling pathway
biu04062  Chemokine signaling pathway
biu04068  FoxO signaling pathway
biu04072  Phospholipase D signaling pathway
biu04150  mTOR signaling pathway
biu04151  PI3K-Akt signaling pathway
biu04510  Focal adhesion
biu04540  Gap junction
biu04630  JAK-STAT signaling pathway
biu04650  Natural killer cell mediated cytotoxicity
biu04660  T cell receptor signaling pathway
biu04662  B cell receptor signaling pathway
biu04664  Fc epsilon RI signaling pathway
biu04714  Thermogenesis
biu04722  Neurotrophin signaling pathway
biu04810  Regulation of actin cytoskeleton
biu04910  Insulin signaling pathway
biu04912  GnRH signaling pathway
biu04915  Estrogen signaling pathway
biu04917  Prolactin signaling pathway
biu04926  Relaxin signaling pathway
biu04935  Growth hormone synthesis, secretion and action
biu05034  Alcoholism
biu05160  Hepatitis C
biu05161  Hepatitis B
biu05163  Human cytomegalovirus infection
biu05165  Human papillomavirus infection
biu05200  Pathways in cancer
biu05205  Proteoglycans in cancer
biu05206  MicroRNAs in cancer
biu05210  Colorectal cancer
biu05211  Renal cell carcinoma
biu05213  Endometrial cancer
biu05214  Glioma
biu05215  Prostate cancer
biu05220  Chronic myeloid leukemia
biu05221  Acute myeloid leukemia
biu05223  Non-small cell lung cancer
biu05224  Breast cancer
biu05225  Hepatocellular carcinoma
biu05226  Gastric cancer
biu05231  Choline metabolism in cancer
Brite
KEGG Orthology (KO) [BR:biu00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04010 MAPK signaling pathway
    109566002 (SOS1)
   04012 ErbB signaling pathway
    109566002 (SOS1)
   04014 Ras signaling pathway
    109566002 (SOS1)
   04630 JAK-STAT signaling pathway
    109566002 (SOS1)
   04068 FoxO signaling pathway
    109566002 (SOS1)
   04072 Phospholipase D signaling pathway
    109566002 (SOS1)
   04151 PI3K-Akt signaling pathway
    109566002 (SOS1)
   04150 mTOR signaling pathway
    109566002 (SOS1)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    109566002 (SOS1)
   04540 Gap junction
    109566002 (SOS1)
  09142 Cell motility
   04810 Regulation of actin cytoskeleton
    109566002 (SOS1)
 09150 Organismal Systems
  09151 Immune system
   04650 Natural killer cell mediated cytotoxicity
    109566002 (SOS1)
   04660 T cell receptor signaling pathway
    109566002 (SOS1)
   04662 B cell receptor signaling pathway
    109566002 (SOS1)
   04664 Fc epsilon RI signaling pathway
    109566002 (SOS1)
   04062 Chemokine signaling pathway
    109566002 (SOS1)
  09152 Endocrine system
   04910 Insulin signaling pathway
    109566002 (SOS1)
   04912 GnRH signaling pathway
    109566002 (SOS1)
   04915 Estrogen signaling pathway
    109566002 (SOS1)
   04917 Prolactin signaling pathway
    109566002 (SOS1)
   04926 Relaxin signaling pathway
    109566002 (SOS1)
   04935 Growth hormone synthesis, secretion and action
    109566002 (SOS1)
  09156 Nervous system
   04722 Neurotrophin signaling pathway
    109566002 (SOS1)
  09159 Environmental adaptation
   04714 Thermogenesis
    109566002 (SOS1)
 09160 Human Diseases
  09161 Cancer: overview
   05200 Pathways in cancer
    109566002 (SOS1)
   05206 MicroRNAs in cancer
    109566002 (SOS1)
   05205 Proteoglycans in cancer
    109566002 (SOS1)
   05231 Choline metabolism in cancer
    109566002 (SOS1)
  09162 Cancer: specific types
   05210 Colorectal cancer
    109566002 (SOS1)
   05225 Hepatocellular carcinoma
    109566002 (SOS1)
   05226 Gastric cancer
    109566002 (SOS1)
   05214 Glioma
    109566002 (SOS1)
   05221 Acute myeloid leukemia
    109566002 (SOS1)
   05220 Chronic myeloid leukemia
    109566002 (SOS1)
   05211 Renal cell carcinoma
    109566002 (SOS1)
   05215 Prostate cancer
    109566002 (SOS1)
   05213 Endometrial cancer
    109566002 (SOS1)
   05224 Breast cancer
    109566002 (SOS1)
   05223 Non-small cell lung cancer
    109566002 (SOS1)
  09165 Substance dependence
   05034 Alcoholism
    109566002 (SOS1)
  09172 Infectious disease: viral
   05161 Hepatitis B
    109566002 (SOS1)
   05160 Hepatitis C
    109566002 (SOS1)
   05163 Human cytomegalovirus infection
    109566002 (SOS1)
   05165 Human papillomavirus infection
    109566002 (SOS1)
  09176 Drug resistance: antineoplastic
   01521 EGFR tyrosine kinase inhibitor resistance
    109566002 (SOS1)
   01522 Endocrine resistance
    109566002 (SOS1)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   04990 Domain-containing proteins not elsewhere classified [BR:biu04990]
    109566002 (SOS1)
Domain-containing proteins not elsewhere classified [BR:biu04990]
 Pleckstrin homology (PH) domain-containing proteins
  Dbl-Like RhoGEF family proteins
   109566002 (SOS1)
SSDB
Motif
Pfam: RasGEF RasGEF_N RhoGEF Histone PH PH_19 PH_10 IQ_SEC7_PH PH_13
Other DBs
NCBI-GeneID: 109566002
NCBI-ProteinID: XP_019825592
UniProt: A0A6P5CL23
LinkDB
Position
11
AA seq 1333 aa
MQAQQLPYEFFSEENAPKWRGLLVPALKKVQGQVHPTLESSDDALQYVEELILQLLNMLC
QAQPRSASDVEERVQKSFPHPIDKWAIADAQSAIEKRKRRNPLSLPVEKIHPLLKEVLGY
KIDHQVSVYIVAVLEYISADILKLVGNYVRNIRHYEITKQDIKVAMCADKVLMDMFHQDV
EDINILSLTDEEPSTSGEQTYYDLVKAFMAEIRQYIRELNLIIKVFREPFVSNSKLFSAN
DVENIFSRIVDIHELSVKLLGHIEDTVEMTDEGSPHPLVGSCFEDLAEELAFDPYESYAR
DILRPGFHDRFLSQLSKPGAALYLQSIGEGFKEAVQYVLPRLLLAPVYHCLHYFELLKQL
EEKSEDQEDKECLKQAITALLNVQSGMEKICSKSLAKRRLSESACRFYSQQMKGKQLAIK
KMNEIQKNIDGWEGKDIGQCCNEFIMEGTLTRVGAKHERHIFLFDGLMICCKSNHGQPRL
PGASNAEYRLKEKFFMRKVQINDKDDTXEYKHAFEIILKDENSVIFSAKSAEEKNNWMAA
LISLQYRSTLERMLDVTMLQEEKEEQMRLPSADVYRFAEPDSEENIIFEENVQPKAGIPI
IKAGTVIKLIERLTYHMYADPNFVRTFLTTYRSFCKPQELLSLIIERFEIPEPEPTEADR
IAIENGDQPLSAELKRFRKEYIQPVQLRVLNVCRHWVEHHFYDFERDADLLQRMEEFIGT
VRGKAMKKWVESITKIIQRKKIARDNGPGHNITFQSSPPTVEWHISRPGHIETFDLLTLH
PIEIARQLTLLESDLYRAVQPSELVGSVWTKEDKEINSPNLLKMIRHTTNLTLWFEKCIV
ETENLEERVAVVSRIIEILQVFQELNNFNGVLEVVSAMNSSPVYRLDHTFEQIPSRQKKI
LEEAHELSEDHYKKYLAKLRSINPPCVPFFGIYLTNILKTEEGNPEVLKRHGKELINFSK
RRKVAEITGEIQQYQNQPYCLRVESDIKRFFENLNPMGNSMEKEFTDYLFNKSLEIEPRN
PKPLPRFPKKYNYPLKSPGVRPSNPRPGTMRHPTPLQQEPRKISYSRIPESETESTASAP
NSPRTPLTPPPASGASSTTDVCSVFDSDHSSPFHSSSDTVFIQVTLPHGPRSASVSSISL
TKSTDEVPVPPPVPPRRRPESAPAESSPSKMMSKHLDSPPAIPPRQPTSKAYSPRYSISD
RTSISDPPESPPLLPPREPVRTPDVFSSSPLHLQPPPLGKKSDHSNAFFPNSPSPFTPPP
PQTPSPHGTRRHLPSPPLTQEVDLHSIAGPPVPPRQSTSQHIPKLPPKTYKREHTHPSMH
RDGPPLLENAHSS
NT seq 4002 nt   +upstreamnt  +downstreamnt
atgcaggcgcagcagctgccgtacgagtttttcagcgaggagaacgcgcccaagtggcgg
gggctgctggtgcctgcgctgaaaaaggttcaggggcaagttcatccaactcttgagtct
agtgatgatgctcttcagtatgttgaagaattaattttgcagctattaaatatgctctgc
caagctcaaccccgaagtgcttcagatgtagaggaacgtgttcaaaaaagtttccctcat
ccaattgataagtgggcaatagctgatgcccaatcggctattgaaaagaggaagcgaaga
aaccctttatctctcccagtagaaaaaatccatcctttgttaaaggaggtcctaggttat
aaaattgaccaccaggtttctgtttacatagtagcagtattagaatacatttctgcagac
attttaaagctggtggggaattatgtgcgaaatatacggcattatgaaattacaaaacaa
gatattaaagtggcaatgtgtgctgataaggtattaatggatatgtttcatcaagatgtg
gaagatataaatatattatctttaactgatgaagagccttccacctcaggagagcagact
tactatgatttggtaaaagcatttatggcagaaattcgacaatacataagggaacttaat
ttaattataaaagtttttagagagccctttgtctccaattcaaaattgttttcagctaat
gatgtagaaaatatatttagtcgtatagtggatatacatgaacttagtgtaaaattactg
ggccatatagaagatactgtggaaatgacagatgaaggcagtccccatccattagtagga
agctgctttgaagacttagcggaggaactggcatttgatccatatgaatcatatgctcga
gatattctacgacctggttttcacgatcgtttccttagtcagttatcaaagcctggagca
gcgctctatttgcagtcaataggcgaaggtttcaaagaagctgttcaatatgttttaccc
agactacttctagcccctgtttaccactgtctgcattacttcgaacttctgaagcagtta
gaagaaaagagtgaggatcaagaagacaaagaatgtttgaaacaagcaataacagctttg
cttaatgttcagagtggtatggaaaaaatatgttctaaaagtcttgcaaaacgaagactt
agtgaatctgcatgtcggttttatagtcagcaaatgaaggggaaacaactagcaatcaag
aaaatgaacgagattcagaagaatattgatggttgggagggaaaagacattggacagtgt
tgcaatgagtttataatggaaggaactcttacacgtgtaggagccaaacatgagagacac
atatttctctttgatggcttaatgatttgctgtaaatcaaatcatgggcagccaagacta
cctggtgctagcaatgcagaatatcgtcttaaagaaaagttttttatgagaaaggtacaa
attaatgacaaagatgacaccartgagtacaagcatgcttttgaaataattttaaaagat
gaaaatagtgttatattttctgccaagtcagctgaagagaagaacaattggatggcagca
ttgatatctttacagtaccggagtaccctggaaagaatgcttgatgtgaccatgctgcag
gaagaaaaggaggagcagatgaggctccctagtgctgacgtttatagatttgcagagcct
gactctgaagaaaatatcatatttgaagaaaacgtgcagcccaaagctggaattccaatt
atcaaggcaggaaccgttattaaacttatagagaggctcacataccacatgtatgcagat
cccaattttgttcggacatttcttacaacatatagatccttttgtaaacctcaagaacta
ctgagtcttataatagaaaggtttgaaattccagagcctgagccaacagaagctgatcgc
atcgctatagagaatggggatcaacccttgagtgcagaactaaagagatttagaaaagaa
tatatacagcctgtgcagctgcgagtattaaatgtatgtcggcactgggtagagcaccac
ttctatgattttgaaagagatgcagatcttttgcagcgaatggaggaatttattggaacc
gtaagaggtaaagcaatgaaaaaatgggttgaatccatcactaaaataatccaaaggaaa
aaaattgccagagacaatggaccaggtcataatattacatttcagagttcacctcctacg
gttgagtggcatataagcagacctgggcacatcgagacttttgacctgctcaccttacac
ccaatagaaattgctcgacaactcactttacttgaatcagatctatatcgagctgtacag
ccatcagaattagttggaagtgtgtggacaaaagaagacaaagaaattaattctcctaat
cttctgaaaatgatccggcacaccactaatctcactctgtggtttgagaaatgtattgta
gaaactgaaaacttagaagaaagagtagctgtggtgagtcgaataattgagattctacaa
gtgttccaagagctcaacaacttcaatggtgttcttgaggttgtcagtgctatgaactca
tcccctgtttacagactagaccacacgtttgagcaaataccaagtcgccaaaagaaaatt
ttagaagaagctcatgaactaagtgaagatcactataagaaatatttggcaaaactcagg
tctattaatccaccatgtgtgcctttctttggaatttatctcactaatatcttgaaaaca
gaagaaggcaaccctgaggtcctaaaacggcatggaaaagagcttataaactttagcaaa
aggaggaaagtagcagaaataacaggcgagatccagcagtaccaaaatcagccttactgt
ttacgagtagagtcagatatcaaaaggttttttgaaaacttgaatccaatgggaaatagc
atggaaaaagaatttacggattatcttttcaacaaatccctagaaatagaaccacgaaac
cctaagcctcttccaagatttccaaaaaaatacaactatcccctaaaatctcctggtgtt
cgtccatcaaacccaagaccaggtactatgagacatcccacacctctgcagcaagagcca
aggaaaattagttatagtaggatccctgaaagtgaaacagaaagtacagcatctgcacca
aattctccaaggacaccgttaacacctcctcctgcttctggtgcttctagtaccacagat
gtttgcagcgtatttgattctgatcattcaagcccttttcactcaagcagcgataccgtc
tttatccaagttacactgccccatggcccaagatctgcttcagtatcatctataagttta
accaagagcactgatgaagtgcctgtcccccctcctgttcctccacgaagacggccagag
tctgccccagcggaatcttcaccatctaagatgatgtctaagcatttggacagcccccca
gcaattcctcctaggcaacccacatcaaaagcctattcaccacggtattcaatatcagac
cggacctctatatcagaccctcctgaaagccctcccttattacctccacgagaacctgtg
aggacacctgatgttttctcaagttcaccactacatctccaacctccccctctgggcaaa
aaaagtgaccatagtaatgccttcttcccaaacagcccctccccctttacaccacctcct
cctcaaacaccttctcctcatggaacaagaaggcatctgccgtcaccaccactgacacaa
gaagtggacctccattccattgctgggccgcctgttcctccacgacaaagcacttctcag
catatccctaaactccctccaaaaacttacaaaagggagcacacccacccgtccatgcac
agagacgggccaccactgttggagaatgcccattcttcctga

DBGET integrated database retrieval system