KEGG   Camelus ferus (Wild Bactrian camel): 102507247
Entry
102507247         CDS       T02979                                 

Gene name
SOS1
Definition
(RefSeq) son of sevenless homolog 1 isoform X1
  KO
K03099  son of sevenless
Organism
cfr  Camelus ferus (Wild Bactrian camel)
Pathway
cfr01521  EGFR tyrosine kinase inhibitor resistance
cfr01522  Endocrine resistance
cfr04010  MAPK signaling pathway
cfr04012  ErbB signaling pathway
cfr04014  Ras signaling pathway
cfr04062  Chemokine signaling pathway
cfr04068  FoxO signaling pathway
cfr04072  Phospholipase D signaling pathway
cfr04150  mTOR signaling pathway
cfr04151  PI3K-Akt signaling pathway
cfr04510  Focal adhesion
cfr04540  Gap junction
cfr04630  JAK-STAT signaling pathway
cfr04650  Natural killer cell mediated cytotoxicity
cfr04660  T cell receptor signaling pathway
cfr04662  B cell receptor signaling pathway
cfr04664  Fc epsilon RI signaling pathway
cfr04714  Thermogenesis
cfr04722  Neurotrophin signaling pathway
cfr04810  Regulation of actin cytoskeleton
cfr04910  Insulin signaling pathway
cfr04912  GnRH signaling pathway
cfr04915  Estrogen signaling pathway
cfr04917  Prolactin signaling pathway
cfr04926  Relaxin signaling pathway
cfr04935  Growth hormone synthesis, secretion and action
cfr05034  Alcoholism
cfr05160  Hepatitis C
cfr05161  Hepatitis B
cfr05163  Human cytomegalovirus infection
cfr05165  Human papillomavirus infection
cfr05200  Pathways in cancer
cfr05205  Proteoglycans in cancer
cfr05206  MicroRNAs in cancer
cfr05210  Colorectal cancer
cfr05211  Renal cell carcinoma
cfr05213  Endometrial cancer
cfr05214  Glioma
cfr05215  Prostate cancer
cfr05220  Chronic myeloid leukemia
cfr05221  Acute myeloid leukemia
cfr05223  Non-small cell lung cancer
cfr05224  Breast cancer
cfr05225  Hepatocellular carcinoma
cfr05226  Gastric cancer
cfr05231  Choline metabolism in cancer
Brite
KEGG Orthology (KO) [BR:cfr00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04010 MAPK signaling pathway
    102507247 (SOS1)
   04012 ErbB signaling pathway
    102507247 (SOS1)
   04014 Ras signaling pathway
    102507247 (SOS1)
   04630 JAK-STAT signaling pathway
    102507247 (SOS1)
   04068 FoxO signaling pathway
    102507247 (SOS1)
   04072 Phospholipase D signaling pathway
    102507247 (SOS1)
   04151 PI3K-Akt signaling pathway
    102507247 (SOS1)
   04150 mTOR signaling pathway
    102507247 (SOS1)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    102507247 (SOS1)
   04540 Gap junction
    102507247 (SOS1)
  09142 Cell motility
   04810 Regulation of actin cytoskeleton
    102507247 (SOS1)
 09150 Organismal Systems
  09151 Immune system
   04650 Natural killer cell mediated cytotoxicity
    102507247 (SOS1)
   04660 T cell receptor signaling pathway
    102507247 (SOS1)
   04662 B cell receptor signaling pathway
    102507247 (SOS1)
   04664 Fc epsilon RI signaling pathway
    102507247 (SOS1)
   04062 Chemokine signaling pathway
    102507247 (SOS1)
  09152 Endocrine system
   04910 Insulin signaling pathway
    102507247 (SOS1)
   04912 GnRH signaling pathway
    102507247 (SOS1)
   04915 Estrogen signaling pathway
    102507247 (SOS1)
   04917 Prolactin signaling pathway
    102507247 (SOS1)
   04926 Relaxin signaling pathway
    102507247 (SOS1)
   04935 Growth hormone synthesis, secretion and action
    102507247 (SOS1)
  09156 Nervous system
   04722 Neurotrophin signaling pathway
    102507247 (SOS1)
  09159 Environmental adaptation
   04714 Thermogenesis
    102507247 (SOS1)
 09160 Human Diseases
  09161 Cancer: overview
   05200 Pathways in cancer
    102507247 (SOS1)
   05206 MicroRNAs in cancer
    102507247 (SOS1)
   05205 Proteoglycans in cancer
    102507247 (SOS1)
   05231 Choline metabolism in cancer
    102507247 (SOS1)
  09162 Cancer: specific types
   05210 Colorectal cancer
    102507247 (SOS1)
   05225 Hepatocellular carcinoma
    102507247 (SOS1)
   05226 Gastric cancer
    102507247 (SOS1)
   05214 Glioma
    102507247 (SOS1)
   05221 Acute myeloid leukemia
    102507247 (SOS1)
   05220 Chronic myeloid leukemia
    102507247 (SOS1)
   05211 Renal cell carcinoma
    102507247 (SOS1)
   05215 Prostate cancer
    102507247 (SOS1)
   05213 Endometrial cancer
    102507247 (SOS1)
   05224 Breast cancer
    102507247 (SOS1)
   05223 Non-small cell lung cancer
    102507247 (SOS1)
  09165 Substance dependence
   05034 Alcoholism
    102507247 (SOS1)
  09172 Infectious disease: viral
   05161 Hepatitis B
    102507247 (SOS1)
   05160 Hepatitis C
    102507247 (SOS1)
   05163 Human cytomegalovirus infection
    102507247 (SOS1)
   05165 Human papillomavirus infection
    102507247 (SOS1)
  09176 Drug resistance: antineoplastic
   01521 EGFR tyrosine kinase inhibitor resistance
    102507247 (SOS1)
   01522 Endocrine resistance
    102507247 (SOS1)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   04990 Domain-containing proteins not elsewhere classified [BR:cfr04990]
    102507247 (SOS1)
Domain-containing proteins not elsewhere classified [BR:cfr04990]
 Pleckstrin homology (PH) domain-containing proteins
  Dbl-Like RhoGEF family proteins
   102507247 (SOS1)
SSDB
Motif
Pfam: RasGEF RasGEF_N RhoGEF Histone PH PH_19 PH_10 IQ_SEC7_PH PH_13
Other DBs
NCBI-GeneID: 102507247
NCBI-ProteinID: XP_032353269
LinkDB
Position
15
AA seq 1333 aa
MQAQQLPYEFFSEENAPKWRGLLVPALKKVQGQVHPTLESSDDALQYVEELILQLLNMLC
QAQPRSASDVEERVQKSFPHPIDKWAIADAQSAIEKRKRRNPLSLPVEKIHPLLKEVLGY
KIDHQVSVYIVAVLEYISADILKLVGNYVRNIRHYEITKQDIKVAMCADKVLMDMFHQDV
EDINILSLTDEEPSTSGEQTYYDLVKAFMAEIRQYIRELNLIIKVFREPFVSNSKLFSAN
DVENIFSRIVDIHELSVKLLGHIEDTVEMTDEGSPHPLVGSCFEDLAEELAFDPYESYAR
DILRPGFHDRFLSQLSKPGAALYLQSIGEGFKEAVQYVLPRLLLAPVYHCLHYFELLKQL
EEKSEDQEDKECLKQAITALLNVQSGMEKICSKSLAKRRLSESACRFYSQQMKGKQLAIK
KMNEIQKNIDGWEGKDIGQCCNEFIMEGTLTRVGAKHERHIFLFDGLMICCKSNHGQPRL
PGASNAEYRLKEKFFMRKVQINDKDDTSEYKHAFEIILKDENSVIFSAKSAEEKNNWMAA
LISLQYRSTLERMLDVTMLQEEKEEQMRLPSADVYRFAEPDSEENIIFEENMQPKAGIPI
IKAGTVVKLIERLTYHMYADPNFVRTFLTTYRSFCKPQELLSLIIERFEIPEPEPTEADR
IAIENGDQPLSAELKRFRKEYIQPVQLRVLNVCRHWVEHHFYDFERDADLLQRMEEFIGT
VRGKAMKKWVESITKIIQRKKIARDNGPGHNITFQSSPPTVEWHISRPGHIETFDLLTLH
PIEIARQLTLLESDLYRAVQPSELVGSVWTKEDKEINSPNLLKMIRHTTNLTLWFEKCIV
ETENLEERVAVVSRIIEILQVFQELNNFNGVLEVVSAMNSSPVYRLDHTFEQIPSRQKKI
LEEAHELSEDHYKKYLAKLRSINPPCVPFFGIYLTNILKTEEGNPEVLKRHGKELINFSK
RRKVAEITGEIQQYQNQPYCLRVESDIKKFFENLNPMGNSMEKEFTDYLFNKSLEIEPRN
PKPLPRFPKKYSYPLKSPGVRPSNPRPGTMRHPTPLQQEPRKISYSRIPESETESTASAP
NSPRTPLTPPPASGASSTTDVCSVFDSDHSSPFHSSSDTVFIQVTLPHGPRSASVSSISL
TKGTDEVPVPPPVPPRRRPESAPAESSPSKIMSKHLDSPPAIPPRQPTSKVYSPRYSISD
RTSISDPPESPPLLPPREPVRTPDVFSSSPLHLQPPPLGKKSDHSNAFFPNSPSPFTPPP
PQTPSPHGTRRHLPSPPLTQEVDLHSIAGPPVPPRQSTSQHIPKLPPKTYKREHTHPSMH
RDGPPLLENAHSS
NT seq 4002 nt   +upstreamnt  +downstreamnt
atgcaggcgcagcagctgccgtacgagtttttcagcgaggagaacgcgcctaagtggcgg
ggactgctggtgcctgcgctgaaaaaggtccaggggcaagttcatcccactctcgagtct
agtgatgatgctcttcagtatgttgaagaattaattttgcaattattaaatatgctttgc
caagctcagccccgaagtgcttcagatgtagaggaacgtgttcaaaaaagtttccctcat
ccaattgataagtgggcaatagcagatgcccaatcggctattgaaaagaggaagcgaaga
aaccctttatctctcccagtagaaaaaattcatcctttgttaaaggaggtcttaggttat
aaaattgaccaccaggtttctgtttacatagtagcagtattagaatacatttctgcagac
attttaaaactggtggggaattatgtgcgaaatatacggcattatgaaattacaaaacaa
gatattaaagtggcaatgtgtgccgataaggtattgatggatatgtttcatcaagatgta
gaagatataaatatattatctttaactgatgaagagccttccacctcaggagagcaaact
tactatgatttggtaaaagcatttatggcagaaattcgacagtacataagggaactaaat
ttaattataaaagttttcagagagccctttgtctccaattcaaaactgttttcagctaat
gatgtagaaaatatatttagtcgtatagtagatatacatgaacttagtgtaaaattactg
ggccatatagaagatactgtagaaatgacagatgaaggcagtccccatccattagtagga
agctgctttgaagacttagcagaggaactggcatttgatccatatgaatcatatgctcga
gatattttacgacctggttttcatgatcgtttccttagtcagttatcaaagcctggagcg
gccctctacttgcagtcaataggtgaaggtttcaaagaagctgttcagtatgttttaccc
aggctgcttctagctcctgtttaccactgtctacattactttgaacttttgaagcagtta
gaagaaaagagtgaagatcaagaagacaaggaatgtttgaaacaagctataacagcttta
ctgaatgttcagagtggtatggaaaaaatatgttctaaaagtcttgcaaagcgaagactg
agtgaatctgcatgtcggttttacagtcagcaaatgaaggggaaacaactagcaatcaag
aaaatgaacgaaattcagaagaatattgatggttgggagggaaaagacattggacagtgt
tgcaatgagtttataatggaaggaactcttactcgtgtaggagccaaacatgagagacac
atatttctctttgatggcttaatgatttgctgtaaatcaaatcatgggcagccaagactt
cctggtgctagcaatgcagaatatcgtcttaaagaaaagttttttatgagaaaggtacaa
attaatgacaaagatgacaccagtgagtacaagcatgcttttgaaataattttaaaagat
gaaaatagtgttatattttctgccaagtcagctgaagagaagaacaattggatggcagca
ttgatatctttacagtaccggagtacactggaaaggatgcttgatgtgacaatgctacag
gaagagaaggaggagcagatgaggctccctagtgctgatgtttatagatttgcagagcct
gactctgaagaaaatatcatatttgaagaaaacatgcagcccaaagctggaattccaatt
atcaaagcaggaactgttgttaaacttatagagaggctcacataccatatgtacgcagat
cccaattttgttcggacatttcttacaacatatagatcgttttgtaaacctcaagaacta
ctgagtcttataatagaaaggtttgaaattccagagcctgagccaacagaagctgatcgc
atagctatagaaaatggagatcagcccttgagtgcagaactaaaaaggtttagaaaagaa
tatatacaacctgtacaactgcgagtattaaacgtatgtcggcactgggtagagcaccac
ttctatgattttgaaagagatgcagatcttttgcagcgaatggaggaatttattggaaca
gtaagaggtaaagcaatgaaaaaatgggttgaatccatcactaaaataatccaaaggaaa
aaaattgcaagagacaatggaccaggtcataatattacatttcagagttcacctcccaca
gttgagtggcatataagcagacctgggcatatagagacttttgacctgctcaccttacat
ccaatagaaattgctcgacaactcactttacttgaatcagatctatatcgagctgtacag
ccatcagaattagtcggaagtgtgtggacaaaagaagacaaagaaattaattctcctaat
cttctgaaaatgatccggcataccactaatctcactctgtggtttgagaaatgtattgta
gaaactgaaaacttagaagaaagagtagcagtggtaagtcgaataattgagattctgcaa
gtctttcaagagctgaacaacttcaatggtgtccttgaggttgtcagtgctatgaactca
tcacctgtttacaggctagaccacacattcgagcaaataccaagtcgccaaaagaaaatt
ttagaagaagctcatgaactaagtgaagatcactataagaaatatttggcaaaactcagg
tctattaatccaccatgtgtgcctttctttggaatttatctaactaatatcttgaaaaca
gaagaaggcaaccctgaggtcctgaaaaggcatggaaaagagcttataaactttagcaaa
aggaggaaagtggcagaaataacaggcgagatccagcagtaccaaaatcagccttactgt
ttacgagtagaatcagatatcaaaaagttctttgaaaacttgaatccaatgggaaatagc
atggaaaaagaatttacagattatcttttcaacaaatccctagaaatagaaccacgaaac
cctaaacctctcccgagatttccaaaaaaatatagctaccccctaaaatctcctggtgtt
cgtccatcaaacccaagaccaggtaccatgagacatcccacacctctgcagcaggagcca
aggaaaattagttatagtaggatccctgaaagtgaaacagaaagtacagcatctgcacca
aattctccaagaacaccgttaacacctcctcctgcttctggtgcttctagtaccacagat
gtttgcagcgtatttgattctgatcattcaagcccttttcattcaagcagcgataccgtc
tttatccaagttacactgccccatggcccaagatctgcttcagtatcatctataagttta
actaaaggcactgacgaagtgcctgtcccccctcctgttcctccacgaagacgaccagaa
tctgccccagcggaatcttcgccatctaagattatgtctaagcatttggacagcccccca
gcaattcctcctaggcaacccacatcaaaagtgtattcaccacgatactcaatatcagat
cggacttctatatcagaccctcctgaaagcccacccttactaccaccacgggaacccgtg
aggacacctgatgttttctcaagctcacctctacatctccagcctccccccttgggcaaa
aaaagtgaccatagtaatgccttcttcccaaacagcccttccccctttacaccacctcct
cctcaaacaccttctcctcacggcacgagaaggcatctgccatcaccaccattgacacaa
gaagtggacctgcattccattgctgggccgcctgttcctccacgacaaagcacttctcaa
catatccctaaactcccaccaaaaacttacaaaagggagcacacacacccatccatgcac
agagatggaccaccactgctggagaatgcccattcttcctga

KEGG   Camelus ferus (Wild Bactrian camel): 102517014
Entry
102517014         CDS       T02979                                 

Gene name
SOS2
Definition
(RefSeq) son of sevenless homolog 2
  KO
K03099  son of sevenless
Organism
cfr  Camelus ferus (Wild Bactrian camel)
Pathway
cfr01521  EGFR tyrosine kinase inhibitor resistance
cfr01522  Endocrine resistance
cfr04010  MAPK signaling pathway
cfr04012  ErbB signaling pathway
cfr04014  Ras signaling pathway
cfr04062  Chemokine signaling pathway
cfr04068  FoxO signaling pathway
cfr04072  Phospholipase D signaling pathway
cfr04150  mTOR signaling pathway
cfr04151  PI3K-Akt signaling pathway
cfr04510  Focal adhesion
cfr04540  Gap junction
cfr04630  JAK-STAT signaling pathway
cfr04650  Natural killer cell mediated cytotoxicity
cfr04660  T cell receptor signaling pathway
cfr04662  B cell receptor signaling pathway
cfr04664  Fc epsilon RI signaling pathway
cfr04714  Thermogenesis
cfr04722  Neurotrophin signaling pathway
cfr04810  Regulation of actin cytoskeleton
cfr04910  Insulin signaling pathway
cfr04912  GnRH signaling pathway
cfr04915  Estrogen signaling pathway
cfr04917  Prolactin signaling pathway
cfr04926  Relaxin signaling pathway
cfr04935  Growth hormone synthesis, secretion and action
cfr05034  Alcoholism
cfr05160  Hepatitis C
cfr05161  Hepatitis B
cfr05163  Human cytomegalovirus infection
cfr05165  Human papillomavirus infection
cfr05200  Pathways in cancer
cfr05205  Proteoglycans in cancer
cfr05206  MicroRNAs in cancer
cfr05210  Colorectal cancer
cfr05211  Renal cell carcinoma
cfr05213  Endometrial cancer
cfr05214  Glioma
cfr05215  Prostate cancer
cfr05220  Chronic myeloid leukemia
cfr05221  Acute myeloid leukemia
cfr05223  Non-small cell lung cancer
cfr05224  Breast cancer
cfr05225  Hepatocellular carcinoma
cfr05226  Gastric cancer
cfr05231  Choline metabolism in cancer
Brite
KEGG Orthology (KO) [BR:cfr00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04010 MAPK signaling pathway
    102517014 (SOS2)
   04012 ErbB signaling pathway
    102517014 (SOS2)
   04014 Ras signaling pathway
    102517014 (SOS2)
   04630 JAK-STAT signaling pathway
    102517014 (SOS2)
   04068 FoxO signaling pathway
    102517014 (SOS2)
   04072 Phospholipase D signaling pathway
    102517014 (SOS2)
   04151 PI3K-Akt signaling pathway
    102517014 (SOS2)
   04150 mTOR signaling pathway
    102517014 (SOS2)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    102517014 (SOS2)
   04540 Gap junction
    102517014 (SOS2)
  09142 Cell motility
   04810 Regulation of actin cytoskeleton
    102517014 (SOS2)
 09150 Organismal Systems
  09151 Immune system
   04650 Natural killer cell mediated cytotoxicity
    102517014 (SOS2)
   04660 T cell receptor signaling pathway
    102517014 (SOS2)
   04662 B cell receptor signaling pathway
    102517014 (SOS2)
   04664 Fc epsilon RI signaling pathway
    102517014 (SOS2)
   04062 Chemokine signaling pathway
    102517014 (SOS2)
  09152 Endocrine system
   04910 Insulin signaling pathway
    102517014 (SOS2)
   04912 GnRH signaling pathway
    102517014 (SOS2)
   04915 Estrogen signaling pathway
    102517014 (SOS2)
   04917 Prolactin signaling pathway
    102517014 (SOS2)
   04926 Relaxin signaling pathway
    102517014 (SOS2)
   04935 Growth hormone synthesis, secretion and action
    102517014 (SOS2)
  09156 Nervous system
   04722 Neurotrophin signaling pathway
    102517014 (SOS2)
  09159 Environmental adaptation
   04714 Thermogenesis
    102517014 (SOS2)
 09160 Human Diseases
  09161 Cancer: overview
   05200 Pathways in cancer
    102517014 (SOS2)
   05206 MicroRNAs in cancer
    102517014 (SOS2)
   05205 Proteoglycans in cancer
    102517014 (SOS2)
   05231 Choline metabolism in cancer
    102517014 (SOS2)
  09162 Cancer: specific types
   05210 Colorectal cancer
    102517014 (SOS2)
   05225 Hepatocellular carcinoma
    102517014 (SOS2)
   05226 Gastric cancer
    102517014 (SOS2)
   05214 Glioma
    102517014 (SOS2)
   05221 Acute myeloid leukemia
    102517014 (SOS2)
   05220 Chronic myeloid leukemia
    102517014 (SOS2)
   05211 Renal cell carcinoma
    102517014 (SOS2)
   05215 Prostate cancer
    102517014 (SOS2)
   05213 Endometrial cancer
    102517014 (SOS2)
   05224 Breast cancer
    102517014 (SOS2)
   05223 Non-small cell lung cancer
    102517014 (SOS2)
  09165 Substance dependence
   05034 Alcoholism
    102517014 (SOS2)
  09172 Infectious disease: viral
   05161 Hepatitis B
    102517014 (SOS2)
   05160 Hepatitis C
    102517014 (SOS2)
   05163 Human cytomegalovirus infection
    102517014 (SOS2)
   05165 Human papillomavirus infection
    102517014 (SOS2)
  09176 Drug resistance: antineoplastic
   01521 EGFR tyrosine kinase inhibitor resistance
    102517014 (SOS2)
   01522 Endocrine resistance
    102517014 (SOS2)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   04990 Domain-containing proteins not elsewhere classified [BR:cfr04990]
    102517014 (SOS2)
Domain-containing proteins not elsewhere classified [BR:cfr04990]
 Pleckstrin homology (PH) domain-containing proteins
  Dbl-Like RhoGEF family proteins
   102517014 (SOS2)
SSDB
Motif
Pfam: RasGEF RasGEF_N RhoGEF Histone PH IQ_SEC7_PH PH_19 PH_13
Other DBs
NCBI-GeneID: 102517014
NCBI-ProteinID: XP_032337870
LinkDB
Position
6
AA seq 1332 aa
MQQAPQPYEFFSEENSPKWRGLLVSALRKVQEQVHPNLSANEESLYYIEELIFQLLNKLC
MAQPRTVQDVEERVQKTFPHPIDKWAIADAQSAIEKRKRRNPLLLPVDKIHPSLKEVLGY
KVDYHVSLYIVAVLEYISADILKLAGNYVFNIRHYEISQQDIEVSMCADKVLMDMFDQDD
IGLVSLCEDEPSSSGELNYYDLVRTEIAEERQYLRELNMIIKVFREAFISDRKLFKPSDI
EKIFSNILDIHELTVKLLGLIEDTVEMTDESSPHPLAGSCFEDLAEEQAFDPYETLSQDI
LSPKFNEHFSKLMARPAVALHFQSIADGFKEAVRYVLPRLMLVPVYHCWHYFELLKQLKA
CSEEHEDRECLNQAITALMNLQGSMDRIYKQYSPRRRPGDPVCPFYNRQLRSKHLAIKKM
NEIQKNIDGWEGKDIGQCCNEFIMEGPLTRIGAKHERHIFLFDGLMISCKPNHSQSRLPG
YSSAEYRLKEKFVMRKIQICDKEDTCECKHAFELVSKDENSIIFAAKSAEEKNNWMAALI
SLHYRSTLDRMLDSVLLKEENEQPLRLPSPEVYRFVVKDSEENIVFEDNVQSRSGIPIIK
GGTVVKLIERLTYHMYADPNFVRTFLTTYRSFCKPQELLSLLIERFEIPEPEPTEADKLA
VEKGEQPISADLKRFRKEYVQPVQLRILNVFRHWVEHHFYDFERDLELLERLESFISSVR
GKAMKKWVESIAKIIKRKKQAQANGISHNITFESPPPPIEWHISRPGQFETFDLMTLHPI
EIARQLTLLESDLYRKVQPSELVGSVWTKEDKEINSPNLLKMIRHTTNLTLWFEKCIVEA
ENFEERVAVLSRIIEILQVFQDLNNFNGVLEIVSAVNSVSVYRLDHTFEALQERKRKILD
EAVELSQDHFKKYLVKLKSINPPCVPFFGIYLTNILKTEEGNNDFLKKKGKDLINFSKRR
KVAEITGEIQQYQNQPYCLRIEPEMRRFFENLNPMGSASEKEFTDYLFNKSLEIEPRNCK
QPPRFPRKSTFSLKSPGIRPNTGRHGSTSGTLRGHPTPLEREPCKISFSRIAETELESTV
SAPTSPNTPSTPPVSASSDLSVFLDVDLNSSCGSNSIFAPVLLPHSKSFFSSCGSLHKLS
EEPLIPPPLPPRKKFDHDASNSKGNTKSDDDPPAIPPRQPPPPKVKPRVPAPTGALDGPL
HSPPPPPPRDPLPDTPPPVPLRPPEHFINCPFTLQPPPLGHLHRDPDWFRDVSTCPNSPN
TPPSTPSPRVPRRCYVLSSSQNNLAHPQAPPVPPRQNSSPHLPKLPPKTYKRELSHPPLY
RLPLLENAETPQ
NT seq 3999 nt   +upstreamnt  +downstreamnt
atgcagcaggcgccgcagccttacgagttcttcagcgaagagaacagtccgaaatggcgg
ggactgttggtctcggccctgcgaaaggttcaggagcaagtacatcccaatctctcagct
aatgaagagtctctctattatattgaagagctgatttttcagctgcttaataaattatgc
atggcccaaccaaggactgttcaagatgtggaggaacgagttcaaaaaaccttccctcat
ccaattgataaatgggctattgctgatgcacagtctgccatagagaaacgaaaacgaagg
aatcctctcttactgcctgtggacaaaatccatccttcattgaaggaagttttagggtac
aaagtggactaccacgtgtccctatatattgtggctgtactagagtatatctcagctgat
attttaaaattggctggtaattatgtttttaatatccgacattatgaaatatcccaacag
gacattgaagtgtcaatgtgtgcagataaggttttgatggacatgtttgatcaggatgac
ataggcttggtttctctctgtgaagatgaacctagttcttcaggtgaattaaattactat
gaccttgtcagaactgaaattgcagaagaaagacagtatctacgggaactaaatatgatc
ataaaagtgtttcgagaagcttttatttctgaccgaaagctgtttaaaccttctgatatt
gaaaagattttcagtaacattttagatatacatgaactgaccgtgaaacttttaggtttg
attgaagatacagttgaaatgactgatgaaagcagccctcatcccttagctggcagttgt
tttgaagatttggcagaagagcaagcatttgatccttatgaaacattatctcaggatatt
ctttcaccaaaatttaatgaacatttcagtaagttgatggccagacctgcagtggctcta
cactttcagtccattgctgatggttttaaagaggcagttcgttatgtccttccacgcctt
atgctggtgccagtgtatcattgttggcactattttgaattattaaagcaattgaaagca
tgtagtgaagagcacgaagacagagaatgtttgaaccaagctattactgctctaatgaat
ctccaaggtagtatggaccgaatttacaagcagtattcacctagacgccgacctggggat
cccgtttgccctttttataatcgtcaattaagaagcaagcacctggctattaaaaaaatg
aatgaaattcagaaaaacatagatggatgggaaggcaaagatattggacagtgttgtaac
gaatttattatggaaggcccattgacaagaattggtgctaaacatgaacggcatattttt
ctctttgatggcttgatgattagctgcaaacccaatcatagccagtcacgacttccagga
tacagtagtgcagaatacagattaaaagaaaaatttgtcatgaggaaaatacaaatatgt
gataaagaagatacttgtgagtgcaaacacgcctttgaattagtatccaaggatgaaaac
agcataatatttgctgctaagtctgctgaagagaaaaataattggatggcagcacttatt
tctctccattatcgtagtacccttgatcgaatgctagattcagtattattgaaggaagaa
aatgaacaaccactgagattaccaagtccagaagtgtatcgttttgtggtaaaagactct
gaggaaaacattgtttttgaagacaacgtgcaaagtagaagtggaatccccattattaaa
ggaggaactgtggtgaaattaattgaaaggttaacataccatatgtatgcagatcccaat
tttgttcgtacttttcttactacatatcgttcattttgtaaaccacaggaattgctaagc
ttactaattgaacgatttgaaattccagagccagaacctacagaagcagataaattggca
gtagagaaaggcgagcagccaatcagtgcagaccttaaaaggtttcgcaaggaatacgtc
caaccagtacaacttaggatcttaaatgtgtttcggcactgggttgaacaccatttttat
gactttgaaagagatttggagttgcttgaaagactagagtccttcatttcaagtgtaaga
gggaaagctatgaagaaatgggtagagtcaattgctaagatcatcaagaggaagaaacaa
gctcaggcaaacggaataagccataatattacatttgaaagtccacctccaccaattgaa
tggcatatcagtagaccaggacagtttgaaacatttgatctcatgacacttcatccaata
gaaattgcacgtcaactgacacttttggaatctgatctctacaggaaagtccagccttct
gaacttgtggggagtgtatggaccaaagaagataaagaaataaattctccaaatttatta
aaaatgattcgccacaccacaaatctcaccctctggtttgaaaaatgtattgtggaagca
gaaaattttgaggaacgagtggcagtcctaagtagaattatagaaattcttcaagttttt
caagatttgaataatttcaatggtgtattggaaatcgtcagtgcggtaaattcagtgtca
gtctacagactagatcatacctttgaggcattgcaagaaagaaaaaggaaaattttggat
gaagctgtggaattaagtcaagatcactttaaaaaatatctagtgaaacttaagtcaata
aatccaccttgtgtgcctttttttggaatatatttaacaaatattctgaagaccgaagaa
gggaataatgattttttaaaaaagaaagggaaagatttaatcaatttcagtaagaggagg
aaagtagctgaaattactggagaaattcagcaatatcagaatcaaccttactgtttacgg
atagaaccagaaatgaggaggttctttgaaaaccttaatccaatgggaagtgcttctgaa
aaagagtttacagattatttgttcaacaagtccctagaaattgaaccccgaaactgcaaa
cagccacctcgatttcctagaaaatcgactttctctttaaaatctcctggaataaggcct
aatacaggccgacatggctctacctcaggtactttacgaggtcatccaacaccattagaa
agagaaccatgcaaaataagctttagtcggattgctgaaacagagctggaatcaacagtg
tcagcaccaacctcccccaatacaccgtctactccaccagtatctgcttcttcagacctt
agtgtgtttttagatgtggatctcaacagctcctgtggaagcaatagcatctttgctcca
gttctcttgccacattcaaagtctttcttcagttcgtgtggtagtttacataaactaagt
gaagagccactgattcctcctccacttccgcctcgaaaaaagtttgatcacgatgcttca
aattccaagggaaatacgaaatctgatgatgacccccctgctattccaccaagacaacct
cctcctccaaaggtaaaacccagagttcctgctcctactggtgcattggacgggcctctg
catagtccacctccaccgccgccgagagatcctcttcctgatacccctccaccggttccc
cttcggcctccagaacactttataaactgtccatttactcttcagccacctccactggga
catcttcacagagatccagactggttcagagatgttagtacgtgtccaaactctccaaac
actcctcctagcacaccctctccaagggtaccacgtcgatgctacgtgctcagttctagt
caaaataatcttgctcatcctcaggctccccctgttccaccaaggcagaattcaagccct
cacctaccaaaactgccaccaaagacttacaaacgggagctctcgcaccccccgttgtat
agactgcctttgctagaaaatgcagaaactcctcaatga

DBGET integrated database retrieval system