KEGG   Sarcophilus harrisii (Tasmanian devil): 100931009
Entry
100931009         CDS       T02286

Gene name
SOS1
Definition
(RefSeq) son of sevenless homolog 1 isoform X2
KO
K03099  son of sevenless
Organism
shr  Sarcophilus harrisii (Tasmanian devil)
Pathway
shr01521  EGFR tyrosine kinase inhibitor resistance
shr01522  Endocrine resistance
shr04010  MAPK signaling pathway
shr04012  ErbB signaling pathway
shr04014  Ras signaling pathway
shr04062  Chemokine signaling pathway
shr04068  FoxO signaling pathway
shr04072  Phospholipase D signaling pathway
shr04150  mTOR signaling pathway
shr04151  PI3K-Akt signaling pathway
shr04510  Focal adhesion
shr04540  Gap junction
shr04630  JAK-STAT signaling pathway
shr04650  Natural killer cell mediated cytotoxicity
shr04660  T cell receptor signaling pathway
shr04662  B cell receptor signaling pathway
shr04664  Fc epsilon RI signaling pathway
shr04714  Thermogenesis
shr04722  Neurotrophin signaling pathway
shr04810  Regulation of actin cytoskeleton
shr04910  Insulin signaling pathway
shr04912  GnRH signaling pathway
shr04915  Estrogen signaling pathway
shr04917  Prolactin signaling pathway
shr04926  Relaxin signaling pathway
shr04935  Growth hormone synthesis, secretion and action
shr05034  Alcoholism
shr05160  Hepatitis C
shr05161  Hepatitis B
shr05163  Human cytomegalovirus infection
shr05165  Human papillomavirus infection
shr05200  Pathways in cancer
shr05205  Proteoglycans in cancer
shr05206  MicroRNAs in cancer
shr05210  Colorectal cancer
shr05211  Renal cell carcinoma
shr05213  Endometrial cancer
shr05214  Glioma
shr05215  Prostate cancer
shr05220  Chronic myeloid leukemia
shr05221  Acute myeloid leukemia
shr05223  Non-small cell lung cancer
shr05224  Breast cancer
shr05225  Hepatocellular carcinoma
shr05226  Gastric cancer
shr05231  Choline metabolism in cancer
Brite
KEGG Orthology (KO) [BR:shr00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04010 MAPK signaling pathway
    100931009 (SOS1)
   04012 ErbB signaling pathway
    100931009 (SOS1)
   04014 Ras signaling pathway
    100931009 (SOS1)
   04630 JAK-STAT signaling pathway
    100931009 (SOS1)
   04068 FoxO signaling pathway
    100931009 (SOS1)
   04072 Phospholipase D signaling pathway
    100931009 (SOS1)
   04151 PI3K-Akt signaling pathway
    100931009 (SOS1)
   04150 mTOR signaling pathway
    100931009 (SOS1)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    100931009 (SOS1)
   04540 Gap junction
    100931009 (SOS1)
  09142 Cell motility
   04810 Regulation of actin cytoskeleton
    100931009 (SOS1)
 09150 Organismal Systems
  09151 Immune system
   04650 Natural killer cell mediated cytotoxicity
    100931009 (SOS1)
   04660 T cell receptor signaling pathway
    100931009 (SOS1)
   04662 B cell receptor signaling pathway
    100931009 (SOS1)
   04664 Fc epsilon RI signaling pathway
    100931009 (SOS1)
   04062 Chemokine signaling pathway
    100931009 (SOS1)
  09152 Endocrine system
   04910 Insulin signaling pathway
    100931009 (SOS1)
   04912 GnRH signaling pathway
    100931009 (SOS1)
   04915 Estrogen signaling pathway
    100931009 (SOS1)
   04917 Prolactin signaling pathway
    100931009 (SOS1)
   04926 Relaxin signaling pathway
    100931009 (SOS1)
   04935 Growth hormone synthesis, secretion and action
    100931009 (SOS1)
  09156 Nervous system
   04722 Neurotrophin signaling pathway
    100931009 (SOS1)
  09159 Environmental adaptation
   04714 Thermogenesis
    100931009 (SOS1)
 09160 Human Diseases
  09161 Cancer: overview
   05200 Pathways in cancer
    100931009 (SOS1)
   05206 MicroRNAs in cancer
    100931009 (SOS1)
   05205 Proteoglycans in cancer
    100931009 (SOS1)
   05231 Choline metabolism in cancer
    100931009 (SOS1)
  09162 Cancer: specific types
   05210 Colorectal cancer
    100931009 (SOS1)
   05225 Hepatocellular carcinoma
    100931009 (SOS1)
   05226 Gastric cancer
    100931009 (SOS1)
   05214 Glioma
    100931009 (SOS1)
   05221 Acute myeloid leukemia
    100931009 (SOS1)
   05220 Chronic myeloid leukemia
    100931009 (SOS1)
   05211 Renal cell carcinoma
    100931009 (SOS1)
   05215 Prostate cancer
    100931009 (SOS1)
   05213 Endometrial cancer
    100931009 (SOS1)
   05224 Breast cancer
    100931009 (SOS1)
   05223 Non-small cell lung cancer
    100931009 (SOS1)
  09172 Infectious disease: viral
   05161 Hepatitis B
    100931009 (SOS1)
   05160 Hepatitis C
    100931009 (SOS1)
   05163 Human cytomegalovirus infection
    100931009 (SOS1)
   05165 Human papillomavirus infection
    100931009 (SOS1)
  09165 Substance dependence
   05034 Alcoholism
    100931009 (SOS1)
  09176 Drug resistance: antineoplastic
   01521 EGFR tyrosine kinase inhibitor resistance
    100931009 (SOS1)
   01522 Endocrine resistance
    100931009 (SOS1)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   04990 Domain-containing proteins not elsewhere classified [BR:shr04990]
    100931009 (SOS1)
Domain-containing proteins not elsewhere classified [BR:shr04990]
 Pleckstrin homology (PH) domain-containing proteins
  Dbl-Like RhoGEF family proteins
   100931009 (SOS1)
Motif
Pfam: RasGEF RasGEF_N RhoGEF Histone PH PH_19 PH_10 IQ_SEC7_PH PH_13
Other DBs
NCBI-GeneID: 100931009
NCBI-ProteinID: XP_003758384
Ensembl: ENSSHAG00000014797
UniProt: G3WPQ9
Position
2
AA seq 1316 aa
MQALQLQYEFFSEENAPKWRGLLVTALKKVQMQVHPTLASNEDALQYVEELILQLLSMLC
QAQPRSVLDVEDRVQKSFPHPIDKWAIADAQSANEKRKRRNPLSLPVEKIHPLLKEVLGY
KIDHQVSVYIVAVLEYISADILKLVGNYVRNIRHYEITKQDIKVAMCADKVLMDMFHQDV
EDINILSLTDEEPSTSGEQTYYDLVKGFMAEVRQYIRELNLIIKVFREPFVSNSKLFSSH
DVENIFSRIADVHELSVKLLGHIEDTVEMTDEGSPHPLVGSCFEDLAEELAFDPYESYAQ
DILRPGFHDHFLSQLSKPGAALYLQSIGEGFKEAVQYVLPRLLLAPVYHCLHYFELLKQL
EEKSEDQEDKECLKQAITALLNLQSSMERICSKSLAKRRLSESACRFYSQQMKGKQLAIK
KMNEIQKNIDGWEGKDIGQCCNEFIMEGTLTRVGAKHERHIFLFDGLMICCKSNHGQPRL
PGASNAEYRLKEKFFMRKVQINDKDDTNEYKHAFEIILKDENSVIFSAKSAEEKNNWMAA
LISLQYRSTLERMLDVTMLQEEKEEQMRLPSSDLYRFAEPDSEENIVFEENMQPKSGIPI
IKAGTVIKLIERLTYHMYADPNFVRTFLTTYRSFCKPQELLSLLIERFEIPEPEPTEADK
IAMENGDQPLSVELKRFRKEYIQPVQLRVLNVCRHWVEHHFYDFERDTDLLQRLEEFIGT
VRGKAMKKWVESITKIIQRKKMARDNGPGHNITFESSPPAVEWHISRPGHTETFDLLTLH
PIEIARQLTLLESDLYRAVQPSELVGSVWTKEDKEINSPNLLKMIRHTTNLTLWFEKCIV
EAENLEERVAVVSRIIEILQVFQELNNFNGVLEVVSAMNSSPVYRLDHTFEQIPSRQKKI
LEEAHELSEDHYKKYLAKLRSINPPCVPFFGIYLTNILKTEEGNPEVLKRHGKELINFSK
RRKVAEITGEIQQYQNQPYCLRVESDIKRFFENLNPMGNSMEKEFTDYLFNKSLEIEPRN
PKPLPRFPKKYSYPLKSPGVRPSNPRPGTMRHPTPLQQEPRKISYSRIPESETESAASAP
NSPRTPLTPPPASGASSTTDVGSVFDSEHSSPFHSRSASVSSINLTKSIDEMPIPPPVPP
RRRPESAPAESSPSKIMPKHLDSPPAIPPRQPTSKVYSPRYSDRTSISDPPESPPLLPPR
EPVRTPDVFSSSPLHLQPPPLGKKSEHGNTFFPNSPSPFTPPPPQTPSPHGTRRHLPSPP
LTQDVDLHSIPAPPVPPRQSTSQHIPKLPPKTYKREHTHPSMHRDGPPLLENAHSS
NT seq 3951 nt
atgcaggcgctgcagctccagtacgagttcttcagcgaggaaaacgcgcccaagtggagg
gggctgctggtgacggccctgaaaaaggtccagatgcaagtccatcctacacttgcttca
aatgaggatgctctccagtatgttgaagagttaattttacagttgttaagtatgctgtgc
caagctcaaccccgaagcgttttagatgttgaggatcgtgtacaaaaaagttttcctcat
ccaattgataagtgggcgatagctgatgctcaatccgctaatgaaaagaggaagcgaaga
aatccattatctctcccagtagaaaaaattcaccctttattaaaggaagttctaggatat
aaaattgaccaccaggtttctgtttacatagtggcagtattagaatacatttctgcagat
attttaaagctggttggaaattatgtacggaatatacgacattatgaaattacaaaacag
gacatcaaagtagcaatgtgtgctgataaggtattgatggatatgtttcatcaagatgta
gaagatataaatatattatctttaactgatgaagaaccttccacttcaggggaacaaacg
tactatgacttggtaaaaggattcatggctgaagttcgacaatatataagggaacttaat
ctcattattaaagtttttagagaaccatttgtctccaattcaaaattattttcttctcac
gatgtagaaaatatatttagtcgtatagcagatgttcatgaactcagtgtaaaattattg
ggccatatagaagacactgtagaaatgacagatgaaggcagtccccatccattagttggt
agctgctttgaagacttagcagaggaattggcatttgatccatatgaatcatatgctcaa
gacatattgcgacctggttttcatgatcactttcttagtcagttatccaagcctggggca
gccctctacttgcagtcaataggtgaaggtttcaaagaagctgttcagtatgttttaccc
aggctacttctagctcctgtgtaccactgtctacattactttgaacttttaaagcagtta
gaagagaagagtgaagaccaagaagacaaagaatgtttgaaacaagcaataacggctttg
cttaaccttcagagcagtatggaaagaatatgttccaaaagtcttgcaaaacggagattg
agcgaatctgcatgtcgattttatagccaacagatgaagggaaaacaactagcaataaag
aaaatgaatgagattcagaagaacattgatgggtgggagggaaaagacattggacagtgt
tgcaatgagtttatcatggaaggaactctcacacgtgtaggtgccaaacatgagagacac
atatttctctttgatggtctgatgatttgctgtaaatcaaatcatggtcagccaagactt
cctggtgctagcaatgctgagtatcgtctcaaagagaagttttttatgcgcaaggtacaa
atcaatgataaggatgacactaatgaatacaaacatgcctttgaaataatcttaaaagat
gaaaatagtgttatattttctgccaagtcagctgaggagaaaaacaactggatggcagca
ctgatatcattacaatatcgtagtactctggaaaggatgctggatgtaacaatgttacaa
gaggagaaggaagagcagatgaggcttccaagttcagatctttatagatttgcagaacca
gattctgaagaaaacatagtatttgaagaaaacatgcaacccaaatctggaattccaatc
atcaaagcaggaaccgttatcaaacttatagagcgactcacctaccatatgtatgctgat
cccaattttgttcggacatttcttacaacatatagatctttctgcaagcctcaagagcta
ctgagtcttctaatagagaggtttgaaattccagagccagagccaacagaagctgataag
atagctatggagaatggagatcaacccctgagtgtggaactaaaacgatttagaaaagaa
tatatacaacccgtacaacttcgagtactaaatgtatgtcggcattgggtagaacatcac
ttctatgattttgaaagggatacagatcttttgcaacgactggaagaatttattggaaca
gtaaggggcaaagcaatgaagaaatgggttgaatctatcactaaaatcattcaacggaaa
aaaatggcaagagacaatggaccaggacataatattacatttgaaagttcaccacctgca
gttgaatggcatataagcagacctggacacacagaaacttttgatctgctcaccttgcac
ccaatagaaattgctcgacagctcactttacttgagtcagatctctacagagctgtacag
ccatcagaattagttggaagtgtgtggacaaaagaagacaaagaaattaattctcctaac
cttctgaaaatgattcgccacacaactaatcttactttgtggtttgaaaagtgtattgta
gaagcagaaaacctagaggaaagagtagctgtggtgagccgaataattgagattcttcaa
gtctttcaagaactgaacaactttaatggtgtacttgaggttgtcagtgccatgaattca
tcacctgtttatagactggaccacacgtttgagcaaattccaagtcgccaaaagaaaatt
ctagaggaagctcatgaactgagtgaagatcactataaaaaatatttggcaaaactcagg
tctattaatccaccatgtgtgcctttctttgggatttatttgacaaatatcttgaaaacg
gaagaaggcaatcctgaggttttgaaaagacatggaaaagaactcataaactttagcaaa
agaagaaaagttgcagaaataacaggagagatacagcagtaccaaaatcaaccatattgt
ttgcgagtagaatctgatatcaaaagattctttgaaaatttgaatccaatgggaaatagc
atggaaaaagaattcacagattatcttttcaacaagtcactggaaattgaaccaagaaac
cctaagcctctaccgagatttccaaaaaaatatagttatcccctcaagtctcctggtgtt
cgtccatcaaacccaagaccaggtaccatgagacaccccacaccactgcagcaggaacca
aggaagatcagttacagccgcatccctgaaagtgaaacagaaagtgcagcatcagcacct
aattctccacggaccccgttaacacctcctccagcttctggtgcttccagtaccactgat
gtgggcagcgtgtttgattctgagcattcaagcccttttcactcaagatctgcttcagta
tcctcaataaatttaactaagagcattgatgaaatgcccatccccccaccagtgccccca
agaagacgaccagaatccgctccagcagaatcttctccctcaaagattatgcccaaacat
ttggatagtcctccagcaattcctcctcggcaacccacatcaaaagtctattcaccacgt
tactccgaccggacctcgatctctgatcctccagaaagccctccccttttaccaccacga
gaacctgtgaggacacctgatgttttctccagctcaccactacatctccagccaccccct
ttgggcaaaaaaagtgaacatggcaatacttttttcccaaacagcccctccccctttacc
ccaccacctcctcaaacaccttctcctcacggcacaaggaggcacctaccatcaccacca
ttaacacaagatgtggaccttcattccatccctgcgcctcctgttcctccacggcaaagc
acttctcaacacatcccaaaacttcctccaaaaacttataaaagggaacacacacaccct
tccatgcatagagatggaccacctttgttggagaatgcccattcctcctga

KEGG   Sarcophilus harrisii (Tasmanian devil): 100932999
Entry
100932999         CDS       T02286

Gene name
SOS2
Definition
(RefSeq) son of sevenless homolog 2 isoform X1
KO
K03099  son of sevenless
Organism
shr  Sarcophilus harrisii (Tasmanian devil)
Pathway
shr01521  EGFR tyrosine kinase inhibitor resistance
shr01522  Endocrine resistance
shr04010  MAPK signaling pathway
shr04012  ErbB signaling pathway
shr04014  Ras signaling pathway
shr04062  Chemokine signaling pathway
shr04068  FoxO signaling pathway
shr04072  Phospholipase D signaling pathway
shr04150  mTOR signaling pathway
shr04151  PI3K-Akt signaling pathway
shr04510  Focal adhesion
shr04540  Gap junction
shr04630  JAK-STAT signaling pathway
shr04650  Natural killer cell mediated cytotoxicity
shr04660  T cell receptor signaling pathway
shr04662  B cell receptor signaling pathway
shr04664  Fc epsilon RI signaling pathway
shr04714  Thermogenesis
shr04722  Neurotrophin signaling pathway
shr04810  Regulation of actin cytoskeleton
shr04910  Insulin signaling pathway
shr04912  GnRH signaling pathway
shr04915  Estrogen signaling pathway
shr04917  Prolactin signaling pathway
shr04926  Relaxin signaling pathway
shr04935  Growth hormone synthesis, secretion and action
shr05034  Alcoholism
shr05160  Hepatitis C
shr05161  Hepatitis B
shr05163  Human cytomegalovirus infection
shr05165  Human papillomavirus infection
shr05200  Pathways in cancer
shr05205  Proteoglycans in cancer
shr05206  MicroRNAs in cancer
shr05210  Colorectal cancer
shr05211  Renal cell carcinoma
shr05213  Endometrial cancer
shr05214  Glioma
shr05215  Prostate cancer
shr05220  Chronic myeloid leukemia
shr05221  Acute myeloid leukemia
shr05223  Non-small cell lung cancer
shr05224  Breast cancer
shr05225  Hepatocellular carcinoma
shr05226  Gastric cancer
shr05231  Choline metabolism in cancer
Brite
KEGG Orthology (KO) [BR:shr00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04010 MAPK signaling pathway
    100932999 (SOS2)
   04012 ErbB signaling pathway
    100932999 (SOS2)
   04014 Ras signaling pathway
    100932999 (SOS2)
   04630 JAK-STAT signaling pathway
    100932999 (SOS2)
   04068 FoxO signaling pathway
    100932999 (SOS2)
   04072 Phospholipase D signaling pathway
    100932999 (SOS2)
   04151 PI3K-Akt signaling pathway
    100932999 (SOS2)
   04150 mTOR signaling pathway
    100932999 (SOS2)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    100932999 (SOS2)
   04540 Gap junction
    100932999 (SOS2)
  09142 Cell motility
   04810 Regulation of actin cytoskeleton
    100932999 (SOS2)
 09150 Organismal Systems
  09151 Immune system
   04650 Natural killer cell mediated cytotoxicity
    100932999 (SOS2)
   04660 T cell receptor signaling pathway
    100932999 (SOS2)
   04662 B cell receptor signaling pathway
    100932999 (SOS2)
   04664 Fc epsilon RI signaling pathway
    100932999 (SOS2)
   04062 Chemokine signaling pathway
    100932999 (SOS2)
  09152 Endocrine system
   04910 Insulin signaling pathway
    100932999 (SOS2)
   04912 GnRH signaling pathway
    100932999 (SOS2)
   04915 Estrogen signaling pathway
    100932999 (SOS2)
   04917 Prolactin signaling pathway
    100932999 (SOS2)
   04926 Relaxin signaling pathway
    100932999 (SOS2)
   04935 Growth hormone synthesis, secretion and action
    100932999 (SOS2)
  09156 Nervous system
   04722 Neurotrophin signaling pathway
    100932999 (SOS2)
  09159 Environmental adaptation
   04714 Thermogenesis
    100932999 (SOS2)
 09160 Human Diseases
  09161 Cancer: overview
   05200 Pathways in cancer
    100932999 (SOS2)
   05206 MicroRNAs in cancer
    100932999 (SOS2)
   05205 Proteoglycans in cancer
    100932999 (SOS2)
   05231 Choline metabolism in cancer
    100932999 (SOS2)
  09162 Cancer: specific types
   05210 Colorectal cancer
    100932999 (SOS2)
   05225 Hepatocellular carcinoma
    100932999 (SOS2)
   05226 Gastric cancer
    100932999 (SOS2)
   05214 Glioma
    100932999 (SOS2)
   05221 Acute myeloid leukemia
    100932999 (SOS2)
   05220 Chronic myeloid leukemia
    100932999 (SOS2)
   05211 Renal cell carcinoma
    100932999 (SOS2)
   05215 Prostate cancer
    100932999 (SOS2)
   05213 Endometrial cancer
    100932999 (SOS2)
   05224 Breast cancer
    100932999 (SOS2)
   05223 Non-small cell lung cancer
    100932999 (SOS2)
  09172 Infectious disease: viral
   05161 Hepatitis B
    100932999 (SOS2)
   05160 Hepatitis C
    100932999 (SOS2)
   05163 Human cytomegalovirus infection
    100932999 (SOS2)
   05165 Human papillomavirus infection
    100932999 (SOS2)
  09165 Substance dependence
   05034 Alcoholism
    100932999 (SOS2)
  09176 Drug resistance: antineoplastic
   01521 EGFR tyrosine kinase inhibitor resistance
    100932999 (SOS2)
   01522 Endocrine resistance
    100932999 (SOS2)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   04990 Domain-containing proteins not elsewhere classified [BR:shr04990]
    100932999 (SOS2)
Domain-containing proteins not elsewhere classified [BR:shr04990]
 Pleckstrin homology (PH) domain-containing proteins
  Dbl-Like RhoGEF family proteins
   100932999 (SOS2)
Motif
Pfam: RasGEF RasGEF_N RhoGEF PH Histone IQ_SEC7_PH PH_13 PH_19 PH_9
Other DBs
NCBI-GeneID: 100932999
NCBI-ProteinID: XP_031808829
Ensembl: ENSSHAG00000003599
Position
2
AA seq 1329 aa
MQPPQQPYDFFSEENHPKWRGLFIPALRKVQHQVHPNLSAKEDCLYYIEELILQLLHKLC
IVQPRTVQDVEERVQKTFPHPIDKWAIADAQSAVEKRKRRTPLLLPVDKIHPLLKEVLGY
KVDYHVSLYIVAVLEYISADILKLAGNYVFNIRHYEISQQDIKVSMCADKVLMDMFDQDD
IGLVSLCEDESLSSGELNYYDIVRTEIAEERQYLRELNLIIKVFREAFLSNKKLFASSDI
EGIFSNILDIHELTVKLLGLIEDTVEMTDESSPHPLAGSCFEDLAEEQAFDPYETLSQDI
LSPQFHEHFNTLMAKPAVSLHFQSIADGFKEAVQYVLPRLMLVPVYHCSHYFELLEQLQE
CSEEQEDRECLKQAITALLNFRCSMERICNKHSPRRRPVDPVCRFYNRQLRSKHLAIKKM
NEIQKNIDGWEGKDIVQCCNEFIMEGPLIRIGAKHERHIFLFDGLMISCKTNHGQSRIPG
YSNAEYRLKEKFIMRKIQICDKEDTSEYKHIFELISKDENSIIFAAKSTEEKNNWMAALI
SLQYRSTLDRMLDSVLLKEENEQPLRLPSPEVYRFVVKDSEENIVFEDNLQSGIPVIKGG
TVVKLIERLTYHMYADPNFVRTFLTTYRSFCKPQELLKLLIERFEIPEPEPSEADKLAVE
KGEQPISADLKRFRKEYVQPVQLRILNVFRHWVEHHFYDFERDLQLLEKLESFISSVKGK
TMKKWVESIVKIIKRKKQAQANGISHNITFESPPPPIEWHISRPGQYETFDLMTLHPIEI
ARQLTLLESDLYRAVQPSELVGSVWTKEDKEINSPNLLKMIRHTTNLTLWFEKCIVEAEN
FEERVAVLSRIMEILQVFQDLNNFSGVLEIVSAMNSVSVYRLEHTFEALQERKRKILEEA
VELSQDHFKKYLAKLKSINPPCVPFFGIYLTNILKTEEGNNDFLKKQGKELINFSKRRKV
AEITGEIQQYQNQPYCLRIEAEIRRFFENLNPIGNASEKEFTDYLFNKSQEIEPRNCKQP
PRFPRKTTFSLKSPGIRPHTGRHGSTSGTLRGHPTPLEREPCKISFSRINEAEHESTASA
PTSPNTPSTPPVSASSDLSVFLDTDLNTSYGSNSIFAPVILPPSKSFLNSCGSLHKLTEE
SLVPPPLPPRKKFDQDVSTSKGNVKYDDDPPAIPPRQPPPPKIKPRVPAYSGPFEGPLPS
PPPPPPRDPLPETPLPVPLRPPEHFINYPFNLQPSPMGHTHRDPDWFREASTCPNSPNTP
PSTPSPRVPHRCVLSSNQNNLAHSQAPPIPPRQNSSPHLPKLPPKTYKRDLCQPPVYRLP
LLEHAETPQ
NT seq 3990 nt
atgcagccgccgcaacagccttacgacttctttagtgaagagaaccatccgaaatggcgg
ggactcttcatccctgccctgcggaaggttcagcatcaagttcaccccaatctctcagca
aaagaggattgtctatattacattgaagagttgatccttcaactgctacataaattatgc
attgtacagccaaggactgttcaagatgtagaggaacgtgttcaaaagacttttcctcat
ccaattgacaaatgggctattgctgatgcacaatctgccgtagaaaaacgaaaacgaagg
acccctcttttgttacctgtggataaaattcatcctttattgaaggaagttttagggtat
aaagtagattatcatgtctctctttatattgtggctgtattggaatatatctctgctgat
attctgaaattggctggtaattatgtttttaatatcagacattatgaaatctcccagcag
gacattaaagtatcaatgtgtgcagataaggttttaatggacatgtttgatcaggatgac
attggtttggtttcactctgtgaagatgaatctctttcttcaggtgaactaaattactat
gacatagtcagaactgaaattgcagaagaaagacaatatctacgggaattaaatctaata
ataaaagtatttcgggaggcttttctttcaaacaaaaaactgttcgcatcttctgatatt
gaagggatattcagtaacattttagatattcatgaattgactgtaaagcttttaggcttg
attgaagacacagttgaaatgactgatgaaagcagtcctcatcctttagctggaagctgt
tttgaagatctagcagaagaacaagcatttgatccatatgaaaccttatcacaggatatt
ctttcaccacagtttcatgaacatttcaatactttgatggctaaacctgctgtttctcta
cattttcagtccattgctgatggctttaaagaggctgtacaatatgttcttccacgcctt
atgctagtgccagtttaccattgttcacactactttgaattattagagcaattgcaagaa
tgcagtgaagaacaagaagaccgagaatgtttaaaacaagctattactgctcttctgaat
ttccgatgtagcatggaacggatttgcaataagcattcacctagacggcgacctgtggat
cctgtttgtcgattttataatcgtcaattacggagtaagcacctggccattaaaaaaatg
aatgaaattcaaaaaaatattgatggatgggaaggcaaagatattgtccagtgttgtaat
gaatttataatggaaggaccattgataagaataggagctaagcatgaacgccatattttt
ctttttgatggcttaatgattagctgtaaaactaatcatggacaatcccggattccaggt
tatagcaatgcagagtacagattgaaagaaaaatttatcatgaggaaaatacaaatctgt
gataaagaagacacttctgaatacaagcatatctttgagttgatttccaaagatgaaaac
agcattatatttgctgctaagtccactgaagagaaaaataattggatggcagccttgatt
tctcttcagtatcgtagtacattagatcgaatgctagattcagtattattaaaagaagaa
aatgaacaaccattgaggttgccaagtccagaagtttatcgttttgttgtaaaagactcc
gaagaaaacattgtttttgaagacaacttgcaaagtggaatccctgtcattaaaggagga
actgtggtgaaattaattgaaagactaacctaccatatgtatgcagatcccaattttgtg
cgtacttttcttacaacctatcgttcattttgtaagccacaggaattgttgaagttattg
attgaacggtttgaaatccccgagccagaaccttccgaagcagataaattggcagtagag
aaaggagagcaaccaatcagtgcagacctgaaacggtttcgcaaagaatatgtccaacca
gtacaacttaggatactaaatgtatttcggcactgggtagaacatcatttttatgatttt
gagagagatctgcaattacttgaaaaattagaatcctttatttcaagtgtaaaagggaaa
acgatgaagaagtgggtagagtcaattgttaaaatcatcaagaggaaaaaacaagctcag
gcaaatggaattagccataatattacttttgaaagtccaccaccacccattgagtggcat
attagcagaccaggacagtatgaaacatttgatctcatgaccctgcatccaatagaaatt
gcacgtcaactaacacttctggaatctgacctctacagggcagttcagccttctgaactt
gtaggaagtgtgtggactaaagaagataaggaaataaattctccaaatttattgaaaatg
attcggcacactacaaatctaactctctggtttgaaaagtgcatagtagaagcagaaaac
tttgaagaacgagtggcagtattaagtaggattatggagattctgcaagtttttcaagac
ttgaacaatttcagtggtgtgttagagatagtcagtgcaatgaattcagtgtccgtatac
agattggaacatacatttgaagcattacaggaaagaaaacggaagattttagaagaagct
gtagaactaagccaagatcactttaaaaaatacctagcaaaactgaaatcaatcaaccca
ccttgtgtgcctttttttggaatatatttaacaaatattctgaagactgaagaaggtaat
aatgatttccttaaaaaacaagggaaagaattaatcaactttagtaaaagaaggaaagta
gctgaaattactggagaaattcaacagtatcagaatcagccttattgtttacggatagaa
gcagaaataaggagattctttgaaaatctcaaccctataggaaatgcatcagaaaaagaa
tttacagactatttattcaataaatcacaagaaattgaaccccgaaactgtaaacaacca
cctcgatttcctaggaaaacaactttctccttaaaatctcctggtataaggccacatact
ggcagacatggctctacttcaggcactttacgaggtcatccaactccattagaaagagaa
ccatgtaaaataagctttagtcggataaatgaggctgaacatgaatcaacagcatcagca
ccaacctctccaaacacaccatctacgccaccagtctctgcttcttcagatcttagtgta
tttttagatacagatcttaacacttcttatggaagcaacagcatctttgctccagtaatc
ttaccaccttcaaaatctttcttaaattcatgtggtagtttacataaattaactgaagag
tcactggttcctcctcctcttcctcctcgtaaaaagtttgaccaagatgtttcgacttcc
aagggaaacgtgaaatatgatgatgatcctcctgctattccaccaaggcagccaccacct
ccaaaaataaaacctcgagttcctgcttacagtggtccgtttgaagggcctttgcctagt
ccaccaccaccacctccacgagatcctcttcctgagactcctttaccagtacctcttcgg
cctcctgaacattttataaactatccatttaatcttcagccatcaccaatgggacacact
cacagagatccagattggtttcgggaagccagtacatgtccaaactcaccaaatactcct
ccaagcacaccctctccaagggtgccacatcgctgtgtgctcagttccaatcaaaataat
cttgctcattctcaagctccccctatcccaccaagacagaattcaagtcctcacttacca
aaactgccaccaaagacttacaaaagagatctttgtcaacctcctgtatatagattgccc
ttgttagaacatgcagaaactcctcagtga
