KEGG   Phascolarctos cinereus (koala): 110201388
Entry
110201388         CDS       T05867                                 

Gene name
SOS1
Definition
(RefSeq) LOW QUALITY PROTEIN: son of sevenless homolog 1
KO
K03099  son of sevenless
Organism
pcw  Phascolarctos cinereus (koala)
Pathway
pcw01521  EGFR tyrosine kinase inhibitor resistance
pcw01522  Endocrine resistance
pcw04010  MAPK signaling pathway
pcw04012  ErbB signaling pathway
pcw04014  Ras signaling pathway
pcw04062  Chemokine signaling pathway
pcw04068  FoxO signaling pathway
pcw04072  Phospholipase D signaling pathway
pcw04150  mTOR signaling pathway
pcw04151  PI3K-Akt signaling pathway
pcw04510  Focal adhesion
pcw04540  Gap junction
pcw04630  JAK-STAT signaling pathway
pcw04650  Natural killer cell mediated cytotoxicity
pcw04660  T cell receptor signaling pathway
pcw04662  B cell receptor signaling pathway
pcw04664  Fc epsilon RI signaling pathway
pcw04714  Thermogenesis
pcw04722  Neurotrophin signaling pathway
pcw04810  Regulation of actin cytoskeleton
pcw04910  Insulin signaling pathway
pcw04912  GnRH signaling pathway
pcw04915  Estrogen signaling pathway
pcw04917  Prolactin signaling pathway
pcw04926  Relaxin signaling pathway
pcw04935  Growth hormone synthesis, secretion and action
pcw05034  Alcoholism
pcw05160  Hepatitis C
pcw05161  Hepatitis B
pcw05163  Human cytomegalovirus infection
pcw05165  Human papillomavirus infection
pcw05200  Pathways in cancer
pcw05205  Proteoglycans in cancer
pcw05206  MicroRNAs in cancer
pcw05210  Colorectal cancer
pcw05211  Renal cell carcinoma
pcw05213  Endometrial cancer
pcw05214  Glioma
pcw05215  Prostate cancer
pcw05220  Chronic myeloid leukemia
pcw05221  Acute myeloid leukemia
pcw05223  Non-small cell lung cancer
pcw05224  Breast cancer
pcw05225  Hepatocellular carcinoma
pcw05226  Gastric cancer
pcw05231  Choline metabolism in cancer
Brite
KEGG Orthology (KO) [BR:pcw00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04010 MAPK signaling pathway
    110201388 (SOS1)
   04012 ErbB signaling pathway
    110201388 (SOS1)
   04014 Ras signaling pathway
    110201388 (SOS1)
   04630 JAK-STAT signaling pathway
    110201388 (SOS1)
   04068 FoxO signaling pathway
    110201388 (SOS1)
   04072 Phospholipase D signaling pathway
    110201388 (SOS1)
   04151 PI3K-Akt signaling pathway
    110201388 (SOS1)
   04150 mTOR signaling pathway
    110201388 (SOS1)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    110201388 (SOS1)
   04540 Gap junction
    110201388 (SOS1)
  09142 Cell motility
   04810 Regulation of actin cytoskeleton
    110201388 (SOS1)
 09150 Organismal Systems
  09151 Immune system
   04650 Natural killer cell mediated cytotoxicity
    110201388 (SOS1)
   04660 T cell receptor signaling pathway
    110201388 (SOS1)
   04662 B cell receptor signaling pathway
    110201388 (SOS1)
   04664 Fc epsilon RI signaling pathway
    110201388 (SOS1)
   04062 Chemokine signaling pathway
    110201388 (SOS1)
  09152 Endocrine system
   04910 Insulin signaling pathway
    110201388 (SOS1)
   04912 GnRH signaling pathway
    110201388 (SOS1)
   04915 Estrogen signaling pathway
    110201388 (SOS1)
   04917 Prolactin signaling pathway
    110201388 (SOS1)
   04926 Relaxin signaling pathway
    110201388 (SOS1)
   04935 Growth hormone synthesis, secretion and action
    110201388 (SOS1)
  09156 Nervous system
   04722 Neurotrophin signaling pathway
    110201388 (SOS1)
  09159 Environmental adaptation
   04714 Thermogenesis
    110201388 (SOS1)
 09160 Human Diseases
  09161 Cancer: overview
   05200 Pathways in cancer
    110201388 (SOS1)
   05206 MicroRNAs in cancer
    110201388 (SOS1)
   05205 Proteoglycans in cancer
    110201388 (SOS1)
   05231 Choline metabolism in cancer
    110201388 (SOS1)
  09162 Cancer: specific types
   05210 Colorectal cancer
    110201388 (SOS1)
   05225 Hepatocellular carcinoma
    110201388 (SOS1)
   05226 Gastric cancer
    110201388 (SOS1)
   05214 Glioma
    110201388 (SOS1)
   05221 Acute myeloid leukemia
    110201388 (SOS1)
   05220 Chronic myeloid leukemia
    110201388 (SOS1)
   05211 Renal cell carcinoma
    110201388 (SOS1)
   05215 Prostate cancer
    110201388 (SOS1)
   05213 Endometrial cancer
    110201388 (SOS1)
   05224 Breast cancer
    110201388 (SOS1)
   05223 Non-small cell lung cancer
    110201388 (SOS1)
  09165 Substance dependence
   05034 Alcoholism
    110201388 (SOS1)
  09172 Infectious disease: viral
   05161 Hepatitis B
    110201388 (SOS1)
   05160 Hepatitis C
    110201388 (SOS1)
   05163 Human cytomegalovirus infection
    110201388 (SOS1)
   05165 Human papillomavirus infection
    110201388 (SOS1)
  09176 Drug resistance: antineoplastic
   01521 EGFR tyrosine kinase inhibitor resistance
    110201388 (SOS1)
   01522 Endocrine resistance
    110201388 (SOS1)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   04990 Domain-containing proteins not elsewhere classified [BR:pcw04990]
    110201388 (SOS1)
Domain-containing proteins not elsewhere classified [BR:pcw04990]
 Pleckstrin homology (PH) domain-containing proteins
  Dbl-Like RhoGEF family proteins
   110201388 (SOS1)
Motif
Pfam: RasGEF RasGEF_N RhoGEF Histone PH PH_19 PH_10 IQ_SEC7_PH PH_13
Other DBs
NCBI-GeneID: 110201388
NCBI-ProteinID: XP_020832638
UniProt: A0A6P5JGV8
Position
Unknown
AA seq 1332 aa
MQALQLQYEFFSEENAPKWRGLLVTALKKVQMQVHPTLASNEDALQYVEELILQLLSMLC
QAQPRSVLDVEDRVQKSFPHPIDKWAIADAQSANEKRKRRNPLSLPVEKIHPLLKEVLGY
KIDHQVSVYIVAVLEYISADILKLVGNYVRNIRHYEITKQDIKVAMCADKVLMDMFHQDV
EDINILSLTDEEPSTSGEQTYYDLVKAFMAEVRQYIRELNLIIKVFREPFVSNSKLFSSH
DVENIFSRIADVHELSVKLLGHIEDTVEMTDEGSPHPLVGSCFEDLAEELAFDPYESYAQ
DILRPGFHDHFLSQLSKPGAALYLQSIGEGFKEAVQYVLPRLLLAPVYHCLHYFELLKQL
EEKSEDQEDKECLKQAITALLNLQSSMERICSKSLAKRRLSESACRFYSQQMKGKQLAIK
KMNEIQKNIDGWEGKDIGQCCNEFIMEGTLTRVGAKHERHIFLFDGLMICCKSNHGQPRL
PGASNAEYRLKEKFFMRKVQINDKDDTNEYKHAFEIILKDENSVIFSAKSAEEKNNWMAA
LISLQYRSTLERMLDATMLQEEKEEQMRLPSSDLYRFAEPDSEENIVFEENMQPKSGIPI
IKAGTVIKLIERLTYHMYADPNFVRTFLTTYRSFCKPQELLSLLIERFEIPEPEPTEADR
IAMENGDQPLSVELKRFRKEYIQPVQLRVLNVCRHWVEHHFYDFERDADLLQRLEEFIGT
VRGKAMKKWVESITKIIQRKKMARDNGPGHNITFESSPPAVEWHISRPGHTETFDLLTLH
PIEIARQLTLLESDLYRAVQPSELVGSVWTKEDKEINSPNLLKMIRHTTNLTLWFEKCIV
EAENLEERVAVVSRIIEILQVFQELNNFNGVLEVVSAMNSSPVYRLDHTFEQIPSRQKKI
LEEAHELSEDHYKKYLAKLRSINPPCVPFFGIYLTNILKTEEGNPEVLKRHGKELINFSK
RRKVAEITGEIQQYQNQPYCLRVESDIKRFFENLNPMGNSMEKEFTDYLFNKSLEIEPRN
PKPLPRFPKKYSYPLKSPGVRPSNPRPGTMRHPTPLQQEPRKISYSRIPESETESTASAP
NSPRTPLTPPPASGASSTTDVCSVFDSDHSSPFHSSSDTIFIQVTLPHGPRSASVSSINL
TKSTDEMPIPPPVPPRRRPESAPAESSPSKIMSKHLDSPPAIPPRQPTSKVYSPRYSDRT
SMSDPPESPPLLPPREPVRTPDVFSSSPLHLQPPPLGKKSEHGNTFFPNSPSPFTPPPPQ
TPSPHGTRRHLPSPPLTQQDVDLHSIPGPPVPPRQSTSQHIPKLPPKTYKREHTHPSMHR
DGPPLLENAHSS
NT seq 3999 nt
atgcaggcgctgcagctgcagtacgagttcttcagcgaggagaacgcgcccaagtggagg
gggctgctggtgacggccctgaaaaaggtccagatgcaagtccatcctacacttgcatca
aatgaggatgctctccagtatgtggaagagttaattctacagctgttaagtatgctgtgc
caagctcaaccccgcagcgttttagatgttgaggatcgtgtacaaaaaagttttcctcat
ccaattgataagtgggcaatagctgatgctcaatctgctaatgaaaagaggaagcgaaga
aatccattatctctccctgtagaaaaaattcaccctttattgaaggaagttttagggtat
aaaattgaccaccaggtttctgtttacatagtagcagtattagaatacatttctgcagat
attttaaagctggttggaaattatgtacggaatatacgacattatgaaattacaaaacaa
gacattaaagtagcaatgtgtgctgataaggtattgatggatatgttccatcaagatgta
gaagatatcaatatattatctttaactgatgaagaaccttccacttcaggggaacaaaca
tactatgacctggtaaaggcatttatggctgaagttcgacaatatataagggaacttaat
ctcattattaaagtttttagagaaccatttgtctccaattcaaaattattttcttctcac
gatgtagaaaatatatttagtcgtatagcagatgttcatgaactcagtgtgaaattattg
ggccatatagaagacactgtagaaatgactgatgaaggcagtccccatccattagttggt
agctgctttgaagacttagcagaggaattggcatttgatccatatgaatcgtatgctcaa
gacatattgcgacctggttttcatgaccactttcttagtcagttatccaagcctggggca
gccctctatttgcagtcaataggtgaaggtttcaaagaagctgttcagtatgttttaccc
aggctacttctagctcctgtgtaccactgtctacattactttgaacttttaaagcagcta
gaagaaaagagtgaagaccaagaagacaaagaatgtttgaaacaagcaataacagctttg
cttaaccttcagagcagtatggaaagaatatgttccaagagtcttgcaaaacggagactg
agcgaatctgcatgtagattttatagccaacagatgaaggggaaacaactagcaataaag
aaaatgaatgagattcagaagaacattgatggctgggaaggaaaagacattgggcagtgt
tgcaatgagtttatcatggaaggaactctcacacgtgtaggtgccaaacatgagagacac
atatttctctttgatggcctgatgatttgttgtaaatcaaatcatggtcagccaagactt
cctggtgctagcaatgctgagtatcgtctcaaagagaagttttttatgcgcaaggtacaa
atcaatgataaagatgacactaatgaatacaaacatgcctttgaaatcatcttaaaagat
gaaaatagtgttatattttctgctaagtcagctgaggagaaaaacaactggatggcagca
ctgatatcattacagtaccgcagtacattggaaaggatgctggacgctacgatgttgcag
gaggagaaggaggagcagatgaggctgccaagttctgatctttatagatttgcagaacca
gactctgaagaaaacatagtctttgaagaaaacatgcaacccaaatctggaattccaatc
atcaaagccggaaccgttatcaaacttatagagcgactcacctaccatatgtatgctgat
cccaattttgttcggacatttcttacaacatatagatctttctgcaagcctcaagagcta
ctgagtcttctaatagagaggtttgaaattccagagcctgagccaacagaagctgatagg
atagcaatggagaatggagatcaacccctgagtgtggaactaaaacggtttagaaaagaa
tatatccaacctgtacaacttcgagtactaaacgtatgtcggcattgggtagaacaccac
ttctatgattttgaaagagatgcagatcttttgcaacgactggaagaatttattggaaca
gtaaggggcaaagcaatgaagaaatgggttgaatctatcactaaaattatccagcggaaa
aaaatggcaagagacaatggaccaggacataatattacttttgaaagttcacctcctgca
gttgagtggcatataagcagacctggacacacagagacttttgacctactcaccttgcac
ccaatagaaattgctcgacagctcactttacttgaatcagatctctaccgagctgtacag
ccatcagaattagttggaagtgtgtggacaaaagaagacaaagaaattaattctcctaac
cttctgaaaatgatccgccatacaaccaatcttactttgtggtttgaaaagtgtattgta
gaagcagaaaacctagaagaaagagtagctgtggtgagtcgaataattgagatccttcaa
gtctttcaagaactgaacaactttaatggtgtacttgaggttgtcagtgccatgaactca
tcacctgtttatagactggaccacacatttgagcaaattccaagccgccaaaagaaaatt
ttagaggaagctcatgaactgagtgaagatcactataaaaaatatttggcaaaactcagg
tctattaatccaccatgtgtgcctttctttgggatttatttgacaaatatcttgaaaaca
gaagaaggcaatcctgaggttttgaaaagacatggaaaagaacttataaactttagcaaa
aggagaaaagttgcagaaataacaggagagatacagcagtaccaaaatcaaccatattgt
ttgcgagtagaatctgatatcaaaagattctttgaaaatttgaatccaatgggaaatagc
atggaaaaagaattcacagattatcttttcaacaaatcactggaaattgaaccaagaaat
cccaagcctctaccaagatttccaaaaaagtacagttatcccctcaagtctcctggtgtt
cgtccatcaaacccaagaccaggtaccatgagacacccaacacccctgcaacaggagcca
aggaagattagttacagtcggatccctgaaagtgaaacagaaagtacggcgtctgcacct
aactctccaagaacaccattaacacctcctcctgcctctggtgcttccagcaccaccgat
gtgtgcagtgtgttcgattctgatcactcaagcccttttcactcaagcagcgataccatc
tttatccaagttacactgccccatggcccaagatctgcttcagtatcctcgataaattta
accaagagcactgatgaaatgcccatccccccacctgtgcccccaagaagaagaccagag
tctgccccagcagaatcttctccctctaagattatgtccaaacatttggatagtcctcca
gcaatccctcctcggcaacccacatcaaaagtctattcaccacgttactctgatcggacc
tcgatgtcagaccctccagaaagccctccccttttaccaccacgagaacccgtgaggaca
cctgatgttttctccagctcaccactacatctccagccaccccctttgggcaaaaaaagt
gaacatggcaatactttcttcccaaacagcccctccccttttacaccaccacctcctcaa
acaccttctcctcatggcacaaggaggcatctgccatcaccaccattaacccaacaagat
gtggaccttcattccatccctgggccacctgttcctccacgacaaagcacttctcaacac
atcccaaaacttcctccaaaaacttacaaaagggagcacacacacccatccatgcataga
gatggaccgcctttgttggagaatgcccattcctcctga

KEGG   Phascolarctos cinereus (koala): 110211774
Entry
110211774         CDS       T05867                                 

Gene name
SOS2
Definition
(RefSeq) son of sevenless homolog 2
KO
K03099  son of sevenless
Organism
pcw  Phascolarctos cinereus (koala)
Pathway
pcw01521  EGFR tyrosine kinase inhibitor resistance
pcw01522  Endocrine resistance
pcw04010  MAPK signaling pathway
pcw04012  ErbB signaling pathway
pcw04014  Ras signaling pathway
pcw04062  Chemokine signaling pathway
pcw04068  FoxO signaling pathway
pcw04072  Phospholipase D signaling pathway
pcw04150  mTOR signaling pathway
pcw04151  PI3K-Akt signaling pathway
pcw04510  Focal adhesion
pcw04540  Gap junction
pcw04630  JAK-STAT signaling pathway
pcw04650  Natural killer cell mediated cytotoxicity
pcw04660  T cell receptor signaling pathway
pcw04662  B cell receptor signaling pathway
pcw04664  Fc epsilon RI signaling pathway
pcw04714  Thermogenesis
pcw04722  Neurotrophin signaling pathway
pcw04810  Regulation of actin cytoskeleton
pcw04910  Insulin signaling pathway
pcw04912  GnRH signaling pathway
pcw04915  Estrogen signaling pathway
pcw04917  Prolactin signaling pathway
pcw04926  Relaxin signaling pathway
pcw04935  Growth hormone synthesis, secretion and action
pcw05034  Alcoholism
pcw05160  Hepatitis C
pcw05161  Hepatitis B
pcw05163  Human cytomegalovirus infection
pcw05165  Human papillomavirus infection
pcw05200  Pathways in cancer
pcw05205  Proteoglycans in cancer
pcw05206  MicroRNAs in cancer
pcw05210  Colorectal cancer
pcw05211  Renal cell carcinoma
pcw05213  Endometrial cancer
pcw05214  Glioma
pcw05215  Prostate cancer
pcw05220  Chronic myeloid leukemia
pcw05221  Acute myeloid leukemia
pcw05223  Non-small cell lung cancer
pcw05224  Breast cancer
pcw05225  Hepatocellular carcinoma
pcw05226  Gastric cancer
pcw05231  Choline metabolism in cancer
Brite
KEGG Orthology (KO) [BR:pcw00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04010 MAPK signaling pathway
    110211774 (SOS2)
   04012 ErbB signaling pathway
    110211774 (SOS2)
   04014 Ras signaling pathway
    110211774 (SOS2)
   04630 JAK-STAT signaling pathway
    110211774 (SOS2)
   04068 FoxO signaling pathway
    110211774 (SOS2)
   04072 Phospholipase D signaling pathway
    110211774 (SOS2)
   04151 PI3K-Akt signaling pathway
    110211774 (SOS2)
   04150 mTOR signaling pathway
    110211774 (SOS2)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    110211774 (SOS2)
   04540 Gap junction
    110211774 (SOS2)
  09142 Cell motility
   04810 Regulation of actin cytoskeleton
    110211774 (SOS2)
 09150 Organismal Systems
  09151 Immune system
   04650 Natural killer cell mediated cytotoxicity
    110211774 (SOS2)
   04660 T cell receptor signaling pathway
    110211774 (SOS2)
   04662 B cell receptor signaling pathway
    110211774 (SOS2)
   04664 Fc epsilon RI signaling pathway
    110211774 (SOS2)
   04062 Chemokine signaling pathway
    110211774 (SOS2)
  09152 Endocrine system
   04910 Insulin signaling pathway
    110211774 (SOS2)
   04912 GnRH signaling pathway
    110211774 (SOS2)
   04915 Estrogen signaling pathway
    110211774 (SOS2)
   04917 Prolactin signaling pathway
    110211774 (SOS2)
   04926 Relaxin signaling pathway
    110211774 (SOS2)
   04935 Growth hormone synthesis, secretion and action
    110211774 (SOS2)
  09156 Nervous system
   04722 Neurotrophin signaling pathway
    110211774 (SOS2)
  09159 Environmental adaptation
   04714 Thermogenesis
    110211774 (SOS2)
 09160 Human Diseases
  09161 Cancer: overview
   05200 Pathways in cancer
    110211774 (SOS2)
   05206 MicroRNAs in cancer
    110211774 (SOS2)
   05205 Proteoglycans in cancer
    110211774 (SOS2)
   05231 Choline metabolism in cancer
    110211774 (SOS2)
  09162 Cancer: specific types
   05210 Colorectal cancer
    110211774 (SOS2)
   05225 Hepatocellular carcinoma
    110211774 (SOS2)
   05226 Gastric cancer
    110211774 (SOS2)
   05214 Glioma
    110211774 (SOS2)
   05221 Acute myeloid leukemia
    110211774 (SOS2)
   05220 Chronic myeloid leukemia
    110211774 (SOS2)
   05211 Renal cell carcinoma
    110211774 (SOS2)
   05215 Prostate cancer
    110211774 (SOS2)
   05213 Endometrial cancer
    110211774 (SOS2)
   05224 Breast cancer
    110211774 (SOS2)
   05223 Non-small cell lung cancer
    110211774 (SOS2)
  09165 Substance dependence
   05034 Alcoholism
    110211774 (SOS2)
  09172 Infectious disease: viral
   05161 Hepatitis B
    110211774 (SOS2)
   05160 Hepatitis C
    110211774 (SOS2)
   05163 Human cytomegalovirus infection
    110211774 (SOS2)
   05165 Human papillomavirus infection
    110211774 (SOS2)
  09176 Drug resistance: antineoplastic
   01521 EGFR tyrosine kinase inhibitor resistance
    110211774 (SOS2)
   01522 Endocrine resistance
    110211774 (SOS2)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   04990 Domain-containing proteins not elsewhere classified [BR:pcw04990]
    110211774 (SOS2)
Domain-containing proteins not elsewhere classified [BR:pcw04990]
 Pleckstrin homology (PH) domain-containing proteins
  Dbl-Like RhoGEF family proteins
   110211774 (SOS2)
Motif
Pfam: RasGEF RasGEF_N RhoGEF PH Histone IQ_SEC7_PH PH_19 PH_13 PH_9
Other DBs
NCBI-GeneID: 110211774
NCBI-ProteinID: XP_020846968
UniProt: A0A6P5KMT3
Position
Unknown
AA seq 1329 aa
MQPPQQPYDFFSEENHPKWRGLFVPALRKVQHQVHPNLSAKEDSLYYIEELILQLLNKLC
IAQPRTVQDVEERVQKTFPHPIDKWAIADAQSAIEKRKRRTPLLLPVDKIHPLLKEVLGY
KVDYHVSLYIVAVLEYISADILKLAGNYVFNIRHYEISQQDIKVSMCADKVLMDMFDQDD
IGLVSLCEDEPFSSGELNYYDLVRTEIAEERQYLRELNLIIKVFREAFLSNKKLFASSDI
EGIFSNILDIHELTVKLLGLIEDTVEMTDESSPHPLAGSCFEDLAEEQAFDPYETLSQDI
LSPQFHERFNTLMAKPAVSLHFQSIADGFKEAVRYVLPRLMLVPVYHCSHYFELLEQLQE
CSEEQEDRECLKQAITALLNFRCSMERICNKHSPRRRPADPVCRFYNRQLRSKHLAIKKM
NEIQKNIDGWEGKDIGQCCNEFIMEGPLIRVGAKHERHIFLFDGLMISCKTNHGQSRIPG
YSNAEYRLKEKFIMRKIQICDKDDTSEYKHAFELISKDENSIIFAAKSTEEKNNWMAALI
SLQYRSTLDRMLDSVLLKEENEQPLRLPSPEVYRFVVKDCEENIVFEDNLQSGIPVIKGG
TVVKLIERLTYHMYADPNFVRTFLTTYRSFCKPQELLKLLIERFEIPEPEPTEADKLAVE
KGEQPISADLKRFRKEYVQPVQLRILNVFRHWVEHHFYDFERDLELLEKLESFISSVKGK
TMKKWVESIVKIIKRKKQAQANGISHNITFESPPPPIEWHISRPGQYETFDLMTLHPIEI
ARQLTLLESDLYRAVQPSELVGSVWTKEDKEINSPNLLKMIRHTTNLTLWFEKCIVEAEN
FEERVAILSRIIEILQVFQDLNNFNGVLEVVSAVNSVSVYRLDHTFEALQERKRKILDEA
VELSQDHFKKYLAKLKSINPPCVPFFGIYLTNILKTEEGNNDFLKKQGKELINFSKRRKV
AEITGEIQQYQNQPYCLRIESEIRRFFENLNPMGNASDKEFTDYLFNKSQEIEPRNCKQP
PRFPRKTTFSLKSPGIRPHAGRHGSTSGTLRGHPTPLEREPCKISFSRINEAEHESAASA
PTSPNTPSTPPVSASSDLSVFLDIDLNSSYGSNSIFAPVVLPHSKSFFSSCGSLHKLTEE
PLVPPPLPPRKKFDQDVSASKGNLKYDDDPPAIPPRQPPPPKIRPRVPAYSGPFEGPLPS
PPPPPPRDPLPETPPPVPLRPPEHFINYPFNLQPPPMGHTHRDPDWFREASTCPNSPNTP
PSTPSPRVPRRCVLSSSQNNLAHSQAPPIPPRQNSSPHLPKLPPKTYKRDLCQPPMYRLS
LLENAETPQ
NT seq 3990 nt
atgcagccgccgcagcagccctacgacttcttcagcgaagagaaccacccgaaatggcgg
ggactcttcgtcccggccctgcggaaggttcagcatcaagttcaccccaatctctcagca
aaagaagattctttatattatattgaagagttgatccttcaactgctaaataaattgtgc
attgcgcagccaagaactgttcaagatgtagaggaacgtgttcaaaagacttttcctcat
ccaattgacaagtgggctatcgctgatgcacagtctgccatagaaaaacgaaaacgaagg
acccctctcttgttgcctgtggataaaattcatcctttattgaaggaagttttagggtat
aaagtagattatcatgtctctctttatattgtggctgtattggagtatatctcagctgat
attctgaaattggctggtaattatgtttttaatatcagacattatgaaatatcccagcag
gacattaaagtatcaatgtgtgcagataaggttttaatggacatgtttgatcaggatgac
attggtttggtttcgctctgtgaagacgaacccttttcttcaggtgaactaaattactat
gatttagtcagaactgaaattgcagaagaaagacaatatctacgggaattaaatctgata
ataaaagtatttcgggaggctttcctttcaaacaaaaagctgtttgcatcttctgatatt
gaagggatattcagcaacattttagatattcatgaattgaccgtaaagcttttgggcttg
attgaagacacagttgaaatgactgatgaaagcagtcctcatcccttagctggcagttgt
tttgaagatctagcagaagaacaagcatttgatccctatgaaaccttgtcacaggatatt
ctttcaccacaatttcatgagcgtttcaatactttgatggccaaacctgctgtttctcta
cattttcagtccattgctgatggctttaaagaggctgtacggtatgttcttccacgcctt
atgctagtgccagtttaccattgttcacactactttgaattattagagcaattgcaagaa
tgcagtgaagagcaagaagacagagaatgtttgaaacaagctattactgctcttctgaat
ttccgttgtagcatggaacggatttgcaacaagcattcacctagacggcgacctgcggat
cctgtttgtcgattttataatcgtcaattacggagcaagcacctggccattaaaaaaatg
aatgaaattcaaaaaaatattgatggatgggaaggcaaagatattggccaatgttgtaat
gaatttataatggaaggcccattgataagagtaggagctaagcatgaacgccatattttt
ctttttgatggcttaatgattagttgtaaaactaatcatggacagtcccggattccaggt
tatagcaatgcagaatacagattgaaagaaaaatttatcatgaggaaaatacaaatttgt
gataaagatgacacttctgaatacaagcatgcctttgaattgatttccaaagatgaaaac
agcattatatttgctgctaagtccactgaagagaaaaacaattggatggcagccttgatt
tctcttcagtatcgtagtacactagatcgaatgctagattcagtattattaaaggaagaa
aatgaacaaccactgaggttgccaagtccagaagtatatcgctttgttgtgaaagactgt
gaggaaaacattgtttttgaagacaacttgcaaagtggaatccctgtcattaaaggagga
actgtggtgaaattgattgaaaggctaacataccacatgtatgcagatcccaattttgtg
cgtacttttcttacaacctatcgttcattttgtaaaccacaggaattgttgaagttattg
attgaacggtttgaaatccctgagccagaacctactgaagcagataaattggcagtggag
aaaggagagcaaccaatcagtgcagatctgaaacggtttcgcaaagaatatgtccaacca
gtacaacttaggatactaaatgtatttcggcactgggtagaacatcatttttatgatttt
gagcgagatctggagttactcgaaaaattagaatcctttatttcaagcgtaaaaggaaaa
accatgaagaagtgggtagaatcaattgttaaaatcatcaagaggaaaaaacaagctcag
gcaaatggaattagccataatattactttcgaaagtccaccaccaccaattgagtggcat
attagcagaccaggacagtatgaaacatttgatctcatgacccttcatccaatagaaatt
gcacgtcaactaacacttctggaatctgacctctacagggcagtgcagccttctgaactt
gtgggaagtgtgtggactaaagaagataaagaaataaattccccaaatttattgaaaatg
attcggcatactacaaatcttactctctggtttgaaaagtgcatagtagaagcagaaaac
tttgaagaacgagtggcaatattaagtagaattatagaaattttgcaagtttttcaagac
ttgaacaatttcaatggtgtattagaggtagtcagtgcagtgaattcagtatctgtgtac
cgattggaccacacatttgaagcattacaggaaagaaaacgcaagattttagatgaagcc
gtagaattaagccaagatcactttaaaaaatacctagcaaaacttaaatcaatcaaccca
ccttgtgtgcctttttttggaatatatttaacaaatattctgaagactgaagaagggaat
aatgatttccttaaaaaacaagggaaagaactaatcaactttagtaaaagaaggaaagta
gccgaaattactggagagatacagcagtatcagaatcagccttattgtttacggatagaa
tcagaaataaggaggttcttcgaaaatctcaaccctatgggaaatgcatcagacaaagaa
tttacagattatttattcaataaatcacaagaaattgaaccccgaaactgtaaacagcca
cctcgatttcctaggaaaacaactttctccttaaaatctcctggtataagaccacatgct
ggcagacatggctctacttcaggcactttacgaggtcacccaacaccattagaaagagaa
ccatgtaaaataagcttcagtcggataaatgaggctgagcatgaatcagcagcatcagca
ccaacctctccaaacacaccatctactccaccagtctctgcttcttcagacctcagtgta
tttttagatatagatcttaacagttcttatggaagcaacagcatctttgccccagtggtc
ttaccacattcaaaatctttcttcagttcatgtggtagtttacataaattaactgaagaa
ccactggttcctcctcctcttcctcctcgtaaaaagtttgatcaagatgtttcagcttct
aagggaaacttgaaatatgatgatgatcctcctgctattccaccaaggcagccaccacct
ccaaagataagacctcgagttcctgcttacagtggtccatttgaagggcctttgcctagc
ccaccaccaccacctccacgagaccctcttcctgagactcctccacctgtaccccttcgg
cctcccgaacatttcataaactatccgtttaatcttcagccaccaccaatgggacatact
cacagagatccggattggttccgggaagccagtacatgtccaaactcaccgaatactcct
ccaagcacaccctctccaagggtgccacgtcgctgcgtgctcagttctagtcaaaataat
cttgctcattctcaagctccccctatcccaccaagacagaattcaagccctcacttacca
aaactgccaccaaagacttacaaaagagatctttgtcaacctcctatgtacagattgtca
ttgttagaaaatgcagaaactcctcagtga
