KEGG   Saimiri boliviensis boliviensis (Bolivian squirrel monkey): 101034942
Entry
101034942         CDS       T04350                                 

Gene name
APC2
Definition
(RefSeq) adenomatous polyposis coli protein 2
  KO
K02085  adenomatosis polyposis coli protein
Organism
sbq  Saimiri boliviensis boliviensis (Bolivian squirrel monkey)
Pathway
sbq04310  Wnt signaling pathway
sbq04390  Hippo signaling pathway
sbq04550  Signaling pathways regulating pluripotency of stem cells
sbq04810  Regulation of actin cytoskeleton
sbq04934  Cushing syndrome
sbq05010  Alzheimer disease
sbq05022  Pathways of neurodegeneration - multiple diseases
sbq05165  Human papillomavirus infection
sbq05200  Pathways in cancer
sbq05206  MicroRNAs in cancer
sbq05210  Colorectal cancer
sbq05213  Endometrial cancer
sbq05217  Basal cell carcinoma
sbq05224  Breast cancer
sbq05225  Hepatocellular carcinoma
sbq05226  Gastric cancer
Brite
KEGG Orthology (KO) [BR:sbq00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04310 Wnt signaling pathway
    101034942 (APC2)
   04390 Hippo signaling pathway
    101034942 (APC2)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04550 Signaling pathways regulating pluripotency of stem cells
    101034942 (APC2)
  09142 Cell motility
   04810 Regulation of actin cytoskeleton
    101034942 (APC2)
 09160 Human Diseases
  09161 Cancer: overview
   05200 Pathways in cancer
    101034942 (APC2)
   05206 MicroRNAs in cancer
    101034942 (APC2)
  09162 Cancer: specific types
   05210 Colorectal cancer
    101034942 (APC2)
   05225 Hepatocellular carcinoma
    101034942 (APC2)
   05226 Gastric cancer
    101034942 (APC2)
   05217 Basal cell carcinoma
    101034942 (APC2)
   05213 Endometrial cancer
    101034942 (APC2)
   05224 Breast cancer
    101034942 (APC2)
  09164 Neurodegenerative disease
   05010 Alzheimer disease
    101034942 (APC2)
   05022 Pathways of neurodegeneration - multiple diseases
    101034942 (APC2)
  09167 Endocrine and metabolic disease
   04934 Cushing syndrome
    101034942 (APC2)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    101034942 (APC2)
 09180 Brite Hierarchies
  09181 Protein families: metabolism
   01009 Protein phosphatases and associated proteins [BR:sbq01009]
    101034942 (APC2)
  09182 Protein families: genetic information processing
   03036 Chromosome and associated proteins [BR:sbq03036]
    101034942 (APC2)
  09183 Protein families: signaling and cellular processes
   04812 Cytoskeleton proteins [BR:sbq04812]
    101034942 (APC2)
Protein phosphatases and associated proteins [BR:sbq01009]
 Protein serine/threonine phosphatases
  Phosphoprotein phosphatases (PPPs)
   Protein phosphatase-1
    PP1-interacting proteins (PIPs)
     101034942 (APC2)
Chromosome and associated proteins [BR:sbq03036]
 Eukaryotic type
  Centrosome formation and ciliogenesis proteins
   Other centriole associated proteins
    101034942 (APC2)
Cytoskeleton proteins [BR:sbq04812]
 Eukaryotic cytoskeleton proteins
  Microtubules
   Tubulin-binding proteins
    EB / APC
     101034942 (APC2)
SSDB
Motif
Pfam: APC_rep APC_N_CC APC_r APC_basic Suppressor_APC Arm Arm_APC_u3 SAMP bZIP_1 HALZ APG6_N DUF1241 Herpes_BLRF2
Other DBs
NCBI-GeneID: 101034942
NCBI-ProteinID: XP_010330774
LinkDB
Position
Unknown
AA seq 1988 aa
MGLLGLLGLLHSAFFGDQTLQELKMTSSVASYEQLVRQVEALKAENSHLRQELRDNSSHL
SKLQTETSGMKEVLKHLQGKLEQEARVLVSSGQTEVLEQLKALQMDITSLYNLKFQPPAL
GPEPAARTPEGSPVHGPGPSKDSFGELSRATIRLLEELDRERCFLLNEIEKEEKEKLWYY
SQLQGLSKRLDELPHVETQFSMQMDLIRQQLEFEAQHIRSLMEERFGTSDEMVQRAQIRA
SRLEQIDKELLEAQDRVQQTEPQALLAVKSVPVDEDPETEVPTHPEDGTPQPGNSKVEVV
FWLLSMLATRDQEDTARTLLAMSSSPESCVAMRRSGCLPLLLQILHGTGTEAGGRAGAPG
APGAKDARMRANAALHNIVFSQPDQGLARKEMRVLHVLEQIRAYCETCWDWLQAREGGPE
GGGVPTAPVPIEPQICQATCAVMKLSFDEEYRRAMNELGGLQAVAELLQVDYEMHKMTRD
PLNLALRRYAGMTLTNLTFGDVANKATLCARRGCMEAIVAQLASDSEELHQVVSSILRNL
SWRADINSKKVLREAGSVAALVQCVLRATKVGTWWGAGVEGGEQPRFFSHWVGRQWRQGV
SFPTWEEIKLNNSLLSRWSSLRWAASPSSCHCSLRVSVSWARFRNPNLTSRVHFPLAHPA
SPPPGGQGVDQADAPCALRQVLRDHNCLQTLLQHLTSHSLTIVSNACGTLWNLSARSARD
QELLWDLGAVGMLRNLVHSKHKMIAMGSAAALRNLLAHRPAKHQAAATAVSPGSCVPSLY
VRKQRALEAELDARHLAQALEHLEKQGLPAADAAAKKPLPPLRHLDGLAQDYASDSGCFD
DDDAPSSLAAAAATAEPASPAALSLFLGSPFLQGQALARTPPTRRGGTEAEKEASGEAAV
AAKAKAKLALAVARIDQLVEDISALHTSSDDSFSLSSGDPGQEAPREGRAQSCSPCRGPE
GGRRDAGNRGHRLNSGSASDGYCPREHMQPCPLAALASRREDPRCGQPRPSRLDLDLPGS
QAEAPAREAASGDARVCTIKLSPTYQHVPLLEGATRAGVEPLTGTGTLPGARKQAWLPAD
PLSKVPEKLAAAPLSAASKALQKLVAQEAPLSLSRCSSLSSLSSAGRPGPSEGGDLDDTD
SSLEGLEEAGPGEAELDGAWTAPGAASLPVAIPAPRRNRGRGLGVEDATPSSSSENYVQE
TPLVLSRCSSVSSLGSFESPSIASSIPSDPCSGQGSGTISPSELPDSPGQTMPPSRSKTP
PLAPAPAGPPEASQFSLQWESYVKRFLDIADCRERCRLPSELDAGSVRFTVEKPDENFSC
ASSLSALALHEHYVQQDVELRLLPSACAVPARLRKVASALVPGRRALPVPVYMLVPAPAP
ARAQEDDSCTDSAEGTPVNFSSAASLSDETLQGPPRDQPGGPAGRQRPTSRPTSAKQPGG
HRHKAGGAGRSAEQAQGAGKNRAGLELPLGRPPSTPAYKDGAKPSRTRGDGALQSLCLTT
PTEEAVYCFYGNDSDEEPPAAAPTPIHRRASAIPRALTRERPHGREAHAPSKAAPSALPP
ARAQPSLIADETPPCYSLSSSASSLSEPEPPETSAPPAGRPRDREPSITKDPGPGGSLDS
SPSPRAAEELLQHCISSALPRRRPPVSALRRRKPRTTRLDERPAEGSRERGEEAAGSDRA
SDLDSVEWRAIQEGANSIVTWLHQAAAATREASSESDSILSFVSGLSVGSTLQPSKHRKG
RQAGGETGNTRRPEKRGTASAKTSGSSRSSAGPEKPRGTQKTTPGVPAVLRGRTVIYVPS
PAPRAQPKGTPGPRTTPRKVAPPCLAQPAAPAKVPSPGQQRSRSLHRPGKTSELATLSQP
PRSATPPARLTSDAVVQTEEVAAPKTNSSTSPSLESREHPGVPAGAPLPLLGSDVDGPSL
AKAPISAPLVHEGLGVAVGGFPASRHGSPSRSARVPPFNYVPSPMVAAATTDSAAEKAPA
TASASLLE
NT seq 5967 nt   +upstreamnt  +downstreamnt
atggggctcctggggctcctgggcctgctgcactcggccttcttcggggaccagacgctg
caggagctgaagatgacgagctccgtggcatcctacgagcagctggtgcggcaggtggag
gccttgaaggctgagaacagccacctgaggcaggagctccgggacaactccagccacctg
tccaagctgcagacagagacgtcgggcatgaaggaagtcttgaagcatctacaggggaag
ctggagcaggaggcccgagtgttggtgtcctcgggacagacggaggtgctagagcagctg
aaggccctgcagatggacatcaccagcctgtacaacctcaaattccagccgcccgccctg
ggcccggagcctgccgcccggaccccggagggcagcccggtacatggccccgggccctct
aaagacagctttggggagctgagccgggccaccatccggctgctggaggaactggaccgg
gaacggtgtttcctgctgaatgagattgagaaggaggagaaggagaagctctggtactac
tctcagctgcagggcctgtccaagcgcctggacgagctgccgcacgtggagacacagttc
tcgatgcagatggacctgatacggcagcagctggagttcgaggcccagcacatccgctcg
ctgatggaggagcgctttggcacctcggacgagatggtgcagcgggcgcagatccgtgcc
tcacgcctggagcagattgacaaggagctgctggaggcacaggaccgagtgcagcagacg
gagccccaggccttgctggcggtgaagtcggtgccggtggacgaggatcccgagacagag
gtccccacacacccagaagatggcacccctcagccaggcaacagcaaggtggaggtggtc
ttctggctcttgtccatgttggcgacacgcgaccaggaggatacggcgcgcacgctgctg
gccatgtccagctcgccggagagctgcgtggccatgcgccgctcgggctgcctgccgctg
ctgctgcaaatcctccacggcaccggcaccgaggccgggggtcgcgccggggctccaggg
gcaccgggcgccaaggacgcacgcatgcgtgccaatgcggcgctgcacaacatcgtcttc
tcgcagccggaccaggggctggcgcgcaaggagatgcgcgtcctgcacgtgctggagcag
atccgcgcctactgcgagacgtgctgggactggctgcaggcccgggagggcgggcccgag
ggagggggagttcccacagccccggtccccatcgagccacagatctgccaggccacctgt
gcagtgatgaagctgtcctttgacgaggaataccgccgtgccatgaacgagttaggcggg
ctgcaggccgtggcggagctgctgcaggtcgactacgagatgcacaagatgacccgggac
ccgctgaacctggcgctgcgccgctacgcgggcatgacccttaccaacctcactttcggg
gacgtggccaacaaggccacactgtgtgcgcgccggggctgcatggaggccatcgtggcc
cagctggcctcagacagtgaggagctccaccaggtggtgtccagcatcctccgcaacctc
tcctggcgggccgacatcaacagcaagaaggtgctgagggaggcgggcagtgtggccgcc
ctggtgcagtgtgtcctacgggccaccaaggtgggcacctggtggggagcaggggttgag
ggaggggaacagccacggttcttcagtcactgggtagggagacaatggcggcaaggggtg
agctttccgacttgggaggaaatcaagttgaacaactcactgcttagcaggtggtccagc
ctcaggtgggcggcgtctccctcttcgtgtcactgctctctgagagtgagtgtaagctgg
gcgcgtttccgaaatccaaacctaacaagccgtgttcactttcccctggcacatccggcc
tccccacccccgggtgggcagggggtggaccaggctgacgcgccctgtgccctcaggcag
gtgctccgggatcacaactgtctgcagacgctgctgcagcacctgacctcgcacagcctg
accatcgtgagcaacgcgtgcggcacactctggaacctgtcggcccgcagcgcgcgtgac
caggagctgctgtgggacctgggcgccgttggcatgctgcggaacctggtgcactccaaa
cacaagatgatcgccatgggcagcgccgccgccctgcgcaacctgctggcccaccggccc
gccaagcaccaggcggccgccaccgccgtgtccccgggcagctgcgtgcccagcctgtac
gtgcgcaagcagcgggcgctggaggccgaactggacgcgcggcacctggcgcaggcgctg
gagcacctggagaagcagggcctgcccgccgccgacgctgctgctaagaagccgctgcca
cccctgcggcacctggacggcctggctcaagactatgcttccgactcgggctgctttgac
gacgacgacgcaccgtcatccctagctgcggctgcggccactgcggagccggccagcccc
gcagcactgtcactctttctgggcagccccttcctgcaggggcaggcgctggcccgcacc
ccgcccacgcgccgcggtggcacggaggcggagaaggaggccagtggggaggcggccgtg
gcggccaaggccaaggccaagctggcgcttgcagtggcgcgcatcgaccagctggtagag
gacatctccgccctgcacacctcctccgacgacagcttcagcctcagctctggggacccc
gggcaggaggcgccacgggagggccgtgcccagtcttgctcgccatgccgcggtcccgag
ggcgggcggcgggacgcaggcaaccggggacaccgcctcaacagcggcagtgccagcgac
gggtactgcccacgcgagcacatgcagccctgcccgctggccgcactggcttcgcgccgc
gaggaccccaggtgtgggcagcctcggcccagccggcttgaccttgacctgccaggcagc
caggccgaggcccccgccagggaggccgcatccggcgacgcccgtgtgtgtaccatcaag
ctgtcgcccacctaccagcacgtgccgctgctcgagggtgccacaagggcgggtgtagag
cccctcacggggactgggacccttccaggggcccgaaagcaggcctggctgccggcagac
cctctgagcaaggttcccgagaagctggcggcggccccactgtccgctgccagtaaggca
ctgcagaagctggtagcgcaggaggcgccgctctcgctgtcccgatgcagctccctgtcc
tcgctgtcctcggcaggtcgcccaggccccagtgaggggggtgacctggacgacactgac
tcctccctggagggactggaggaggcaggccccggcgaggctgagctggacggtgcgtgg
acagcgcccggggctgcctccctgcccgtagccatcccggctccccggcgtaaccgaggc
cggggcctgggggtggaggacgccacgccgtccagctcgtcggagaactacgtgcaggag
acgccactggtgctcagccgctgcagctctgtgagctcgctgggcagcttcgagagcccg
tccatagccagctccatccccagtgacccctgcagcgggcagggcagcggcaccatcagc
cctagcgagctgcccgacagccctgggcagaccatgccgcccagccggagcaagacgccg
ccgctggcgcccgcgccggcaggtccccccgaggccagccagttcagcctgcagtgggag
agctacgtgaagcgcttcctggacattgcggactgccgcgagcgctgccggctcccgtcg
gagctggacgccggcagcgtgcgcttcaccgtggagaagccggatgagaacttctcgtgc
gcctccagcctcagcgcactggccctgcacgagcactatgtccagcaggacgtggagctg
cggctgctgccctcggcctgcgccgtgcctgcccggctgcgcaaggtggcctccgcgctg
gtgccaggtcgccgcgcgctgcccgtgcccgtctacatgttggtgcccgctccggccccg
gcccgggcccaggaggacgactcctgcaccgactccgcggagggcacgccggtcaacttc
tccagcgccgcctccctcagcgacgaaacgctgcagggaccccccagagaccagcctggg
ggaccagcgggcaggcagagacccaccagccgccccacctcggccaagcagcccgggggg
caccggcacaaggcgggaggcgccggccgcagcgcggagcaggcccagggtgcgggcaag
aaccgagcagggctggagctgcccctgggccggcccccgagcacccccgcatacaaggac
ggcgcaaagcccagccggacccgcggggatggggcactccagtcgctgtgcctcacgacg
cccacggaggaggccgtgtactgcttttacggcaatgactccgacgaggagcccccggcg
gccgcaccaacgcccattcaccggcgcgcatcggccattccccgcgcgctcacgcgggag
cgcccgcacggtcgggaggcccacgccccgtccaaggcagcaccgtctgccctgccgccc
gcccgggcccagcccagcctcatcgcagacgagaccccgccctgctactccctgagctcc
tcggccagctccctcagtgagcccgagcccccggagacatcggcgccgccggccggccgg
ccacgagacagggagccttcgatcaccaaggacccgggcccgggaggcagccttgacagc
tcgcctagccctcgggccgcggaggagctgctgcagcactgcatcagctcggccctgccc
agacgccggcctcccgtgtccgccctgcggcgccgcaagccccgcaccacgcggctggat
gagcggcccgcagagggatcccgggagcgcggcgaggaggcagcgggctcggaccgggcc
tcagacctggacagcgtggagtggcgagccatccaggagggcgccaactccatcgtcacg
tggctgcaccaggcggcggctgccacgcgcgaggcctcgtctgagtctgactctatcctg
tccttcgtatctgggctgtcggtgggctccaccctgcagccctccaagcacaggaaggga
cgacaggcggggggcgaaaccggcaatacccggcggccagagaaacggggcacagcctca
gccaagactagcgggagctcccgttcctctgcgggccccgagaagccacgtggcacacag
aagaccacgcccggggtgcccgctgtgctccggggacgaacagtcatctatgtgcccagc
ccagcaccccgggcccagcccaaagggacccccggcccccgcaccacaccgcggaaggtg
gcgccgccttgcctggcacagcccgcggctccagccaaagtccccagccccggacagcag
cggtcaaggagcctacaccggcccggcaagacctcggagctggcgacgctgagccaaccg
cccaggagcgccacaccacccgcccgcctcaccagcgacgccgtcgtccagacagaggag
gtcgccgcccccaagaccaactccagcacgtccccgagcctggagagcagggagcacccc
ggagtccccgccggcgccccgctccccctcctcggcagcgacgtggacgggccaagcctc
gccaaggctcccatctccgcacccctcgtgcacgagggcctgggggttgccgtagggggc
ttccccgccagtcggcacggctcccccagccgctcggcccgagtaccccccttcaattat
gtgcccagccccatggtggctgcagccaccaccgactcagccgcggagaaagccccggcc
accgcctcggccagcctcctggaatag

KEGG   Saimiri boliviensis boliviensis (Bolivian squirrel monkey): 101049516
Entry
101049516         CDS       T04350                                 

Definition
(RefSeq) adenomatous polyposis coli protein isoform X1
  KO
K02085  adenomatosis polyposis coli protein
Organism
sbq  Saimiri boliviensis boliviensis (Bolivian squirrel monkey)
Pathway
sbq04310  Wnt signaling pathway
sbq04390  Hippo signaling pathway
sbq04550  Signaling pathways regulating pluripotency of stem cells
sbq04810  Regulation of actin cytoskeleton
sbq04934  Cushing syndrome
sbq05010  Alzheimer disease
sbq05022  Pathways of neurodegeneration - multiple diseases
sbq05165  Human papillomavirus infection
sbq05200  Pathways in cancer
sbq05206  MicroRNAs in cancer
sbq05210  Colorectal cancer
sbq05213  Endometrial cancer
sbq05217  Basal cell carcinoma
sbq05224  Breast cancer
sbq05225  Hepatocellular carcinoma
sbq05226  Gastric cancer
Brite
KEGG Orthology (KO) [BR:sbq00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04310 Wnt signaling pathway
    101049516
   04390 Hippo signaling pathway
    101049516
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04550 Signaling pathways regulating pluripotency of stem cells
    101049516
  09142 Cell motility
   04810 Regulation of actin cytoskeleton
    101049516
 09160 Human Diseases
  09161 Cancer: overview
   05200 Pathways in cancer
    101049516
   05206 MicroRNAs in cancer
    101049516
  09162 Cancer: specific types
   05210 Colorectal cancer
    101049516
   05225 Hepatocellular carcinoma
    101049516
   05226 Gastric cancer
    101049516
   05217 Basal cell carcinoma
    101049516
   05213 Endometrial cancer
    101049516
   05224 Breast cancer
    101049516
  09164 Neurodegenerative disease
   05010 Alzheimer disease
    101049516
   05022 Pathways of neurodegeneration - multiple diseases
    101049516
  09167 Endocrine and metabolic disease
   04934 Cushing syndrome
    101049516
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    101049516
 09180 Brite Hierarchies
  09181 Protein families: metabolism
   01009 Protein phosphatases and associated proteins [BR:sbq01009]
    101049516
  09182 Protein families: genetic information processing
   03036 Chromosome and associated proteins [BR:sbq03036]
    101049516
  09183 Protein families: signaling and cellular processes
   04812 Cytoskeleton proteins [BR:sbq04812]
    101049516
Protein phosphatases and associated proteins [BR:sbq01009]
 Protein serine/threonine phosphatases
  Phosphoprotein phosphatases (PPPs)
   Protein phosphatase-1
    PP1-interacting proteins (PIPs)
     101049516
Chromosome and associated proteins [BR:sbq03036]
 Eukaryotic type
  Centrosome formation and ciliogenesis proteins
   Other centriole associated proteins
    101049516
Cytoskeleton proteins [BR:sbq04812]
 Eukaryotic cytoskeleton proteins
  Microtubules
   Tubulin-binding proteins
    EB / APC
     101049516
SSDB
Motif
Pfam: Arm_APC_u3 APC_basic EB1_binding APC_r APC_u5 APC_u14 APC_u15 APC_rep APC_u9 SAMP APC_N_CC Arm APC_u13 APC_15aa Suppressor_APC JIP_LZII
Other DBs
NCBI-GeneID: 101049516
NCBI-ProteinID: XP_003920746
UniProt: A0A2K6V503
LinkDB
Position
Unknown
AA seq 2843 aa
MAAASYDQLLKQVEALKMENSNLRQELEDNSNHLTKLETEASNMKEVLKQLQGSIEDETM
TSSGQIDLLERLKELNLDSSNFPGVKLRSKMSLRSYGSREGSVSSRSGECSPVPMGSFPR
RGFVNGSRESTGYLEELEKERSLLLADLDKEEKEKDWYYAQLQNLTKRIDSLPLTENFSL
QTDMTRRQLEYEARQIRVAMEEQLGTCQDMEKRAQRRIARIQQIEKDILRIRQLLQSQAT
EAERSSQNKHETGSHEAERQNEGQGVAEISMATSGNGQGSTTRMDHETASVLSSSSTHSA
PRRLTSHLGTKVEMVYSLLSMLGTHDKDDMSRTLLAMSSSQDSCISMRQSGCLPLLIQLL
HGNDKDSVLLGNSRGSKEARARASAALHNIIHSQPDDKRGRREIRVLHLLEQIRAYCETC
WEWQEAHEQGMDQDKNPMPAPVEHQICPAVCVLMKLSFDEEHRHAMNELGGLQAIAELLQ
VDCEMYGLTNDHYSITLRRYAGMALTNLTFGDVANKATLCSMKGCMRALVAQLKSESEDL
QQVIASVLRNLSWRADVNSKKTLREVGSVKALMECALEVKKESTLKSVLSALWNLSAHCT
ENKADICAVDGALAFLVGTLTYRSQTNTLAIIESGGGILRNVSSLIATNEDHRQILRENN
CLQTLLQHLKSHSLTIVSNACGTLWNLSARNPKDQEALWDMGAVSMLKNLIHSKHKMIAM
GSAAALRNLMANRPAKYKDANIMSPGSSLPSLHVRKQKALEAELDAQHLSETFDNIDNLS
PKASHRSKQRHKQSLYGDYVFDTNRHDDNRSENFNTGNMTVLSPYLNTTVLPSSSSSRGS
LDSSRSEKDRSLERERGVGLGNYHPATENPGTSSKRGLQISTTAAQIAKVMEEVSAIHTS
QEDRSSGSTTELHCVTDERNALRRNSTAHTHSNTYNFTKSENSNRTCSMPYAKLEYKRSS
NDSLNSVSSSDGYGKRGQMKPSIESYSEDDESKFCSYGQYPADLAHKIHSANHMDDNDGE
LDTPINYSLKYSDEQLNSGRQSPSQNERWARPKHIIEDEMKQSEQRQSRSQSTTYPVYTE
TTDDKHLKFQPHFGQQECVSPYRSRGANGSETNRVGSNHGINQNVSQSLCQEDDYEDDKP
TNYSERYSEEEQHEEEERPTNYSIKYNEEKHLVDQPIDYSLKYSTDIPSSQKQSFSFSKS
SSGQSTKTEHISSSSENTSTPSSTAKRQNQLHPSSAQNRSGQTQKAATCKVSSINQETIQ
TYCVEDTPICFSRCSSLSSLSSAEDEIGCDQTTQEADSANTLQIAEIKENNGTRSTEDPV
SEVPAVSQHTRTKSSRLQGSSLSSESTRHKAVEFSSGAKSPSKSGAQTPKSPPEHYVQET
PLMFSRCTSVSSLDSFESRSIASSVQSEPCSGMVSGIISPSDLPDSPGQTMPPSRSKTPP
PPPQTAQTKREVPKSKAPSAEKRESGPKQAAVNAAVQRVQVLPDADTLLHFATESTPDGF
SCSSSLSALSLDEPFIQKDVELRIMPPVQENDNGNETESEQPKESNENQEKEVEKTVDSE
KDLLDDSDDDDIEILEECIISAMPTKSSHKAKKPAQTASKLPPPVARKPSQLPVYKLLPS
QNRLQPQKHVSFTPGDDMPRVYCVEGTPINFSTATSLSDLTIESPPNELAAGEGVRAGAQ
SGEFEKRDTIPTEGRSTEEAQGGKNSSVTIPELDDNKAEEGDILAECINSAMPKGKSHKP
FRVKKIMDQVQQASASSSATNKNQLDGKKKKPTSPVKPIPQNTEYRTRVRKNVDSKNNLN
AERAFSDNKDSKKQNLKNNSKDFNDKLPNNEDRVRGSFAFDSPHHYTPIEGTPYCFSRND
SLSSLDFDDDDVDLSREKAELRKGKENKESEAKVTSHTELTSNQQSANKTQAITKQPINR
GQPKPVLQKQSTFPQSSKDIPDRGAATDEKLQNFAIENTPVCFSHNSSLSSLSDIDQENN
NNKENEPIKGTEPPDSQGEPSKPQASGYAPKSFHVEDTPVCFSRNSSLSSLSIDSEDDLL
QECISSAMPKKKKPSRLKGDNEKHSPRNMGGILAEDLTLDLKDIQRPDSEHGLSPDSENF
DWKAIQEGANSIVSSLHQAAAAACLSRQASSDSDSILSLKSGISLGSPFHLTPDQEEKPF
TSNKGPRILKPGEKSTLETKKIESESKGIKGGKKVYRSLITGKVRSNSEISGQMKQPLQA
NMPSISRGRTMIHIPGVRNSSSSTSPVSKKGPPLKTQASKSPSEGQTATTSPRGAKPSVK
SELSPVARQTSQIGGSSKAPSRSGSRDSTPSRPAQQPLSRPIQSPGRNSISPGRNGISPP
NKLSQLPRTSSPSTASTKSSGSGKMSYTSPGRQMSQQNLTKQTGLSKNASSIPRSESASK
GLNQMNNGNGANKKVELSRMSSTKSSGSESDRSERPVLVRQSTFIKEAPSPTLRRKLEES
ASFESLSPSSRPASPTRSQAQTPVLSPSLPDMSLSTHSSVQPGGWRKLPPNLSPTIEYND
GRPAKRHDIARSHSESPSRLPINRSGTWKREHSKHSSSLPRVSTWRRTGSSSSILSASSE
SSEKAKSEDEKHVNSISGTKQNKENQVSAKGTWRKIKENEISPTNSTSQTVSSGATNGAE
SKTLIYQMAPAVSKTEDVWVRIEDCPINNPRSGRSPTGNTPPVIDSVSEKGNPNIKDAKD
NQAKQNVGNGSVPMRIVGLENRLNSFIQVDAPDQKGTETKPGQNNPVPASETNESSIVER
TPFSSSSSSKHSSPSGTVAARVTPFNYNPSPRKSSADSTSARPSQIPTPVNNTKKRDSKT
DSTDSSGTQSPKRHSGSYLVTSV
NT seq 8532 nt   +upstreamnt  +downstreamnt
atggctgcagcttcatatgatcagttgttaaagcaagttgaggcactgaagatggagaac
tcaaatcttcgacaagagctagaagataattccaatcatcttacaaaactggaaactgag
gcatctaatatgaaggaagtacttaaacaactacaaggaagtattgaagatgaaactatg
acttcttctggacagattgatttattagagcgtcttaaagagcttaacttagatagcagt
aatttccctggagtaaaactacggtcaaaaatgtcccttcgttcttatggaagccgggaa
ggatctgtatcaagccgttcaggagagtgcagtcccgttcctatgggttcatttccaaga
agagggtttgtaaacggaagcagagaaagtactggatatttagaagaacttgagaaagag
aggtcattgcttcttgctgatcttgataaagaagaaaaggaaaaagactggtattatgct
caacttcagaatctcactaaaagaatagatagtcttcctttaactgaaaatttttcctta
caaacagatatgaccagaaggcaattagaatatgaagcgaggcaaatcagagttgcaatg
gaagagcaactaggcacttgccaagatatggaaaaacgagcacagcgaagaatagccaga
attcagcaaatcgagaaagacatacttcgtataagacagcttttacagtcccaagcaaca
gaagcagagaggtcatctcagaacaagcatgaaactggctcacatgaagctgagcggcag
aatgaaggtcaaggagtggcagaaatcagcatggcaacttctggtaatggtcagggttca
actacacgaatggaccacgaaacagccagtgttttgagttctagtagcacgcactctgca
cctcgaaggctgacaagtcatctgggaaccaaggtggaaatggtgtattcattgttgtca
atgcttggtactcatgataaggatgatatgtcgcgaactttgctagctatgtctagctcc
caagacagctgtatatccatgcgacagtctggatgtcttcctctcctcatccagctttta
catggcaatgacaaagactctgtattgttgggaaattcccggggcagtaaagaggctcgg
gccagggccagtgcagctctccacaacatcattcactcacagcctgatgacaagagaggc
aggcgtgaaatccgagtccttcatcttttggaacagatacgcgcttactgtgaaacctgt
tgggagtggcaggaagcccatgaacaaggcatggaccaggacaaaaatccaatgccagct
cctgttgaacatcagatctgtcctgctgtgtgtgttctaatgaaactttcatttgacgaa
gagcatagacatgcaatgaatgaactagggggactacaggccattgcagagttattgcaa
gtggactgtgaaatgtatgggcttactaatgaccactacagtattacactaagacgatat
gctggaatggctttgacaaacttgactttcggagatgtagccaacaaggctacgctatgc
tctatgaaaggctgcatgagagcacttgtggcccaactaaaatctgaaagtgaagactta
cagcaggttattgcaagtgttttgaggaatttgtcttggcgagcagatgtaaatagtaaa
aagacattgcgggaagttggaagtgtgaaagcattgatggaatgtgctttggaagttaaa
aaggaatcaaccctcaaaagcgtactgagtgccttatggaatttgtcagcacattgcact
gagaataaagctgacatatgtgccgtagatggtgcacttgcatttttggttggcactctt
acttaccggagccagacaaacactttagccattattgaaagtggaggtgggatattacgg
aatgtgtccagcttgatagctacaaatgaggaccacaggcaaatcctaagagagaacaac
tgtctacagactttattacaacacttaaaatctcatagtttgacaatagtcagtaatgca
tgtggaactttgtggaatctctcagcaagaaatcctaaagaccaggaagcattatgggac
atgggggcagttagcatgctcaagaacctcattcattcaaagcacaaaatgattgctatg
ggaagtgctgcagctttaaggaatctcatggcaaataggcctgcaaagtataaagatgcc
aatattatgtctcctggttcaagcttgccatctcttcatgtcaggaaacagaaagcccta
gaagcagaattagatgctcagcatttatcagaaacttttgacaatatagacaatttaagt
cccaaggcatctcatcgcagtaagcagagacacaagcaaagtctctatggtgattatgtt
tttgacaccaatcgacatgatgataataggtcagaaaattttaatactggcaacatgact
gtcctttcaccatatttgaatactacagtgttacccagttcctcttcatcaagaggaagc
ttagatagttctcgttctgaaaaagatagaagtttggagagagaacgaggagttggccta
ggcaactaccatccagcaacagaaaatccaggaacctcttcaaagcgaggtttgcagatc
tccaccactgcagcccagattgccaaagtcatggaagaggtgtcagccattcatacctct
caagaagacagaagttctgggtctaccactgaattacattgtgtgacagatgagagaaat
gcacttagaagaaattctactgcccatacacattcaaacacttacaatttcaccaagtca
gaaaattcaaacaggacatgttctatgccttatgccaaattagaatacaagagatcttcg
aatgatagtttaaatagtgtcagtagtagtgatggttatggtaaaagaggtcaaatgaaa
ccctcaattgaatcctattctgaagatgatgaaagtaagttttgcagttatggtcaatac
ccagctgacctagcccataaaatacatagtgcaaatcatatggatgataatgatggagaa
ctagatacaccaataaattacagtctgaaatattcagatgagcagttgaactctggaagg
caaagtccttcacagaatgaaagatgggcaagacccaaacacataatagaagatgaaatg
aaacaaagtgagcaaagacaatcaaggagtcaaagtacaacttatcctgtgtatactgag
accactgatgataaacacctcaagttccaaccacattttggacagcaggaatgtgtttcc
ccatacaggtcacggggagccaatggttcagaaacaaatcgagtaggttctaatcatggg
attaatcaaaatgtaagccagtctttgtgtcaagaagatgactatgaagatgataagcca
accaactatagtgaacgttactctgaggaagaacagcatgaagaagaagagagaccaaca
aattatagcataaaatataatgaagagaaacatcttgtggatcagcctattgattatagt
ttaaaatattctacagatattccttcatcacagaaacagtcattttcattctcaaagagt
tcatctggccagagcactaaaactgaacacatctcttcaagcagtgagaatacatccaca
ccttcatctactgccaagaggcagaatcagcttcatccaagttctgcacagaacagaagt
ggtcagactcaaaaggctgccacttgcaaagtttcttctattaaccaagaaacaatacag
acttactgtgtagaagataccccaatatgtttttcaagatgtagttcattatcatctttg
tcatcagctgaagatgaaataggatgtgatcagacgacacaggaagcagattctgctaat
actctgcaaatagcagaaataaaagaaaataatggaactaggtcaactgaagatcctgtg
agcgaagttccagcagtgtcgcagcacactagaaccaaatccagcagactgcagggttct
agtttatcttcagaatcgaccagacacaaagctgttgaattttcttcaggagcaaaatcc
ccctccaaaagtggtgctcagacacccaaaagtcctcctgaacactatgtgcaggagact
ccactcatgtttagcagatgtacttctgtcagttcacttgatagtttcgagagtcgttca
attgctagttctgttcagagtgaaccatgcagtggaatggtaagtggcattataagcccc
agtgatctcccagatagccctggacaaaccatgccaccaagcagaagtaaaacccctcca
cctcctccccaaacagctcagaccaagcgagaagtacctaaaagtaaagcacctagtgct
gaaaagagagagagtggacctaagcaagctgcagtaaacgctgcagttcagagggtccag
gttcttccagatgctgatactttattacattttgccacggaaagtactccagatggattt
tcttgttcatctagcctgagtgctctgagcctcgatgagccatttatacagaaagatgtg
gaattaagaataatgcctcccgttcaggaaaatgacaatgggaatgaaacagaatcagag
cagcctaaagaatcaaatgaaaaccaggagaaagaggtagaaaaaactgttgattctgaa
aaggacctattagatgattcagatgatgatgatattgaaatactagaagaatgtattatt
tctgctatgccgacaaagtcatcacataaagcaaaaaagccagcccagactgcttcaaaa
ttacctccacctgtggcaaggaaaccaagtcagctgcctgtgtacaaacttctaccatca
caaaacaggttgcaaccccaaaagcatgttagttttacaccgggagatgatatgccacgt
gtatattgtgtagaagggacacctataaacttttccacagctacatctttaagtgatcta
acaatagaatcccctccaaatgagttagctgctggagaaggagttagagcaggggcacag
tcaggtgaatttgaaaaacgagataccattcctacagaaggcagaagtacagaggaggct
caaggaggaaaaaactcatctgtaaccatacctgaattggatgacaataaagcagaagaa
ggtgatattcttgcagaatgcattaattccgctatgcccaaagggaaaagtcacaagcct
ttccgtgtgaagaagataatggaccaggtccagcaagcatctgcgtcttcatctgcaacc
aacaaaaatcagttagatggtaagaaaaagaaacctacttcaccagtaaaacctatacca
caaaatactgaatataggacacgtgtaagaaaaaatgtagactcaaaaaataatttaaat
gctgagagagctttctcagacaacaaagattcaaagaaacagaacttgaaaaataattcc
aaggacttcaatgataagctccctaataatgaagatagagtcagaggaagttttgctttt
gattcacctcatcattatacacctattgaaggaactccttactgtttttcacgaaatgat
tctttgagttccctagattttgatgatgatgatgttgacctttccagagaaaaggctgaa
ttaagaaaggggaaagaaaataaggaatcagaagctaaagttaccagccacacagaacta
acctccaaccaacaatcagctaataagacacaagctattacaaagcagccaataaatcga
ggtcagccaaaacccgtactgcagaaacaatccacttttccccagtcatccaaagatata
ccagacagaggggcagcaactgatgaaaaattacagaattttgctattgaaaatactcca
gtttgcttttctcataattcctctctgagttctctcagtgacattgaccaagaaaacaac
aacaataaagaaaatgaacctatcaaagggactgagccccctgactcacagggagaacca
agtaaacctcaagcatcaggctatgctcctaaatcgtttcatgttgaagatacccccgtt
tgtttctcaagaaacagttctctcagttctcttagtattgattctgaagatgacctgctg
caggaatgtataagttccgcgatgccaaaaaagaaaaagccttcaagactcaagggtgat
aatgaaaaacatagccccagaaatatgggtggcatattagctgaagatctgacacttgat
ttgaaagatatacagagaccagattcagaacatggtttatcccctgattcagaaaatttt
gattggaaagctattcaggaaggtgcaaattccatagtgagtagtttacatcaagctgct
gctgctgcctgtttatctagacaagcttcatctgattccgattccatcctttccttgaaa
tcaggaatctctctggggtcaccatttcatcttacacctgatcaagaagaaaaacccttt
acaagtaataaaggcccacgaattctaaaaccaggggagaaaagtaccttggaaactaaa
aagatagaatctgaaagtaaaggaatcaaaggaggaaaaaaagtttatagaagtttgatt
actggaaaggttcgatctaattcagaaatttcaggccaaatgaaacagcctcttcaagca
aacatgccttcaatctctcgaggcaggacaatgattcatattccaggagttcgaaatagc
tcctcaagtacaagtcctgtttctaaaaaaggccccccacttaagacccaagcctccaaa
agccctagtgaaggtcagacagccaccacttctcctagaggagccaagccatctgtgaaa
tcagaattaagccctgttgccaggcagacatcccaaataggtgggtcaagtaaagcgcct
tctagatcaggatctagagattcgaccccgtcaagaccagcccagcaaccattaagtaga
cctatacagtctcctgggcgaaactctatttcccctggtagaaatggaataagtcctcct
aacaaattatctcagcttccaaggacatcttcccccagtactgcttcaactaagtcctca
ggttctggaaaaatgtcgtatacatctccaggcagacagatgagccaacagaaccttacc
aaacaaacaggtttatccaagaatgccagtagtattccaagaagtgagtctgcctccaaa
ggactaaatcagatgaataatggcaatggagccaataaaaaggtagaactttctagaatg
tcttcaactaaatcaagtggaagtgaatctgatagatcagaaagacctgtattagtacgc
cagtcaactttcatcaaagaagctccaagcccaaccttacgaagaaaattggaggaatct
gcttcatttgaatctctttctccatcgtctagaccagcttctcccactaggtcccaggca
caaactccagttttaagtccttcccttcctgatatgtctctatccacacattcatctgtt
cagcctggtggatggcgaaaactcccacctaatctcagccccactatagagtataatgat
ggaagaccagcaaagcggcatgatattgcacggtctcattctgaaagtccttctagactg
ccaatcaataggtcaggaacctggaaacgtgagcacagcaaacattcatcatcccttccc
cgagtaagcacttggagaagaactggaagttcatcttcaattctttctgcttcatcagaa
tccagtgaaaaagcaaaaagtgaggatgaaaaacatgtgaactctatttcaggaaccaaa
caaaataaagaaaaccaagtatccgcaaagggaacatggagaaaaataaaagaaaatgaa
atttctcccacaaatagtacttctcagactgtttcctcaggtgctaccaatggtgctgaa
tcaaagaccctaatttatcaaatggcacctgctgtttctaaaacagaggatgtttgggtg
agaattgaggactgtcccattaacaaccctagatctggtagatctcccacaggtaatact
cccccagtgattgacagtgtttcagaaaagggaaatccaaacattaaagatgcaaaagat
aatcaggcaaaacaaaatgtgggtaatggcagtgttcccatgcgtatcgtgggtttggaa
aatcgcctgaactccttcattcaggtagatgcccctgaccaaaaaggaactgagacaaaa
ccaggacaaaataatcctgtccctgcatcagagactaatgaaagttctatagtggaacgt
accccatttagttctagcagctcaagcaaacacagttcacctagtgggactgttgctgcc
agagtgactccttttaattacaacccaagccccaggaaaagcagtgcagatagcacttca
gcccggccatctcagatcccaactccagtgaataacacaaagaagcgagattcaaaaact
gacagcacagattccagtggaacccaaagtcctaagcgccattctgggtcttaccttgtg
acatctgtttaa

DBGET integrated database retrieval system