KEGG   Mus pahari (shrew mouse): 110326173
Entry
110326173         CDS       T05929                                 

Gene name
Apc2
Definition
(RefSeq) adenomatous polyposis coli protein 2 isoform X1
  KO
K02085  adenomatosis polyposis coli protein
Organism
mpah  Mus pahari (shrew mouse)
Pathway
mpah04310  Wnt signaling pathway
mpah04390  Hippo signaling pathway
mpah04550  Signaling pathways regulating pluripotency of stem cells
mpah04810  Regulation of actin cytoskeleton
mpah04934  Cushing syndrome
mpah05010  Alzheimer disease
mpah05022  Pathways of neurodegeneration - multiple diseases
mpah05165  Human papillomavirus infection
mpah05200  Pathways in cancer
mpah05206  MicroRNAs in cancer
mpah05210  Colorectal cancer
mpah05213  Endometrial cancer
mpah05217  Basal cell carcinoma
mpah05224  Breast cancer
mpah05225  Hepatocellular carcinoma
mpah05226  Gastric cancer
Brite
KEGG Orthology (KO) [BR:mpah00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04310 Wnt signaling pathway
    110326173 (Apc2)
   04390 Hippo signaling pathway
    110326173 (Apc2)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04550 Signaling pathways regulating pluripotency of stem cells
    110326173 (Apc2)
  09142 Cell motility
   04810 Regulation of actin cytoskeleton
    110326173 (Apc2)
 09160 Human Diseases
  09161 Cancer: overview
   05200 Pathways in cancer
    110326173 (Apc2)
   05206 MicroRNAs in cancer
    110326173 (Apc2)
  09162 Cancer: specific types
   05210 Colorectal cancer
    110326173 (Apc2)
   05225 Hepatocellular carcinoma
    110326173 (Apc2)
   05226 Gastric cancer
    110326173 (Apc2)
   05217 Basal cell carcinoma
    110326173 (Apc2)
   05213 Endometrial cancer
    110326173 (Apc2)
   05224 Breast cancer
    110326173 (Apc2)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    110326173 (Apc2)
  09164 Neurodegenerative disease
   05010 Alzheimer disease
    110326173 (Apc2)
   05022 Pathways of neurodegeneration - multiple diseases
    110326173 (Apc2)
  09167 Endocrine and metabolic disease
   04934 Cushing syndrome
    110326173 (Apc2)
 09180 Brite Hierarchies
  09181 Protein families: metabolism
   01009 Protein phosphatases and associated proteins [BR:mpah01009]
    110326173 (Apc2)
  09182 Protein families: genetic information processing
   03036 Chromosome and associated proteins [BR:mpah03036]
    110326173 (Apc2)
  09183 Protein families: signaling and cellular processes
   04812 Cytoskeleton proteins [BR:mpah04812]
    110326173 (Apc2)
Protein phosphatases and associated proteins [BR:mpah01009]
 Protein serine/threonine phosphatases
  Phosphoprotein phosphatases (PPPs)
   Protein phosphatase-1
    PP1-interacting proteins (PIPs)
     110326173 (Apc2)
Chromosome and associated proteins [BR:mpah03036]
 Eukaryotic type
  Centrosome formation and ciliogenesis proteins
   Other centriole associated proteins
    110326173 (Apc2)
Cytoskeleton proteins [BR:mpah04812]
 Eukaryotic cytoskeleton proteins
  Microtubules
   Tubulin-binding proteins
    EB / APC
     110326173 (Apc2)
SSDB
Motif
Pfam: APC_basic APC_rep APC_N_CC APC_r Arm_APC_u3 Suppressor_APC Arm SAMP bZIP_1 HALZ Flagellin_N
Other DBs
NCBI-GeneID: 110326173
NCBI-ProteinID: XP_021059996
Ensembl: MGP_PahariEiJ_G0030989
LinkDB
Position
9
AA seq 2277 aa
MTSSVASYEQLVRQVEALKAENTHLRQELRDNSSHLSKLETETSGMKEVLKHLQGKLEQE
ARVLVSSGQTEVLEQLKALQTDISSLYNLKFHAPALGPEPASRTPEGSPVHGSGPSKDSF
GELSRATIRLLEELDQERCFLLSEIEKEEKEKLWYYSQLQGLSKRLDELPHVDTFSMQMD
LIRQQLEFEAQHIRSLMEERFGTSDEMVQRAQIRASRLEQIDKELLEAQDRVQQTEPQAL
LAVKPVAVEEEQEAEVPTHPEDGTPQPGNSKVEVVFWLLSMLATRDQEDTARTLLAMSSS
PESCVAMRRSGCLPLLLQILHGTEAGSMGRAGTPGAPGAKDARMRANAALHNIVFSQPDQ
GLARKEMRVLHVLEQIRAYCETCWDWLQARDSGTESGTGDTPVPIEPQICQATCAVMKLS
FDEEYRRAMNELGGLQAVAELLQVDYEMHKMTRDPLNLALRRYAGMTLTNLTFGDVANKA
TLCARRGCMEAIVAQLASESEELHQVVSSILRNLSWRADINSKKVLREVGSMTALMECVL
RASKESTLKSVLSALWNLSAHSTENKAAICQVGGALAFLVSTLTYRCQGNSLAVIESGGG
ILRNVSSLIATREDYRQVLRDHNCLQTLLQHLTSHSLTIVSNACGTLWNLSARSPRDQEL
LWDLGAVGMLRNLVHSKHKMIAMGSAAALRNLLAHRPAKYQAAAMAVSPGTCVPSLYVRK
QRALEAELDTRHLVHALGHLEKQGLPEAETTSKKPLPPLRHLDGLVQDYASDSGCFDDDD
APSLAAAITTAEPASPAVMSMFLGGPFLQGQALARTPPARQGGLETEKEAGGEAAVAAKA
KAKLALAVARIDRLVEDISALHTSSDDSFSLSSGDPGQEAPREGRAQSCSPCRGTEGGRR
DAGSRAHPLLRLKAAHTSLSNDSLNSGSTSDGYCTREHMTPCPLAALAEHRDDPVRGQTR
PSRLDLDLPSRAELPARDTAATDARVRTIKLSPTYQHVPLLDGAAGAGVRPLVGPGTSPG
ARKQAWIPADSLSKVPEKLVASPLPVASKVLQKLVAQDGPMSLSRCSSLSSLSSTGHAVP
SQAENLDSDSSLEGLEEAGPGEAELGRAWRASGSTSLPVSIPAPQRGRSRGLGVEDATPS
SSSENCIQETPLVLSRCSSVSSLGSFESRSIASSIPSDPCSGLGSGTVSPSELPDSPGQT
MPPSRSKTPPAPPGQPETSQFSLQWESYVKRFLDIADCRERCQPPSELDAGSVRFTVEKP
DENFSCASSLSALALHELYVQQDVELRLRPPACPERAVGGGGHRRRDEAASRLDGPAPAG
SRARSAADKELEALRECLGAAMPPRLRKVASALVPGRRALPVPVYMLVPAPARGDDSGTD
SAEGTPVNFSSAASLSDETLQGPSRDKPARPGDRQKPTGRAAPARQTRSHRPKAAGAGKS
TEHTRGPSRNRAGLELPLSRPQSARSNRDGSCQTRTRGDGALQSLCLTTPTEEAVYCFYD
SDEEPPASAPPPRRASAIPRALKREKPTGRKETPSRVAQPATLPVRAQPRLIVDETPPCY
SLTSSASSLSEPEASEQPASHARGPEQGSEQVSSPSPRAEEELLQRCISLAMPRRRTQVP
GSRRRKPRAMRSDIRPTEITQKCQEEVAGSDPASDLDSVEWQAIQEGANSIVTWLHQAAA
KASLEASSESDSLLSLVSGVSSTLQPSKLRKGRKPAAEAGGAWRPEKRGTASTKITGSPR
LPSGSEKAKGTQKMMAGESTMLRGRTVIYSTGPASRAQSKGISGPCTTPKKPGTSGTTHP
ETATKAPSPEQQRSRSLHRPGKISELAALRHQPRSATPPARLTKTPSSSSSQTSPASQPL
PRRSPLATPTGGPLPGPGGSPVPKSPARALLAKQHKTQKSPVRIPFMQRPARRMPPPLAR
PSPEPGSRGRAGAEGTPGARGSRLGLVRMASARSSGSESSDRSGFRRQLTFIKESPGLLR
RRRSELSSADSTASTSQAASPRRGRPALPAVFLCSSRCDELRVSPRQPLAAQRSPQTKPG
LAPRAPRRTSSESPSRLPVRATPGRPETVKRYASLPHISVSRRPDSAVSVPTTQAKTTRR
GSDGETRPLPRVAAPGTTWRRIKDEDVPHILRSTLPATALPLRGSSPEDSPAGTPQRKTS
DAVVQTEDVATSKTNSSTSPSLESRDPPQAPARGPVAPQGSDVDGPVLSKPPASAPFPHE
GLSAVMAGFPTSRHGSPSRAARVPPFNYVPSPMAAATMASDSAVEKAPVSSPASLLE
NT seq 6834 nt   +upstreamnt  +downstreamnt
atgaccagctccgtggcctcgtatgagcagctggtgcgccaggtggaggccctgaaggcc
gagaacactcacttaaggcaggagctgagggataactccagccatctctccaagctggag
acagagacctctgggatgaaggaggttctgaaacaccttcagggcaagctggagcaggag
gcgagagtgctcgtgtcctccgggcagacagaagtgttagagcaactgaaagctcttcag
actgacatcagtagcctgtacaacctcaagttccatgctcccgccctgggcccggagcct
gcctcccggactccagagggaagcccagtgcacggctctggaccatccaaggacagcttc
ggagaactgagcagggccaccatccgcctgctggaagaactagaccaggagagatgcttc
ctgctgagcgagatagagaaagaggagaaggagaagctatggtattactctcagctccag
ggcctgtccaagcgcttggatgagctgccacacgtggacacgttttcgatgcagatggat
ctgattcggcagcagctggagttcgaggcccagcacatccgctctctgatggaggagcgc
ttcggtacctccgacgagatggtacagcgcgcgcagatccgggcttcgcgcttggagcag
attgacaaggagctgttggaagcccaggaccgggtgcagcagacagagcctcaggctctg
ctggcagtgaagcctgtggcagtggaggaggagcaggaggcagaagtccccacacaccct
gaggatggcacccctcagcctggcaacagcaaggtggaggtggtgttctggcttctatct
atgttggcaacgcgcgaccaggaagatactgcgcgcacgctgctggccatgtccagctcg
ccagagagctgtgtagccatgcgccgctcgggctgtctgccactgctgctccagattctt
catggcactgaggctgggtctatggggcgcgcagggacccccggagcgcccggtgccaaa
gatgcacgcatgcgcgccaacgcggccctgcacaacatcgttttctcccagccggatcag
ggcctggcacgcaaggagatgcgtgtgctgcatgtgctggagcagatccgagcctactgc
gagacctgctgggactggcttcaggcgagggacagcgggacagaaagtggtacaggagac
actcctgtccccatcgagccacagatctgccaggctacctgtgcagtgatgaagctgtca
tttgacgaagaataccgtcgggctatgaatgagctagggggcctgcaggctgtggcagaa
ctactgcaggtggattatgagatgcacaagatgacccgggacccactcaaccttgccctg
cgccgctacgctggcatgaccctcaccaacctcacctttggagacgttgccaacaaggcc
acactgtgtgcccgccgaggctgcatggaagccattgtggcccaacttgcctctgagagt
gaggagctgcatcaggttgtttccagtattctgcgtaatctgtcatggagggcagacatc
aatagcaagaaggtgctgagggaggttggcagcatgactgccttgatggaatgtgtgctg
cgggcctccaaggagtccaccctaaagagcgtgctcagtgctctgtggaacctgtcggca
cacagcacagagaacaaggcggccatctgccaggtgggcggcgccctggctttcctggta
agcaccctcacataccgttgccaagggaattctctggcggtcatcgagagtggcggtggg
atcctgcgcaacgtgtcaagcctcattgccacacgggaggactacaggcaggtgctccgt
gaccacaactgcctgcagacactgctgcagcatctcacgtcccacagtttgaccatcgtg
agcaatgcctgcggcaccctctggaacctgtctgcccgcagcccccgcgaccaggaactg
ttgtgggacctgggggctgtgggcatgctacgcaacctcgttcactccaaacacaagatg
atcgccatgggtagcgccgccgctctgcggaacctgctagcccaccgacccgccaagtat
caggctgcagccatggctgtctccccaggcacctgcgtgcccagtctgtacgtccgcaag
cagagagctctggaagctgagttggacactcggcacctggtgcatgcactcggtcactta
gagaagcagggtctgcctgaggcagagaccacttcaaagaagcccctgccacccctccgc
cacctggatgggctggtacaggactatgcctctgattctggctgctttgacgacgatgac
gcaccatccctggctgctgcgatcaccacagccgagcccgccagcccagcagtgatgtct
atgttccttggcggtcccttccttcagggccaggcactggcccgcaccccacctgcccgc
cagggtggcttagaaaccgagaaggaggctggtggggaggcagctgtggctgccaaggcc
aaggccaagctggcattggctgtggctcggatcgaccgattggtggaggacatctctgcc
ctgcacacctcatcagacgacagcttcagtctcagctcgggggaccctgggcaggaggcg
ccaagggagggccgtgctcagtcctgttctccatgccggggcaccgagggtgggcggcgt
gatgctggcagcagggcgcaccctctgctgaggctcaaggcggcccacaccagcctctct
aatgatagcctgaacagcggtagcaccagcgatggctactgtacccgggaacacatgacg
ccttgcccgctggctgcgttggcggagcaccgtgatgaccctgtgcgcggacagactcgg
cccagccgactggacctggaccttcccagccgggctgagcttcctgcccgggacacagca
gccaccgatgcccgagtgcgcacaatcaagttatccccaacctatcagcacgttccactg
ctcgatggggccgctggggcaggtgtccgacccctggttgggccgggaacctccccgggg
gctcggaaacaggcatggatacctgcggacagcctgagcaaagtccctgagaaactggtg
gcctctccactgcccgtagctagcaaggtgctgcagaagctggtggcacaggatgggccg
atgtccctctccaggtgcagctctctgtcctctctgtcttccacgggccatgccgtcccc
agccaggcggagaaccttgacagcgattcatccctggaggggcttgaggaggctggtcct
ggtgaggccgagctgggcagggcgtggcgagcatccgggtccacctctcttccagtgtcc
atcccagccccgcagcgggggcgcagtcgaggtcttggggtggaggatgcaacaccatcc
agctcatctgagaactgtatccaggagacacccttggtcttgagccgttgtagttccgtg
agctccctgggcagctttgagagccgctccattgccagctccatccccagtgacccgtgc
agtgggctgggcagtggcacagtgagtcccagcgagctgccagacagccccgggcagacg
atgccaccgagccgcagcaagacgccaccggcacctcctgggcagcctgagaccagccag
ttcagcctgcagtgggaaagctatgtgaaacgcttcctagacatcgcggactgtcgagaa
agatgccagccgccctcggagctggacgcgggcagcgttcgcttcacagtggagaagcca
gacgagaatttctcctgcgcctccagcctcagtgcactggccctgcatgagctgtatgtt
cagcaggatgtggagctgcgtctgaggccaccagcctgcccagaacgtgcggtgggtggt
gggggccatcgtcggagggacgaggctgccagccgcctggatggcccagcaccagctggt
tctagggctcggtcagcagctgataaagaactggaggctttgcgtgaatgtctgggggca
gccatgcctccccggctccgcaaggtggcctcagccttggtgcctggccgccgcgcattg
ccagtccctgtgtacatgttagtgcccgccccggctcggggtgatgactcgggcacggac
tccgcagagggcacacccgttaacttttccagtgcagcctcgctcagtgatgagacctta
cagggaccctccagggacaagccggccaggcctggggacaggcagaaacctacaggccga
gctgcccctgccaggcagacccgatctcaccggcccaaggcagcaggtgctggtaaaagc
acagaacacacccggggacccagtaggaaccgggcaggattggagctacccctcagccga
ccccagagtgctcggtccaacagggatggctcatgccagacccggacccgcggagacggt
gccctgcagtcgctatgcctcacaacacccacagaggaagctgtgtactgcttctatgac
tctgacgaggaaccaccagcctctgcaccaccacctcggcgggcatccgccatcccacgg
gctctaaagcgagagaaacccacaggcagaaaggagactccatccagggtggcccagcct
gccacactccctgtgagagcccagcccagactgatcgtggatgagaccccgccctgctat
tccctgacttcctcagctagttccctcagtgagcctgaggcctctgaacagccggccagc
catgctcgaggcccggagcagggcagtgaacaggtcagctctcctagcccaagggcagaa
gaggaacttctgcagaggtgtatcagcttggccatgcccaggcgccggacccaggtaccg
ggctcacggcgtcgcaagcccagagccatgaggtcagacataaggcccactgagataacc
cagaaatgccaggaggaggtggctggctctgatccagcctctgacctagacagcgtggag
tggcaggctatccaagagggcgcaaactccatcgtcacgtggctgcatcaggcagcggcc
aaagccagcctggaggcgtcttctgagtctgactccctcttgtctttggtgtctggggtg
tcctccaccctccagccctccaagctcaggaaagggcgaaagcctgcagcagaggctgga
ggtgcctggcgtcctgagaaacgggggacagcttccaccaagatcactgggagtccccgg
cttcctagtggctccgagaaggcaaagggtacccagaaaatgatggcaggggagtcaacc
atgctccggggacggacagtgatctactcaaccggcccagcctcccgtgctcagtccaaa
ggtatttctggaccttgtaccacacctaagaagccagggacatctggcaccactcatcca
gaaactgccaccaaagcccccagccctgagcaacaacgttcacggagcctccaccgaccc
ggcaagatctctgagctggcagccttgcgccaccagcccaggagcgccactcctccagcc
cgcctcaccaagaccccgtcctcaagctcttcacagacctccccagcatcccagcccctg
cctagacggtcccctctggccactccaacaggagggcctctgcctggccctggggggtcc
ccagtgcccaagtcaccagcgcgggcccttctggctaagcaacacaagacccagaagtca
cctgtgcggatcccattcatgcaaaggccagccaggcgaatgccacctccactggccaga
ccatccccagagcctggctccaggggccgagctggggctgaggggactcctggggcacgt
ggcagccgcctgggcctggtgcgtatggcgtcagctcgctccagtggcagtgagtcctcg
gatcgctcaggcttccgaaggcagctgactttcatcaaggaatccccagggctccttcgg
cgccgcagatcagagctgtcctctgcggactccacggcctccacctcccaggctgcttcg
ccccgccgtggacggcccgcactccctgctgtctttctctgctcctctcgttgcgatgag
ctgcgggtatccccacggcagcccctggcagcacagaggtcccctcagaccaagccaggt
ctcgcaccacgtgcgcccagacgtaccagctccgagagcccctcacgcctgcctgtacgg
gcgacccctgggcggcctgagacagtcaagcggtacgcatccctgccacatattagtgtg
tcccgcagacccgatagcgctgtctctgtgcccaccacccaggccaagaccactcgccgg
ggaagtgatggtgagaccaggccgctgcccagggtagctgctcccggtacgacctggcgt
cgaatcaaagatgaagatgtcccccacatcctgcgcagcacgctgcctgccactgccctg
cctctcaggggctcatcaccggaagacagccccgctggaactccacagcgcaagaccagt
gacgcagtggtgcaaacagaggacgtggctacttctaagaccaattccagcacgtcacct
agcctggagagcagggatcccccacaggccccagccagaggccctgtggctccccagggc
agcgatgtggatggaccagtactcagcaagcctcctgcctcagcgcccttcccccatgag
ggtctgagtgctgtcatggcgggctttcccaccagcaggcatggctcccccagcagggct
gcacgggttccccccttcaactatgtgcccagccccatggcagcggccacaatggccagt
gactcagcagtggagaaagcccctgtctcctccccagccagcctcctggagtag

KEGG   Mus pahari (shrew mouse): 110333195
Entry
110333195         CDS       T05929                                 

Gene name
Apc
Definition
(RefSeq) adenomatous polyposis coli protein isoform X1
  KO
K02085  adenomatosis polyposis coli protein
Organism
mpah  Mus pahari (shrew mouse)
Pathway
mpah04310  Wnt signaling pathway
mpah04390  Hippo signaling pathway
mpah04550  Signaling pathways regulating pluripotency of stem cells
mpah04810  Regulation of actin cytoskeleton
mpah04934  Cushing syndrome
mpah05010  Alzheimer disease
mpah05022  Pathways of neurodegeneration - multiple diseases
mpah05165  Human papillomavirus infection
mpah05200  Pathways in cancer
mpah05206  MicroRNAs in cancer
mpah05210  Colorectal cancer
mpah05213  Endometrial cancer
mpah05217  Basal cell carcinoma
mpah05224  Breast cancer
mpah05225  Hepatocellular carcinoma
mpah05226  Gastric cancer
Brite
KEGG Orthology (KO) [BR:mpah00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04310 Wnt signaling pathway
    110333195 (Apc)
   04390 Hippo signaling pathway
    110333195 (Apc)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04550 Signaling pathways regulating pluripotency of stem cells
    110333195 (Apc)
  09142 Cell motility
   04810 Regulation of actin cytoskeleton
    110333195 (Apc)
 09160 Human Diseases
  09161 Cancer: overview
   05200 Pathways in cancer
    110333195 (Apc)
   05206 MicroRNAs in cancer
    110333195 (Apc)
  09162 Cancer: specific types
   05210 Colorectal cancer
    110333195 (Apc)
   05225 Hepatocellular carcinoma
    110333195 (Apc)
   05226 Gastric cancer
    110333195 (Apc)
   05217 Basal cell carcinoma
    110333195 (Apc)
   05213 Endometrial cancer
    110333195 (Apc)
   05224 Breast cancer
    110333195 (Apc)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    110333195 (Apc)
  09164 Neurodegenerative disease
   05010 Alzheimer disease
    110333195 (Apc)
   05022 Pathways of neurodegeneration - multiple diseases
    110333195 (Apc)
  09167 Endocrine and metabolic disease
   04934 Cushing syndrome
    110333195 (Apc)
 09180 Brite Hierarchies
  09181 Protein families: metabolism
   01009 Protein phosphatases and associated proteins [BR:mpah01009]
    110333195 (Apc)
  09182 Protein families: genetic information processing
   03036 Chromosome and associated proteins [BR:mpah03036]
    110333195 (Apc)
  09183 Protein families: signaling and cellular processes
   04812 Cytoskeleton proteins [BR:mpah04812]
    110333195 (Apc)
Protein phosphatases and associated proteins [BR:mpah01009]
 Protein serine/threonine phosphatases
  Phosphoprotein phosphatases (PPPs)
   Protein phosphatase-1
    PP1-interacting proteins (PIPs)
     110333195 (Apc)
Chromosome and associated proteins [BR:mpah03036]
 Eukaryotic type
  Centrosome formation and ciliogenesis proteins
   Other centriole associated proteins
    110333195 (Apc)
Cytoskeleton proteins [BR:mpah04812]
 Eukaryotic cytoskeleton proteins
  Microtubules
   Tubulin-binding proteins
    EB / APC
     110333195 (Apc)
SSDB
Motif
Pfam: Arm_APC_u3 APC_basic EB1_binding APC_r APC_u5 APC_rep APC_u14 APC_u15 APC_u9 APC_N_CC SAMP Arm APC_u13 APC_15aa Suppressor_APC JIP_LZII
Other DBs
NCBI-GeneID: 110333195
NCBI-ProteinID: XP_021070368
Ensembl: MGP_PahariEiJ_G0018819
LinkDB
Position
15
AA seq 2841 aa
MAAASYDQLLKQVEALKMENSNLRQELEDNSNHLTKLETEASNMKEVLKQLQGSIEDETM
TSGQIDLLERLKEFNLDSNFPGVKLRSKMSLRSYGSREGSVSSRSGECSPVPMGSFPRRA
FVNGSRESTGYLEELEKERSLLLADLDKEEKEKDWYYAQLQNLTKRIDSLPLTENFSLQT
DMTRRQLEYEARQIRAAMEEQLGTCQDMEKRAQRRIARIQQIEKDILRVRQLLQSQAAEA
ERSSQSRHDAASHEADRQHEGHGVAESNTAASGSGQSPAARVDHETASVLSSSGTHSAPR
RLTSHLGTKVEMVYSLLSMLGTHDKDDMSRTLLAMSSSQDSCISMRQSGCLPLLIQLLHG
NDKDSVLLGNSRGSKEARARASAALHNIIHSQPDDKRGRREIRVLHLLEQIRAYCETCWE
WQEAHEQGMDQDKNPMPAPVEHQICPAVCVLMKLSFDEEHRHAMNELGGLQAIAELLQVD
CEMYGLTNDHYSVTLRRYAGMALTNLTFGDVANKATLCSMKGCMRALVAQLKSESEDLQQ
VIASVLRNLSWRADVNSKKTLREVGSVKALMECALEVKKESTLKSVLSALWNLSAHCTEN
KADICAVEGALAFLVGTLTYRSQTNTLAIIESGGGILRNVSSLIATNEDHRQILRENNCL
QTLLQHLKSHSLTIVSNACGTLWNLSARNPKDQEALWDMGAVSMLKNLIHSKHKMIAMGS
AAALRNLMANRPAKYKDANIMSPGSSLPSLHVRKQKALEAELDAQHLSETFDNIDNLSPK
ASHRSKQRHKQGLYGDYAFDANRHDDSRSDNFNTGNMTVLSPYLNTTVLPSSSSSRGSLD
SSRSEKDRSLERERGIGLSTYHPATENPGTSSKRGLQITTTAAQIAKVMEEVSAVRTSQD
DRSSASATEFHCAADDRSAARRSSASQTHSNTYNFSKSENSNRTCSMPYAKVEYKRSSND
SLNSVTSSDGYGKRGQMKPSVESYSEDDESKFCSYGQYPADLAHKIHSANHMDDNDGELD
TPINYSLKYSDEQLNSGRQSPSQNERWARPKHVIEDEIKQNEQRQARSQNTSYPVYSEKT
DDKHLKFQPHFGQQECVSPYRSRGTSGSETNRMGSSHAINQNVNQSLCQEDDYEDDKPTN
YSERYSEEEQHEEEEERPTNYSIKYNEEKHHVDQPIDYSLKYATDISSSQKPSFSFSKNS
SAQSTKPEHISPSSENTSAPSSNAKRQNQLRPSSAQRNSQTQKGTTCKVPSINQETIQTY
CVEDTPICFSRCSSLSSLSSAEDEIGCDQTTQEADSANTLQIAEIKENDVTRSTEDPAAE
VPAVSQNARTKPSRLQASGLSSESTRHKAVEFSSGAKSPSKSGAQTPKSPPEHYVQETPL
VFSRCTSVSSLDSFESRSIASSVQSEPCSGMVSGIISPSDLPDSPGQTMPPSRSKTPPPP
PQTVQTKREVPKSKVPAAEKRESGPKQTAVSAAVQRVQVLPDADTLLHFATESTPDGFSC
SSSLSALSLDEPFIQKDVELRIMPPVQENDNGNETEPEQPEESNENQDKEVEKPDSEKDL
LDDSDDDDIEILEECIISAMPTKSSRKAKKLAQTASKLPPPVARKPSQLPVYKLLPSQNR
LQAQKHVSFTPGDDVPRVYCVEGTPINFSTATSLSDLTIESPPNELATGDGVRTGIQSGE
FEKRDTIPTEGRSTDEAQRGKMSSRVTPDLDDNKAEEGDILAECINSAMPKGKSHKPFRV
KKIMDQVQQASSTSSGTNKNQVDTKKKKPTSPVKPMPQNTEYRPRVRKNTDSKVNVNTEE
TFSDNKDSKKQSLKPNPKAFNDKLPNNEDRVRGSFTFDSPHHYTPIEGTPYCFSRNDSLS
SLDFDDDDVDLSREKAELRKGKESKDSEAKVTCHTEPNSSQQSASKSQASTKHPVNRGQS
KPVPQKQPTFPQSSKDGPDRGAATDEKLQNFAIENTPVCFSRNSSLSSLSDIDQENNNNK
ESEPIKEAEPTNSQGEPSKPQASGYAPKSFHVEDTPVCFSRNSSLSSLSIDSEDDLLQEC
ISSAMPKKKRPSRLKGESEKQSPRKVGGMLAEDLTLDLKDIQRPDSEHGLSPDSENFDWK
AIQEGANSIVSSLHQAAAAAACLSRQASSDSDSILSLKSGISLGSPFHLTPDQEEKPFTS
NKGPRILKPGEKSTLEAKKIESENKGIKGGKKVYKSLITGKIRSSSEISSQMKQPLPTNM
PSISRGRTMIHIPGLRNSSSSTSXVSKKGPPLKTPASKSPSEGPGATTSPRGTKPAVKSE
LSPITRQTSQIXGSNKGSSRSGSRDSTPSRPTQQPLSXPMQSPGRNSISPGRNGISPXNK
LSQLPRTSSPSTASTKSSGSGKMSYTSPGRQLSQQNLTKQAGLSKNASSIPRSESASKGL
NQMNNGNGSNKKVELSRMSSTKSSGSESDRSERPALVRQSTFIKEAPSPTLRRKLEESAS
FESLSPSSRPDSPTRSQAQTPVLSPSLPDMCLSTHPPVQAGGWRKLPPNLSPTIEYNDGR
PAKRQDIARSHSESPSRLPINRAGTWKREHSKHSSSLPRVSTWRRTGSSSSILSASSESS
EKAKSEDERHVSSVPAPRQMKENQVPTKGTWRKIKESDISPTNMVSQTASSGAASGAESK
TLIYQMAPAVSKTEDVWVRIEDCPINNPRSGRSPTGNTPPVIESVSEKGSSSIKDSKDTQ
GKQSVGSGSPVQTVGLETRLNSFIQVEAPEQKGTEAKPGQSHPVPVAETAETCIAERTPF
SSSSSSKHSSPSGTVAARVTPFNYNPSPRKSSADSTSARPSQIPTPVSNNTKKRDSKTDS
TESSGAQSPKRHSGSYLVTSV
NT seq 8526 nt   +upstreamnt  +downstreamnt
atggctgcagcttcatatgatcagttgttaaagcaagttgaggcactgaagatggagaac
tcaaatcttcgacaagagctagaagataattccaatcatcttacaaaactggaaactgag
gcatctaatatgaaggaagtacttaagcagctacagggaagtattgaagatgagactatg
acttctggacagattgacttactagagcgtcttaaagaatttaacttggatagtaatttc
cccggagtgaaactacgctcaaaaatgtcccttcgctcctatggaagtcgggaaggttct
gtatcaagccgttcaggagaatgcagtcctgtccccatggggtcattcccaagaagagca
tttgtaaatggaagcagagaaagtactggatatctagaagagcttgaaaaagaaagatca
ttactccttgctgatcttgacaaagaagagaaggaaaaggactggtattatgctcaactt
cagaacctcacaaaaagaatagatagcctgcctctaactgaaaatttttccttacagaca
gacatgacaagacggcagctggagtatgaagcaaggcaaatcagggctgccatggaggag
cagcttggcacttgccaggacatggagaagcgcgcacagcgaagaatagctaggatccag
caaatagaaaaggacatactgcgagtgcgacagcttttacagtcccaggcagcggaagca
gagaggtcatctcagagcaggcatgatgctgcctcccatgaagctgaccggcagcacgaa
ggtcacggagtggcagaaagcaacacggcagcctccggtagtggtcagagtccagctgca
cgtgtggatcacgaaacagccagtgttttgagttctagcggcactcactctgctcctcga
aggttgacaagtcatctggggacaaaggtggaaatggtgtattccttgttgtcaatgctt
ggtactcatgataaggacgatatgtcacgaactttgctagctatgtctagctcccaggac
agctgtatatccatgcggcagtctggatgcctccctctcctcatccagcttttacatggc
aatgacaaagactctgtattgttgggaaattcccggggcagtaaagaggctcgggccagg
gccagtgcagcactccacaacatcattcactcacagcctgatgacaagagaggcaggcgt
gaaatccgagtccttcatcttttggaacagatccgagcttactgtgaaacctgttgggag
tggcaggaagcccacgaacaaggcatggaccaggacaaaaacccaatgccagctcccgtt
gagcatcagatctgccctgctgtgtgtgttctgatgaagctttcatttgatgaagagcat
aggcatgcaatgaatgaacttgggggactgcaggccattgcagagttattgcaggtggac
tgtgagatgtatgggcttactaatgaccactacagtgttactttaagacggtatgctgga
atggctttgacaaacttgacctttggagatgttgccaacaaggctacgctgtgttctatg
aaaggctgtatgagagcacttgtggcccagttaaaatctgaaagtgaagacttacagcag
gttattgcaagtgttttgaggaatttgtcttggcgagcagatgtaaatagcaaaaagacg
ttgagagaagttggaagtgtgaaagcattgatggaatgtgctttggaagttaaaaaggaa
tcaaccctcaaaagcgtcttgagtgccctatggaacctgtctgcacactgcactgagaat
aaggctgacatctgtgccgtggagggagcactggcatttctggttggcaccctcacttac
cggagccagacaaatactttagccattattgaaagtggaggtgggatattacggaatgtg
tccagcttgatagctacaaacgaagaccacaggcaaatcctaagagagaacaattgccta
caaactttattacagcacttgaaatctcacagcttgaccatagtcagtaatgcatgtgga
actttatggaatctctcagcaagaaatcctaaagaccaggaagccttgtgggacatgggg
gcagtgagcatgctcaagaacctcattcattctaagcacaaaatgattgccatgggaagt
gcggcagctttaaggaatctcatggcaaacagacctgcaaagtataaggacgccaatatc
atgtctcctggttcaagtctgccgtcccttcacgttaggaaacagaaagctctagaagcc
gagctagatgctcagcatttatcagaaaccttcgacaacattgacaacttaagtcccaag
gcctctcaccgtagtaagcagagacacaagcagggcctttacggtgactatgcttttgat
gccaatcgacatgatgatagtaggtcagacaatttcaatactggaaacatgactgttctt
tcaccatatttaaatactacggtattgcccagctcttcttcctcaaggggaagtttagac
agttctcgttctgagaaggacagaagtttggaaagagaacgaggtattggcctcagcact
taccatccagcaacagaaaatccaggaacctcttcaaaacgaggtctgcagatcactacc
actgcagcccagatagccaaagttatggaagaagtgtcagccgttcgtacctcccaggac
gacagaagttctgcttctgccaccgagttccactgtgcggcagacgataggagtgccgca
cgaagaagctccgcctcccaaacccactcaaacacatacaacttctctaagtcggaaaat
tcaaataggacatgctctatgccttatgccaaagtggaatataaacgatcttcaaatgac
agtttaaatagtgtcactagtagtgatgggtatggtaaaagaggccaaatgaaaccctca
gttgaatcctattctgaggatgatgagagtaagttttgcagttatgggcagtatccagct
gacctagcccataagatacacagtgcgaatcacatggatgataatgatggagaactggat
acaccaataaattacagtcttaaatattcagatgagcagttgaactcaggaaggcagagt
ccttcacagaatgaaaggtgggcaagacccaagcatgtgatagaagatgaaataaagcaa
aatgagcaaagacaagcaagaagccagaacaccagttatcctgtctactctgagaagact
gatgacaaacacctcaaattccaaccacattttggacaacaagaatgtgtttccccatat
aggtctaggggaaccagtggttcagaaacaaatcgaatgggttctagtcatgcaattaat
caaaatgtaaaccagtctctgtgtcaggaagatgactatgaagatgataaacctaccaac
tacagtgaacgttattctgaggaagaacaacatgaagaagaagaagagagaccgacaaat
tatagcataaaatataatgaagagaaacatcatgtagatcagcctattgattatagttta
aaatacgccactgacatttcttcctcacaaaaaccatcattttcattctcaaagaattca
tcagcacaaagcactaaacctgaacatatctctccaagcagcgagaatacatctgcacct
tcgtctaatgccaaaaggcagaatcagctgcgtccaagttcagcacaaagaaacagtcag
actcaaaaagggactacttgcaaagtcccctccatcaaccaagaaacaatacagacttac
tgtgtagaagacaccccgatatgtttctcaaggtgcagttcattatcatcactgtcatca
gctgaagatgaaataggatgtgatcagacaacacaggaagcagattctgctaatactctg
cagatagcagaaataaaagagaatgatgtaactcggtcaactgaagatcctgcagctgaa
gttccagcagtgtcccagaatgctagaaccaaacctagccgactccaggcttctggctta
tcttcagaatcaaccaggcataaagctgttgagttttcttcaggagccaagtctccctcc
aaaagtggtgctcagacacccaaaagtcccccagaacactatgtccaggagactccactt
gtattcagcaggtgtacttctgtcagttcacttgacagttttgagagtcgctccattgcc
agctctgttcagagtgagccatgtagtggaatggtgagtggcatcataagccccagtgac
cttccagatagtcctgggcagaccatgccaccaagcaggagcaaaacccctccacctcct
ccacagacagtgcagaccaagagagaggtgccaaaaagtaaagtacctgctgctgagaag
agagagagcggacctaagcagactgctgtcagtgctgctgtgcagagggtgcaggtcctt
ccagacgctgacactttgttacacttcgccacagaaagtactccggatgggttttcttgt
tcctccagcctaagtgctctgagcctggatgagccatttatacagaaagatgtagaatta
agaataatgcctccagttcaggaaaatgacaatgggaatgaaactgaaccagaacagcct
gaggaatcaaatgaaaaccaggataaagaggtagaaaagcctgactctgaaaaagactta
ttagatgattctgatgatgatgatattgaaatattagaagaatgtattatttcagccatg
ccaacaaagtcatcacgcaaagccaaaaaactagcccagactgcttcaaaattacctcca
cctgtggcaaggaagccaagtcagctgcctgtgtataaacttctgccatcacagaacagg
ctgcaggcacaaaaacatgttagctttacaccaggggatgatgtgccccgggtgtactgt
gtagaagggacacctataaacttttccacagcaacgtctctaagtgatctgacaatagag
tcccctccaaatgagttggctactggagatggggtcagaacaggtatacagtcaggtgaa
tttgaaaaacgagataccattcctacagaaggcagaagtacagatgaggctcagcgaggg
aaaatgtcatctagagttacaccggacctggatgacaacaaagcagaggaaggagatatt
cttgcagaatgtatcaattctgctatgcccaaaggaaaaagtcacaagcctttccgagtg
aaaaagataatggaccaagtccaacaagcatcctcgacttcatctggaactaacaaaaat
caagtagacactaagaaaaagaagcctacttcaccagtaaagcccatgccacaaaatact
gaatatagaccgcgtgtgagaaagaatacagactcaaaagttaatgtaaatactgaagaa
actttctcagacaacaaagactcaaagaaacagagcttaaaacccaatcccaaggccttc
aatgataagctaccaaacaatgaagaccgagtacgggggagcttcactttcgattcaccg
catcattacacccctattgagggaacgccttactgcttttcccgaaatgattctttgagt
tctctagattttgatgatgacgatgttgacctttccagggaaaaggctgagttaagaaag
ggcaaagaaagtaaggattccgaagccaaagttacctgccacacagaaccaaactcaagc
cagcagtcagctagtaagtcacaagccagtacaaaacatccagtaaacagaggacagtcc
aaaccagtgccgcagaaacaacccactttcccccagtcctccaaagacgggccagataga
ggggcagcaactgacgaaaaactgcagaattttgctattgaaaatactccagtttgcttt
tctcgaaattcctctctgagttcccttagtgacattgaccaggaaaacaacaataacaaa
gaaagtgaaccaatcaaagaagctgaacctaccaactcacaaggagaaccaagtaagcct
caggcatctgggtatgcgccgaagtcatttcatgttgaagacacccctgtctgtttctca
agaaacagctctctcagttctcttagtattgactctgaggacgacctgttgcaggagtgt
ataagttctgccatgccaaaaaagaaaaggccttcaagactcaagggtgagagtgaaaaa
cagagtcctagaaaagtgggtggcatgttagctgaagatctgacacttgatttgaaagat
atacagaggccagattcagaacatggtttatcccccgattcagaaaattttgactggaaa
gctattcaggaaggtgcaaactccatagtaagtagtttgcaccaagctgctgctgctgct
gcatgtttatctagacaagcatcatctgattcagattccattctgtcactaaagtctggc
atttccctgggntcgccttttcatcttacacctgatcaagaggaaaagccattcacaagc
aataaaggcccaagaattctcaaacctggagagaaaagcacattagaagcaaaaaaaata
gaatctgaaaacaaaggaatcaaaggtggaaaaaaggtttataaaagcttgattacggga
aagattcgatccagttcagaaatttccagccaaatgaaacaacccctcccgacaaacatg
ccttcaatctcaagaggcaggacaatgattcatatcccagggcttcggaatagctcctca
agtacaagccntgtctctaagaaaggcccgcccctcaagactccagcctctaaaagcccc
agtgaagggccgggagctaccacttctcctcgnggaactaagccagcagtaaagtcagag
cttagccctattaccaggcaaacttcccaaatcantgggtcaaataaggggtcttctaga
tcaggatctagagactccactccctcaagacctacacagcagccattaagtagnccaatg
caatctccagggcgaaactcaatttcccctggtagaaatggaataagccctnctaacaaa
ctgtctcagctgcctagaacatcatctcccagtactgcttcaactaagtcctcgggttct
gggaaaatgtcatatacatccccaggcagacagctgagccaacaaaatcttaccaaacaa
gcaggcttatccaagaatgccagcagtatccccagaagtgagtcagcatctaaaggactg
aatcagatgaataacggcaatgggtcaaataaaaaggtagaactttctagaatgtcttca
actaaatcaagtggaagtgaatcagacagatcagaaagacctgcattagtacgccagtct
actttcatcaaagaagctccaagcccaaccctaaggaggaaactggaggaatctgcctca
tttgaatccctttctccatcttctagaccagattctcccaccaggtcccaagcacagacc
ccagttttaagcccttcccttcccgatatgtgtctatccacacatccacctgttcaggca
ggtggatggcgaaagcttccacccaacctcagccccactatagagtataatgatgggagg
cctgccaaacggcaggacattgcacgctcccattcggaaagcccttccaggctacccatc
aaccgggcaggaacctggaagcgtgaacacagcaaacattcctcgtcccttcctcgagtg
agcacttggagaagaactggaagctcatcttctattctttctgcttcatcagaatccagt
gaaaaagcaaaaagtgaggatgaaaggcatgtgagctccgtgccagcacccagacagatg
aaggaaaaccaggtgcccacaaaaggaacatggaggaaaatcaaggaaagtgacatttct
cccacaaacatggtttctcagaccgcttcctcaggtgctgccagtggtgctgaatcaaag
actctgatctatcagatggcacctgctgtctctaaaacagaggatgtttgggtgagaatt
gaggactgccccattaacaaccctagatctggaagatctcccacaggcaacaccccccca
gtgattgagagtgtttcagagaagggaagttcaagcattaaagattcaaaagacacccaa
gggaaacagagtgtgggcagtggcagtcctgtgcaaaccgtgggtctggaaacccgcctg
aactcctttattcaggtagaggccccagaacagaaaggaactgaggcaaaaccaggacag
agtcacccagtccctgtagcggagactgctgagacatgtatagcagagcgcaccccattt
agttccagtagctccagcaagcacagctcacctagtgggactgttgctgccagagtgaca
ccttttaattacaaccctagccctcggaaaagcagcgcagacagcacttcagcccggcca
tctcagatccctacaccagtgagcaacaacacaaagaagagagattcaaagactgacagc
acagaatccagtggagcccaaagtcctaaacgccattctgggtcttacctcgtgacgtct
gtttaa

DBGET integrated database retrieval system