KEGG   Rousettus aegyptiacus (Egyptian rousette): 107500934
Entry
107500934         CDS       T06036                                 

Gene name
APC2
Definition
(RefSeq) LOW QUALITY PROTEIN: adenomatous polyposis coli protein 2
  KO
K02085  adenomatosis polyposis coli protein
Organism
ray  Rousettus aegyptiacus (Egyptian rousette)
Pathway
ray04310  Wnt signaling pathway
ray04390  Hippo signaling pathway
ray04550  Signaling pathways regulating pluripotency of stem cells
ray04810  Regulation of actin cytoskeleton
ray04934  Cushing syndrome
ray05010  Alzheimer disease
ray05022  Pathways of neurodegeneration - multiple diseases
ray05165  Human papillomavirus infection
ray05200  Pathways in cancer
ray05206  MicroRNAs in cancer
ray05210  Colorectal cancer
ray05213  Endometrial cancer
ray05217  Basal cell carcinoma
ray05224  Breast cancer
ray05225  Hepatocellular carcinoma
ray05226  Gastric cancer
Brite
KEGG Orthology (KO) [BR:ray00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04310 Wnt signaling pathway
    107500934 (APC2)
   04390 Hippo signaling pathway
    107500934 (APC2)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04550 Signaling pathways regulating pluripotency of stem cells
    107500934 (APC2)
  09142 Cell motility
   04810 Regulation of actin cytoskeleton
    107500934 (APC2)
 09160 Human Diseases
  09161 Cancer: overview
   05200 Pathways in cancer
    107500934 (APC2)
   05206 MicroRNAs in cancer
    107500934 (APC2)
  09162 Cancer: specific types
   05210 Colorectal cancer
    107500934 (APC2)
   05225 Hepatocellular carcinoma
    107500934 (APC2)
   05226 Gastric cancer
    107500934 (APC2)
   05217 Basal cell carcinoma
    107500934 (APC2)
   05213 Endometrial cancer
    107500934 (APC2)
   05224 Breast cancer
    107500934 (APC2)
  09164 Neurodegenerative disease
   05010 Alzheimer disease
    107500934 (APC2)
   05022 Pathways of neurodegeneration - multiple diseases
    107500934 (APC2)
  09167 Endocrine and metabolic disease
   04934 Cushing syndrome
    107500934 (APC2)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    107500934 (APC2)
 09180 Brite Hierarchies
  09181 Protein families: metabolism
   01009 Protein phosphatases and associated proteins [BR:ray01009]
    107500934 (APC2)
  09182 Protein families: genetic information processing
   03036 Chromosome and associated proteins [BR:ray03036]
    107500934 (APC2)
  09183 Protein families: signaling and cellular processes
   04812 Cytoskeleton proteins [BR:ray04812]
    107500934 (APC2)
Protein phosphatases and associated proteins [BR:ray01009]
 Protein serine/threonine phosphatases
  Phosphoprotein phosphatases (PPPs)
   Protein phosphatase-1
    PP1-interacting proteins (PIPs)
     107500934 (APC2)
Chromosome and associated proteins [BR:ray03036]
 Eukaryotic type
  Centrosome formation and ciliogenesis proteins
   Other centriole associated proteins
    107500934 (APC2)
Cytoskeleton proteins [BR:ray04812]
 Eukaryotic cytoskeleton proteins
  Microtubules
   Tubulin-binding proteins
    EB / APC
     107500934 (APC2)
SSDB
Motif
Pfam: APC_basic APC_rep APC_N_CC Suppressor_APC APC_r Arm Arm_APC_u3 SAMP Wtap bZIP_1 Sec2p HALZ
Other DBs
NCBI-GeneID: 107500934
NCBI-ProteinID: XP_015982146
LinkDB
Position
Unknown
AA seq 2300 aa
MPSVPQVPGASTVLPAAVHAGLPGPGSSPGVRWEGSGFGPRLPPPQSLWECSDLWSQELQ
ELKMTSSVAPYEQLVRQVEALKAENSHLRQELRDNSSHLSKLETETSGMKEVLKHLQGKL
EQEARMLVSSGQTEVLEQLKALQVDITSLYNLKFQPPALGPEPAARTPEGSPVHGSGPSK
DGFGELSRATIRLLEELDRERCFLLNEIEKEEKEKLWYYSQLQGLSKRLDELPHVETQFS
MQMDLIRQQLEFEAQHIRSLMEERFGTSDEMVQRAQIRASRLEQIDKELLSAQDRVQQTE
PQALLAVKSVPLEEDPETEVPTHPGDGAPQPGNSKVEVVFWLLSMLATRDQEDTARTLLA
MSSSPESCVAMRRSGCLPLLLQILHGTEAEAGGRPGTPGAPGAKDARMRANAALHNIVFS
QPDQGLARKEMRVLHVLEQIRAYCETCWDWLRAQDGGAGGSPVPIEPQICQATCAVMKLS
FDEEYRRAMNELGGLQAVAELLQVDYEMHKMTRDPLNLALRRYAGMTLTNLTFGDVANKV
PVGRAGSVLSFGLSLLPPCPDLETVVSSILRNLSWRADINSKKVLREVGSMTALMQCVLR
ASKESTLKSVLSALWNLSAHSTENKAAICQVDGALGFLVSTLTYKCQSNSLAIIESGGGI
LRNVSSLIATREDYRQVLRDHGCLQTLLQHLTSHSLTIVSNACGTLWNLSARSPRDQELL
WDLGAVAMLRNLVHSKHKMIAMGSAAALRNLLAHRPAKHQAAATAVSPGACAPSLYVRKQ
RALEAELDARHLAQALDHLEKQGLPEAEAEAAAKKPLPPLRHLDGLAQDYASDSGCFDDD
DAPSLAAVAATAEPASPAVMSLFLGSPFLQGQALARAPPARRGGLEAEKEVGGEAAVAAR
AKAKLALAVARIDRLVEDISALHTSSDDSFSLSSGDPGQEVAREGRAQSCSPCRGPEGGR
REAGGRAHPLLRLKAAHASLSNDSLNSGSTSDGHGPREHAQPWPLAALAERREGPPRGQA
RPSQLDLGLPGGRAEPAREAAXPTASARTIKLSPTYQHVPLLEGRPGPXGLLASGARKPA
WLPAEGLGKGPEKLAAEAAPLCLSRCSSLSSLSSAGRPGPSEAGDLEDSDSSLEGLEEAG
PSEAELDGAWRRPGATSLPLAIPAPSRGRALGVEDATPSSSSENCVQETPLVLSRCSSVS
SLGSFESPSIASSIPSDPCSGLGSGTVSPSELPDSPGQTMPPSRSKTPPLAPAPPGEREA
TPFSLQWESYVKRFLDIADCRERCRLPPELDAGSVRFTVEKPDENFSCASSLSALALHEH
YVQKDVELRLLPPACPELGSGSAAGPGLHFAGHRRRDEASGCAEGPPAADQELELLRECL
GAAVPARLRKVASALVSGRRARPVPVYMLVPAPAREDDSCTDSAEGTPVNFSSAASLSDE
TLQPPRDPPGARADGHKPAGRVAPAGPAAGHRHKGGGTGKSTEPAQGAGGGRAGLGRPAR
RPPSAGANRDGPHAGRVRAEGALPSLRLTTPTEEAVYCFCGDDSDEEPVALASAPAPRHA
SAIPRAVKGERRAGRKEAQATPKATAPSRAQPRLIADETPPCCSLSSSASSLGEPEPAGT
QDVGPGAGRRGSPRPRAEAELLRGCSSSAVPRRRPQVSXPRRRRPRAARPEERPAEGPRG
RGRSEEALGSDHASDLDSVEWRAIQEGANSIVTWLHQAAAADTREASSESDSILSLASGL
SGSTLQPALRRRGPRPRSEGQAGSAARPEKRNATAAQRSGGPHSPCGPESXRSAGKTASG
APATLRGRTVLHAQPGPTRPPGGAPGPRPTPARMGPPSHGRPAAPSKVPGPGQRSRSLHR
PGKISELAALSPPHRSATPPARLTKTPSSSSSQTSPVAQALPRRSPSTSRVAGSLPSPGA
SPVPRTPARALLAKQHKTQKSPVRIPFMQKPARRGPPPLARVAPDPGPRGRAGAEGAPGA
RGSHSGLVRVASARASGSESDRAGFRRQLTFIKESPGLPRRCRSALSTAEGAAATPRXSS
SRRSRPGLPAVFLCSSRCDELRTAXPAVPRPQRLPTPRRPSSESPSRLPVRSQSAQPETV
KRYASLPHISVARRPEGAALGPAPXGCPRSSDGEGQPPPRVAVPGGTWRRIRDEDVPHIL
RSTLPATALPLVGAAPTEGPGSPPQRKTSDAVVQTEDFAAAKTNSSTSPSLESRGPPQAP
ASGPAPLLGSDVDGPGPAKAPASAPFVHEGLGVAAGGFPASRHSSPSRSVRVPPFNYVPS
PMVAAAADSAVEKAPVPADL
NT seq 6903 nt   +upstreamnt  +downstreamnt
atgccatctgtccctcaggttcccggggccagcactgtcctgcctgcggcagtgcatgcc
ggcctccccggcccggggtccagccccggggttaggtgggagggctcaggcttcggtccc
aggctgccccctcctcagagcctctgggaatgctctgacctgtggtctcaggagctgcag
gagctgaagatgacgagctctgtggcgccctacgagcagctggtgcggcaggtggaggcc
ttgaaggccgagaacagtcacctgaggcaggagctgcgagacaactcaagccacctgtcc
aagctggagacggagacgtccggcatgaaggaggtcctgaagcacttgcagggcaagctg
gagcaagaggcccgaatgctggtgtcctccgggcagaccgaggtgctagagcagctgaaa
gccctgcaggtggacattaccagcctgtacaacctgaagttccagccccccgccttgggc
cctgagcccgctgcccggacccccgagggaagcccggtccatggctccgggccctccaaa
gacggttttggggagctgagccgtgccaccatccggctgctggaggaactggacagggaa
cggtgttttctgttgaatgagattgagaaggaggagaaggagaagctctggtactactcc
cagctgcagggcttgtccaagcgcctggacgagctgccgcacgtggagacacagttctcc
atgcagatggacctgatccggcagcagctggagttcgaggcccagcacatccgctcgctg
atggaggagcgtttcggcacctcggacgaaatggtgcagcgggcgcagatccgcgcttcg
cgcttggagcagatcgacaaggaactgctgtcagcacaggaccgggtgcagcagaccgag
cctcaggccctgctggcggtgaagtcagtgccattggaggaggaccccgaaactgaggtc
cctacgcaccctggggacggtgcccctcagccgggcaacagcaaggtggaggtggtcttc
tggttgctgtccatgctggcgacgcgtgaccaggaggacacggcgcgcacgctgctcgcc
atgtccagctctcctgagagctgcgtggccatgcgccgctcgggctgcttgccgctgctg
ctgcagatcctgcacggcacagaggccgaagctgggggccgccccgggacccccggggcg
ccaggagccaaggacgcgcgcatgcgcgccaacgcggccctgcacaacatcgtcttctcc
cagccagaccagggcctggcgcgcaaggagatgcgcgtcctgcacgtgctggagcagatc
cgtgcctactgcgagacctgctgggactggctgcgggcccaggacggcggtgcgggcggc
agccccgtccccatcgagccgcagatctgccaggccacctgtgcggtgatgaagctgtct
tttgacgaggaataccgccgtgccatgaacgagctgggtgggctgcaggccgtggcagag
ttgctgcaggtcgactatgagatgcacaagatgacccgggacccgctcaacctcgcgctg
cgcagatacgccggcatgaccctcaccaacctcacctttggggacgtggccaacaaggtg
cccgttgggcgggcggggtcagtgctatcctttggtctgagcctccttccaccttgtcct
gaccttgaaacagtggtgtccagcatcctgcgcaacctgtcctggagggccgacatcaac
agcaagaaggtgctgagggaggtgggcagcatgaccgccctgatgcagtgcgtcctgcgg
gcctccaaggagtccaccctgaagagtgtgctcagcgccctgtggaacctctcggcccac
agcacggagaacaaggcggccatctgccaggtggacggcgccctgggcttcctggtgagc
acgctcacctacaagtgccagagcaactcgctggccatcatcgagagcggcgggggcatc
ctgcgcaacgtgtccagcctcatcgccacccgcgaggactacaggcaggtgctgcgggac
cacggctgcctgcagacgctgctgcagcacctcacctcgcacagcctgaccatcgtgagc
aacgcctgcggcacgctctggaacctgtccgcccgcagcccgcgggaccaggagctgctg
tgggacctgggcgcggtggccatgctgcgcaacctggtgcactccaagcacaagatgatc
gccatgggcagtgccgcggccctgcgcaacctgctggcccaccggcccgccaagcaccag
gcagcggccaccgccgtgtcgcccggcgcctgcgcgcccagcctgtacgtgcgcaagcag
cgcgcgctggaggcggagctggacgcgcgccacctggcccaggcgcttgaccacctggag
aagcagggcctgcccgaggctgaggccgaggccgccgccaagaagccgctgccgccgctg
cggcacctggacgggctggcccaggactacgcctccgactccggctgcttcgacgacgac
gacgcgccctcgctggccgccgtggccgccaccgccgagcccgccagccctgccgtgatg
tcgctcttcctgggcagccccttcctgcaggggcaggcgctggcccgcgccccgcctgcc
cgccgaggcggcctggaggcggaaaaggaggttggcggggaggcggccgtggcggccagg
gccaaggccaagctggctctggcagtggcgcggatcgaccggctggtggaggacatctcg
gccctgcacacctcgtccgacgacagctttagcctcagctccggggaccctgggcaggag
gtggcgagggagggccgcgcccagtcctgctcaccctgccggggccccgagggcgggcgg
cgggaggctggtggccgagcgcacccgctgctgcggctgaaggcggcccacgccagcctc
tccaacgacagcctcaatagcggcagcacgagcgacgggcatggtccgcgggagcatgcg
cagccctggccgctggccgcgctggccgagcgtcgtgaggggcccccacgtggccaggca
cggcccagccagcttgacctcggcctgcccggcggccgggccgagcccgcccgggaggcc
gccncaccgaccgcgagtgcgcgcaccatcaagctgtcgcccacctaccagcacgtgccg
ctgctcgaggggcggccagggcccnnggggctgctggcgtccggggcccgcaagccggcg
tggctgcccgcggaaggcctgggcaaggggcccgagaagctggcggcagaggcggcgccg
ctctgcctgtcccgctgcagctccctgtcctcgctgtcctcggccggccgcccgggcccc
agtgaggcaggggacctggaagacagtgactcgtccctggagggactggaggaggcaggc
cccagcgaggcggagctggacggggcctggcgcaggccaggggccacctccctgcccttg
gccatcccagcgccctcgcgcggccgggccctgggggtggaggacgccacgccgtccagc
tcctcggagaactgcgtgcaggagacgccgctggtgctgagccgctgcagctcggtgagc
tcgctgggcagcttcgagagcccgtccatcgcaagctccatccccagcgacccgtgcagc
gggctgggcagcggcacggtcagccccagcgagctgcccgacagccccgggcagaccatg
ccgcccagccgcagcaagacgccgccgctggcccctgccccgcccggcgagcgtgaggcc
accccgttcagcctgcagtgggagagctacgtgaagcgcttcctggacatcgcggactgc
cgggagcgctgccggctgccgcctgagctggacgccggcagcgtgcgcttcaccgtggag
aagccggacgagaacttctcgtgcgcctccagcctgagcgcgctggccctgcacgagcac
tacgtgcagaaggacgtggagctgcggctgctgcccccagcctgccccgagctcggcagc
ggcagcgccgctggccccggcctgcacttcgccgggcaccgccggcgggatgaggccagc
ggctgcgccgaggggcccccggccgccgaccaggagctggagctgctgcgggagtgcctg
ggcgcggccgtgcccgcccggctccgcaaggtggcctcggcactggtgtccggccgccgc
gcacggcccgtgcccgtctacatgctggtgcctgccccagcccgggaggacgactcgtgc
accgactcggccgagggcacgccggtcaacttctccagtgcggcctcgctcagcgacgag
acgctgcagcctcccagggacccacccggcgcccgtgcagatggacacaagcctgcgggc
cgcgtggctcctgctgggccagccgctgggcacaggcacaagggagggggcacaggcaag
agcacggagccggcccagggggcgggcgggggccgggcggggctggggcggcccgcccgc
aggcccccgagtgccggtgccaacagggacggcccccacgcaggccgggtgcgtgcggag
ggggcgctgccgtccctgcgcctcaccacgcccaccgaggaggctgtgtactgcttctgt
ggcgacgactcagatgaggagccggtggcgctggcctcggcgccggccccccggcacgca
tctgccatcccccgggcggtcaagggggaacgccgggctggcaggaaggaggcgcaggcc
acgcccaaggccacagcgcccagccgggcacagcccaggctcatcgccgacgagacgcca
ccctgctgctccctgagctcctccgccagctccctcggcgagccggagcccgcggggacc
caggacgtggggcccggagccgggcgccggggctcccctcgcccacgggccgaggcagag
ctgctgcggggctgtagcagctcggctgtgcccaggcgccggccccaggtgtcanggccg
cggcgtcgccggccccgagcggcccgcccggaggagcggccagcagaggggccacgggga
cggggacgcagcgaagaggccctggggtcggaccacgcctcggacctggacagcgtcgag
tggcgcgccatccaggagggcgccaactccatcgtcacctggctgcaccaagcggcggcg
gcagacacccgcgaggcctcttccgagtccgactccatcctgtccttggcatcggggctg
tcgggctctacgctgcagcccgccctgcgcaggagggggccgcggccgcggtcagagggc
caggcgggcagcgccgcgcggccggagaagcggaatgcgacggcggcccagcgcagtggc
ggtccccactcaccctgcggccccgagagcncacgcagcgctgggaagacggcgtccggg
gcgccggccacactccggggaaggacggtgctgcacgcccagcctggccccacgcgcccg
cccggaggggcccccggcccccgccccacgcccgccaggatggggcccccgagccacgga
cggccagcagcccccagcaaagtcccgggccctgggcaacggtctcggagcctgcaccgg
cccggcaagatctcggagctggcggccctcagcccgccccacaggagtgcgacgccgccc
gcccgcctcacgaagacgccctcgtccagctcctcccagacctcccctgtcgcccaggcc
ctgcccaggcggtcgccctccacttccagggtagccgggtccttgcccagccccggggcc
tcgccagtgcccaggacgcccgcacgggccctgctggccaagcagcacaagacacagaag
tcgcccgtgcggattcccttcatgcaaaagcccgcccggcgggggccaccgcccctggcc
agggtggctccagacccgggccccaggggccgggcaggggccgagggggcaccaggggcc
cgagggagccactcgggcctggtgcgcgtggcctcggcccgcgccagcggcagcgagtcg
gaccgcgcaggcttccggcgccagctgaccttcatcaaggagtcacccggcctgccgcgc
cgctgccggtcggctctgtccactgctgagggtgccgccgccaccccccgangcagctcg
agccgccgcagccggcccgggctgcccgccgtcttcctctgctcctcccgctgcgacgag
ctacgcaccgccnccccggcggtcccccggccccagcggctccccacgccccggcgcccc
agctccgaaagcccatcccgcctgcccgtgcgctcgcagtctgcccagcccgagacggtc
aagcgctacgcctcgctgccccacatcagcgtggcccgcaggcccgaaggcgccgccctg
gggcccgcgccgnnaggctgtccgcgcagcagtgacggggagggccagccgccaccccgg
gtggccgtgcccggcggcacgtggcgtcgcatccgggacgaggacgtgccgcacatcctg
cgcagcacgctgccagccaccgcgctgccactcgtgggcgccgcgcccacggaggggccg
ggcagccccccgcagcgcaagaccagtgacgccgtggtccagacagaggacttcgctgcc
gccaagaccaactccagcacgtccccaagcctggagagcagggggcccccgcaggccccg
gccagtggccccgcgcccctcctcggcagcgatgtggacgggccggggcccgccaaggcg
cccgcctctgccccattcgtccacgagggcctgggggtggctgcggggggcttccccgcc
agccggcacagttcccccagccgctcggtccgagtgcctcccttcaactacgtgcccagc
cccatggtggcggctgctgccgactcggccgtggagaaagcccctgtccccgcggacctc
tga

KEGG   Rousettus aegyptiacus (Egyptian rousette): 107512008
Entry
107512008         CDS       T06036                                 

Gene name
APC
Definition
(RefSeq) adenomatous polyposis coli protein isoform X1
  KO
K02085  adenomatosis polyposis coli protein
Organism
ray  Rousettus aegyptiacus (Egyptian rousette)
Pathway
ray04310  Wnt signaling pathway
ray04390  Hippo signaling pathway
ray04550  Signaling pathways regulating pluripotency of stem cells
ray04810  Regulation of actin cytoskeleton
ray04934  Cushing syndrome
ray05010  Alzheimer disease
ray05022  Pathways of neurodegeneration - multiple diseases
ray05165  Human papillomavirus infection
ray05200  Pathways in cancer
ray05206  MicroRNAs in cancer
ray05210  Colorectal cancer
ray05213  Endometrial cancer
ray05217  Basal cell carcinoma
ray05224  Breast cancer
ray05225  Hepatocellular carcinoma
ray05226  Gastric cancer
Brite
KEGG Orthology (KO) [BR:ray00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04310 Wnt signaling pathway
    107512008 (APC)
   04390 Hippo signaling pathway
    107512008 (APC)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04550 Signaling pathways regulating pluripotency of stem cells
    107512008 (APC)
  09142 Cell motility
   04810 Regulation of actin cytoskeleton
    107512008 (APC)
 09160 Human Diseases
  09161 Cancer: overview
   05200 Pathways in cancer
    107512008 (APC)
   05206 MicroRNAs in cancer
    107512008 (APC)
  09162 Cancer: specific types
   05210 Colorectal cancer
    107512008 (APC)
   05225 Hepatocellular carcinoma
    107512008 (APC)
   05226 Gastric cancer
    107512008 (APC)
   05217 Basal cell carcinoma
    107512008 (APC)
   05213 Endometrial cancer
    107512008 (APC)
   05224 Breast cancer
    107512008 (APC)
  09164 Neurodegenerative disease
   05010 Alzheimer disease
    107512008 (APC)
   05022 Pathways of neurodegeneration - multiple diseases
    107512008 (APC)
  09167 Endocrine and metabolic disease
   04934 Cushing syndrome
    107512008 (APC)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    107512008 (APC)
 09180 Brite Hierarchies
  09181 Protein families: metabolism
   01009 Protein phosphatases and associated proteins [BR:ray01009]
    107512008 (APC)
  09182 Protein families: genetic information processing
   03036 Chromosome and associated proteins [BR:ray03036]
    107512008 (APC)
  09183 Protein families: signaling and cellular processes
   04812 Cytoskeleton proteins [BR:ray04812]
    107512008 (APC)
Protein phosphatases and associated proteins [BR:ray01009]
 Protein serine/threonine phosphatases
  Phosphoprotein phosphatases (PPPs)
   Protein phosphatase-1
    PP1-interacting proteins (PIPs)
     107512008 (APC)
Chromosome and associated proteins [BR:ray03036]
 Eukaryotic type
  Centrosome formation and ciliogenesis proteins
   Other centriole associated proteins
    107512008 (APC)
Cytoskeleton proteins [BR:ray04812]
 Eukaryotic cytoskeleton proteins
  Microtubules
   Tubulin-binding proteins
    EB / APC
     107512008 (APC)
SSDB
Motif
Pfam: Arm_APC_u3 APC_basic EB1_binding APC_r APC_u5 APC_u14 APC_rep APC_u15 APC_u9 SAMP APC_u13 Arm APC_15aa Suppressor_APC
Other DBs
NCBI-GeneID: 107512008
NCBI-ProteinID: XP_016002551
LinkDB
Position
Unknown
AA seq 2872 aa
MYSSLGLGPVAALPASVPLSALGSWSGKGCIIPHERKLPGGARASGCGASVWQEVLKQLQ
GSIEDEAMTSSGQIDLLERLKELNLDSSNFPGVKLRSKMSLRSYGSREGSVSSRSGECSP
VPMGSFPRRGFVNGSRENTGYLEELEKERSLLLADLDKEEKEKDWYYAQLQNLTKRIDSL
PLTENFSLQTDMTRRQLEYEARQIRVAMEEQLGTCQDMEKRAQRRVTRIQQIEKDILRIR
QLLQSQATEAERSSQNKHEAGSHEAERQNEGQGVAEINMATSGNGQGSTARMDHETASVL
SSSSTHSAPRRLTSHLGTKVEMVYSLLSMLGTHDKDDMSRTLLAMSSSQDSCISMRQSGC
LPLLIQLLHGNDKDSVLLGNSRGSKEARARASAALHNIIHSQPDDKRGRREIRVLHLLEQ
IRAYCETCWEWQEAHEQGMDQDKNPMPAPVEHQICPAVCVLMKLSFDEEHRHAMNELGRK
ATRGISSQELGQGLSGGLQAIAELLQVDCEMYGLTNDHYSITLRRYAGMALTNLTFGDVA
NKATLCSMKGCMRALVAQLKSESEDLQQVIASVLRNLSWRADVNSKKTLREVGSVKALME
CALEVKKESTLKSVLSALWNLSAHCTENKADICAVDGALAFLVGTLTYRSQTNTLAIIES
GGGILRNVSSLIATNEDHRQILRENNCLQTLLQHLKSHSLTIVSNACGTLWNLSARNPKD
QEALWDMGAVSMLKNLIHSKHKMIAMGSAAALRNLMANRPAKYKDANIMSPGSSLPSLHV
RKQKALEAELDAQHLSETFDNIDNLSPKATHRSKQRHKQSLYGDYVFDTNRHDENRSDNF
NTGNMTVLSPYLNTTVLPSSSSSRGSLDSSRSEKDRSLERERGISLGNYHPATENPGTSS
KRGLQISTTAAQIAKVMEEVSAIHTAQEDRSSGSTTELHCGADERNALRRSSAAHTHSNT
YNFTKSETSNRTCPMPYAKLEYKRSSNDSLNSVSSSDGYGKRGQMKPSIESYSEDDESKF
CSYGQYPADLAHKIHSANHMDDNDGELDTPINYSLKYSDEQLNSGRQSPSQNERWARPKH
IIEDEMKQSEQRQSRSQSTTYPVYTESTDEKHLKFQPHFGQQECVSPYRSRGANGSETNR
VGSTHGINQNVNQSLCQDDDYEDDKPTNYSERYSEEEQHEEEERPTNYSIKYNEEKHHVD
QPIDYSLKYATDISSSQKPAFSFSKSSSGQSTKTEHISSSSENTSTPSSNAKRQNQLHPS
SAQSRSGQTQKATSCKVPSINQETIQTYCVEDTPICFSRCSSLSSLSSAEDEIGCDQTTK
ETDSADTLQRAEIKENSGTRSTEDSVSEVPTVSQHIRTKSSRLQSSGLPSESTRHKTVEL
SSGAKSPSKSGAQTPKSPPEHYVQETPLMFSRCTSVSSLDSFESRSIASSVQSEPCSGMV
SGIISPSDLPDSPGQTMPPSRSKTPPPPPPPQTVQAKQEVPKTKVPNAEKRESGPKPAVA
NAAVQRVQVLPDADTLLHFATESTPDGFSCSSSLSALSLDEPFIQKDVELRIMPPVQEND
NGNETESEQPEESNENQKKEAEKPADSEKDLLDDSDDDDIEILEECIISAMPTKSSRKAK
KPAQNASKLPPPVARKPSQLPVYKLLPSQNRIQAQKHVSFTPGDDMPRVYCVEGTPINFS
TATSLSDLTIESPPNELAAGESVRTGAQSGEFEKRDTIPTEGRSTDEAQRGKTSSITIPE
LDDSKTEEGDILAECINSAMPKGKSHKPFRVKKIMDQVQQASMTSSGTNKNQFDGKKKKP
TSPVKPMPQNTDYRTRVKKNTDSKNNLNAERNFSDNNDSKKQNSKNNSKDFNDKLPNNEE
RVKGSFTFDSPHHYTPIEGTPYCFSRNDSLSSLDFDDDDVDLSREKAELRKGKENKESEA
KVNSHTELTSSQQSANKTQAVTKHPVSRGQSKTVLQKQSTFPQSSKDIPDRGAATDEKLQ
NFAIENTPVCFSRNSSLSSLSDIDQENNNNKESELIKETEPADSQGEPSKPQPSGYAPKS
FHVEDTPVCFSRNSSLSSLSIDSEDDLLQECISSAMPKKKKPSRLKGDNEKHSPRNMGGI
LAEDLTLDLKDIQRPDSEHGLSPDSENFDWKAIQEGANSIVSSLHQAAAAACLSRQASSD
SDSILSLKSGISLGSPFHLTPDQEDKPFTSNKGPRILKPGEKSTLEAKKLESENKGIKGG
KKVYKSLITGKVRSNSEISSQMKQPLQTNMPSFSRGRTMIHIPGVRNSSSSTSPVSKKGP
PLKTPASKSPSECQTATTSPRGAKPSVKSELSPVTRQTSQQAGSNKGPSRSGSRDSTPSR
PAQQPLSRPIQSPGRNSISPGRNGISPPNKLSQLPRTSSPSTASTKSSGSGKMSYTSPGR
QMSQQNLTKQAALSKNGSSIPRSESASKGLNQLNNSNGSNKKVELSRMSSTKSSGSESDR
SERPVLVRQSTFIKEAPSPTLRRKLEESASFESLSPSSRPDSPTRSQAQTPVLSPSLPDM
SLSTHSSVQSGGWRKLPPNLSPTVEYNDGRPAKRHDIARSHSESPSRLPINRSGTWKREH
SKHSSSLPRVSTWRRTGSSSSILSASSESSEKAKSEDEKHVNCVSGTKQTKENQVSTKGT
WRKIKESEISPVNSTSQTTSSGAANGAETKTLIYQMAPAVSKTEDVWVRIEDCPINNPRS
GRSPTGNTPPVIDSVSEKGNPNAKDSKDNQGKQNVGNGSGPVRTMGLENRLNSFIQVDAS
DQKGTETKPGQSNSVPASETNESSIAERTPFGSSGSSKHSSPSGTVAARVTPFNYNPSPR
KSSVDSTSARPSQIPTPVNNNTKKRDSKTDSTESSGTQSPKRHSGSYLVTSV
NT seq 8619 nt   +upstreamnt  +downstreamnt
atgtactcctccttgggcttgggtccggtagccgctttgcccgcttctgtaccactctct
gccctcgggtcctggagcggcaaaggctgcattatcccgcatgagaggaagctcccgggc
ggcgcccgtgcttctggctgcggggcgagcgtctggcaggaagtacttaaacaactgcaa
ggaagtattgaagatgaagctatgacatcttctggacaaattgatttattagagcgtctc
aaagagcttaacttagatagcagtaattttcctggagtgaaactacggtcaaaaatgtcc
ctccgttcttatggaagccgggaaggatctgtatcgagtcgttcaggagagtgcagtcct
gtacctatgggttcatttccaagaagagggtttgtaaatggaagcagagaaaataccggt
tatttagaagaacttgagaaagagaggtcattacttcttgctgaccttgacaaagaagaa
aaagaaaaagactggtattatgctcaacttcagaatcttactaaaagaatagatagtctt
cctttaactgaaaatttttctttacaaacagatatgactagaaggcaattggaatatgaa
gcaagacaaatcagagttgcaatggaagagcaactaggaacgtgccaagatatggaaaaa
cgagcacagcgaagagtaaccagaattcaacaaatagagaaggacatacttcgtatacga
cagcttttacagtcccaagcaacagaagcagagaggtcatctcagaacaagcatgaagct
ggctcacatgaagctgagcggcagaatgaaggtcaaggagtggcagaaatcaacatggca
acttctggtaatggtcagggttccactgcacgtatggatcatgaaacagccagtgttttg
agttctagtagcacacactctgcacctcgaaggctgacaagtcatctgggaaccaaggtg
gaaatggtgtattcattgttgtcaatgcttggtactcatgataaggatgatatgtcgcga
actttgctagctatgtctagctcccaagacagctgtatatccatgcgacagtctggatgt
cttcctctcctcatccagcttttacatggcaatgacaaagactctgtgttgttgggaaat
tcccggggcagtaaagaggctcgggccagggccagtgcagcactccacaacatcattcac
tcacagcctgatgacaagagaggcaggcgtgaaatccgagtccttcatcttttggaacag
atacgagcttactgtgaaacctgttgggagtggcaggaagcccacgaacaaggcatggac
caggacaaaaacccaatgccagctcctgtcgaacatcagatctgtcctgctgtgtgtgtt
ctaatgaaactttcatttgatgaagagcatagacatgcaatgaatgaacttggtaggaag
gctacccggggcatttcatcacaggagctagggcaggggctttcagggggactgcaggcc
attgcagaattattgcaagtggactgtgaaatgtatgggcttactaatgaccactacagt
attacattaagacgatatgctggaatggctttgacaaacttgacttttggagatgtagcc
aacaaggctacgctctgctctatgaaaggctgcatgagagcactggtggcccaactaaaa
tctgaaagtgaggacttacagcaggtaattgcaagtgttttgaggaatttgtcttggcga
gcagatgtaaatagtaaaaagacattgcgtgaagttggaagtgtgaaagcattgatggaa
tgtgctttggaagttaaaaaggaatcaaccctcaaaagcgtattgagtgccttatggaat
ttgtcagcacattgcactgagaataaagctgatatatgtgctgtagatggtgcgcttgca
tttttggttggcactctcacttaccggagccagacaaatactttagccattattgaaagt
ggaggtgggatattacggaatgtgtccagcttgatagctacaaatgaagaccacaggcaa
atcctaagagagaacaactgcctgcaaaccttattacaacacttaaaatctcacagtttg
acaatagtcagtaatgcatgtggaaccttgtggaatctctcagcaagaaaccctaaagac
caggaagcattatgggacatgggggcagtcagcatgctcaagaacctcattcattcaaag
cacaaaatgattgctatgggaagcgctgcagctttaaggaatctcatggcaaatagacct
gcaaagtataaggatgccaatattatgtctcctggttcaagcttgccatctcttcatgta
agaaaacaaaaagccctagaagcagaattagatgctcagcatttatcagaaacttttgat
aatattgacaatttaagtcccaaggcaactcatcgtagtaagcaaagacacaagcaaagt
ctctatggtgactatgtttttgacaccaatcgacatgatgaaaacaggtcagacaatttt
aatactggaaacatgactgtcctttcaccatatttaaatactacagtgttgcccagctcc
tcttcatcgaggggaagtttagatagttctcgttctgaaaaagatagaagtttggagaga
gaacgaggaattagcctaggcaattatcacccagcgacagaaaatccaggaacctcttca
aagcgaggtttgcaaatttccaccactgcagcccaaattgctaaagtcatggaagaagta
tcagctattcacaccgcccaggaagacagaagttctgggtctaccaccgaattacactgt
ggggcagatgagaggaatgcactaagaagaagctctgctgcccacacacattcaaatact
tacaacttcactaaatcagaaacttcaaataggacatgtcctatgccttatgccaaattg
gaatataagagatcttcaaatgatagtttaaatagcgttagtagtagtgatggttatggt
aaaagaggtcaaatgaaaccttcaattgaatcctattctgaagacgatgaaagtaaattt
tgcagctatggtcaatatccagctgacctagcccataaaatacatagtgcaaatcatatg
gatgataatgatggagaactagacacaccaataaattatagtcttaaatattcagatgag
cagttgaactctggaaggcaaagtccttcacagaatgaaagatgggcgagacccaaacat
attatagaagatgaaatgaaacaaagtgagcaaagacaatcaaggagtcaaagcacaact
tatcctgtatatactgagagcactgatgagaaacacctcaagttccagccacattttggg
cagcaagaatgtgtttccccatataggtcaagaggagccaatggttcagaaacaaatcga
gtaggttctactcatggaattaatcaaaatgtaaaccagtctttgtgtcaggatgatgac
tatgaagatgataagccaaccaactatagtgaacgttactctgaggaagagcagcatgaa
gaagaagagagaccaacaaactatagcataaaatataatgaagaaaaacatcatgtggat
cagcctattgattacagtttaaaatatgccacagacatttcttcctcacaaaaaccagct
ttttcattctcaaagagttcatctggacaaagcactaaaactgaacacatctcttcaagc
agtgagaatacatccacaccttcatctaatgccaagaggcagaatcagctccatccaagt
tcagcgcagagcagaagtggtcagacacaaaaagccacctcttgcaaagttccctctatc
aaccaagaaacgatacagacgtattgtgtagaagataccccaatatgtttttcaaggtgt
agttcattatcatctttgtcatcagctgaagatgaaataggatgtgatcagacaacaaag
gaaacagattctgctgatacactacaaagagcagaaataaaagaaaacagtggaactaga
tcaactgaagattctgtaagtgaagttccaacagtgtcacagcacattagaaccaaatcc
agcagactccagtcttctggtttaccatcagaatcgaccagacacaaaactgttgagctt
tcttcaggagccaaatctccatcaaaaagtggtgctcagacacctaaaagtccaccagag
cactatgtccaagagactccactcatgtttagcagatgtacttctgtcagttcacttgat
agttttgagagccgttcaattgccagctctgttcagagtgaaccctgcagcggaatggta
agtggcattataagccccagtgatcttccagatagccctggacaaaccatgccaccaagc
agaagtaaaactcctcctcctcctcctcctcctcaaacagttcaagctaaacaagaggta
cctaaaactaaagttcctaatgctgaaaagagagaaagtggacctaagccagctgtggca
aatgctgcggttcagagggtccaggtgcttcctgacgctgatactttattacattttgcc
acagaaagtactccagatggtttttcttgttcatctagcctgagtgctctgagccttgat
gagccatttatacagaaggatgtagaattaagaataatgcctccagttcaggaaaatgac
aatgggaatgaaacagaatctgagcagccagaagaatcgaatgaaaaccagaaaaaggag
gctgaaaaacctgctgattctgaaaaagacctattagatgattcagatgatgacgatata
gaaatactggaagaatgtattatttctgccatgccaacaaaatcttcacgcaaagccaaa
aaaccagcccagaatgcttcaaagttacctccacctgtcgcaaggaaaccaagtcaacta
cctgtatacaaacttctgccatcacaaaacaggatacaagcacaaaagcatgttagtttt
acaccaggagatgatatgccacgcgtgtattgtgtagaagggacacctataaacttttcc
acagctacatctctaagtgatttaacaatagaatcccctccaaatgagttagctgctgga
gaaagcgttagaacaggggcccagtcaggtgaatttgaaaaacgagatactattcctaca
gaaggccgaagcacggatgaggctcaaagaggaaaaacgtcatctataactatacctgaa
ctggatgacagtaaaacagaagaaggtgatattcttgcagaatgcattaattctgccatg
cccaaaggaaaaagtcacaagcctttccgtgtgaaaaagataatggaccaggtccaacaa
gcatctatgacgtcatctggaactaacaagaatcaattcgatggtaagaaaaagaaacct
acttcaccagtaaaacctatgccacaaaatactgactataggacacgtgtaaaaaaaaat
acagattcaaaaaataatttaaatgctgaaagaaatttttcagacaacaacgattcaaaa
aaacagaattcaaaaaataattccaaggatttcaatgataagttaccaaataatgaagaa
cgagtcaaaggaagttttacttttgattcacctcatcattatacacctattgaaggaact
ccatactgtttttcacgaaatgactctttgagttctctagattttgatgatgatgatgtt
gacctttccagggaaaaggctgaattaagaaagggtaaagaaaataaggaatcagaagct
aaagttaacagtcacacagaactaacctccagccaacaatcagctaataaaacacaagct
gtaacaaaacatccagtaagtcgaggacagtctaaaaccgtgctgcagaagcagtccact
tttccccagtcatccaaagacataccagacagaggggcagcaactgatgaaaaattacag
aattttgctattgaaaatactccagtttgcttttctcgaaattcctctttaagttctctt
agcgacattgatcaggaaaacaacaacaacaaagaaagtgaacttatcaaagagacagag
cctgctgattcacagggagaaccaagtaaacctcagccatcaggttatgctccgaaatca
tttcacgttgaagatacccctgtttgtttctcaagaaacagttctctcagttctctcagt
atagattctgaagatgatctgttgcaagagtgtataagttctgcaatgccaaaaaagaaa
aagccttcaagactcaagggtgataatgaaaagcatagtcccagaaatatgggtggcata
ttagcagaagatttgacgcttgatttgaaagatatacagagaccagattctgaacatggt
ttatcccctgattcagaaaattttgattggaaagccattcaggaaggtgcaaattccata
gtaagtagtttacatcaagctgctgctgctgcatgtttatctagacaagcttcatcagat
tcagattccatcctttcactaaagtcaggaatctctctgggatcaccatttcatcttaca
cctgatcaagaggataaaccctttacaagtaataaaggcccacgaattctgaaacctggg
gagaaaagtacgttggaagctaaaaagttagaatctgaaaataagggaatcaaaggaggg
aaaaaggtttataagagtttgattactggaaaagttcggtcaaattcagaaatttcaagc
caaatgaaacagccccttcaaacaaacatgccttcattctctcgaggaaggacaatgatt
cacattccaggagttcgaaatagctcttcaagtacaagtccagtttctaaaaaaggccca
cccctaaagactccagcctccaaaagccccagtgaatgtcagacagccaccacttctcct
agaggagccaagccatcggtaaagtcagaattaagccctgttaccaggcagacatcccaa
caggctgggtcaaataaagggccatctaggtcaggatctagagattccactccttcaaga
cctgcccagcaaccgttaagtagacctatacagtctccagggcgaaactcaatttctcct
ggtagaaatggaataagccctcctaacaaattatctcaactgccaaggacatcatcccct
agtactgcttcaaccaagtcctcgggttctgggaaaatgtcctacacatctccaggcaga
cagatgagtcagcagaaccttaccaaacaagcggctttatccaagaatggcagcagtatc
ccaagaagtgagtctgcctccaaaggactaaatcagctgaataatagcaatgggtccaac
aaaaaggtagaactttctagaatgtcttcaactaaatcaagtggaagtgaatctgatagg
tcagagagacccgtattagtacgccagtcaactttcatcaaagaagctccaagcccaacc
ctgaggagaaaattggaggaatccgcatccttcgaatctctctctccatcttctagacca
gattctcccactaggtcccaggcacagactccagttttaagtccttcccttcctgatatg
tctctgtcaacacattcatctgttcagtctggtggatggcgaaaactcccacctaatctc
agtcccactgtagagtataatgatggaagaccagcaaagcgccatgacatagcacgctcc
cattctgaaagtccttccagacttccaatcaataggtcaggaacctggaaacgtgagcac
agcaaacattcgtcatcccttcctcgagtaagcacgtggagaagaactggaagttcatcc
tcaattctttctgcttcatcagaatccagtgaaaaagcaaaaagcgaggatgaaaaacat
gtgaactgtgtttcaggaaccaaacaaactaaagaaaaccaagtatccacaaagggaaca
tggagaaaaataaaggaaagtgaaatttctcccgtaaatagtacttctcagaccacttcc
tcaggtgctgcaaatggtgctgaaacaaagactctaatttatcaaatggcacctgctgtt
tcaaaaacagaggatgtttgggtgagaattgaggactgtcccattaacaaccctagatcc
ggaagatctcccacaggaaatactcccccagtgattgacagtgtttcggaaaagggaaac
ccaaatgctaaagattcaaaagataaccagggaaaacaaaatgtgggtaatggcagtggt
cctgtacgcaccatgggtttggaaaaccgcctgaactcctttattcaggtagatgcctca
gaccaaaaaggaactgaaacaaaaccgggacaaagtaattctgttcctgcatcagagact
aatgaaagttctatagctgaacgtaccccatttggttctagcggttcaagcaagcacagt
tcacctagtgggactgttgctgccagagtgactccttttaattacaacccaagccctagg
aaaagcagcgtagatagcacttcggctcggccatctcagatcccaactccagtgaataac
aacacaaagaaacgagattcaaaaactgacagcacagaatccagtggaactcaaagccct
aagcgccattctgggtcttaccttgtgacgtctgtttaa

DBGET integrated database retrieval system