Delphinapterus leucas (beluga whale): 111170519
Help
Entry
111170519 CDS
T05885
Gene name
APC2
Definition
(RefSeq) adenomatous polyposis coli protein 2
KO
K02085
adenomatosis polyposis coli protein
Organism
dle
Delphinapterus leucas (beluga whale)
Pathway
dle04310
Wnt signaling pathway
dle04390
Hippo signaling pathway
dle04550
Signaling pathways regulating pluripotency of stem cells
dle04810
Regulation of actin cytoskeleton
dle04934
Cushing syndrome
dle05010
Alzheimer disease
dle05022
Pathways of neurodegeneration - multiple diseases
dle05165
Human papillomavirus infection
dle05200
Pathways in cancer
dle05206
MicroRNAs in cancer
dle05210
Colorectal cancer
dle05213
Endometrial cancer
dle05217
Basal cell carcinoma
dle05224
Breast cancer
dle05225
Hepatocellular carcinoma
dle05226
Gastric cancer
Brite
KEGG Orthology (KO) [BR:
dle00001
]
09130 Environmental Information Processing
09132 Signal transduction
04310 Wnt signaling pathway
111170519 (APC2)
04390 Hippo signaling pathway
111170519 (APC2)
09140 Cellular Processes
09144 Cellular community - eukaryotes
04550 Signaling pathways regulating pluripotency of stem cells
111170519 (APC2)
09142 Cell motility
04810 Regulation of actin cytoskeleton
111170519 (APC2)
09160 Human Diseases
09161 Cancer: overview
05200 Pathways in cancer
111170519 (APC2)
05206 MicroRNAs in cancer
111170519 (APC2)
09162 Cancer: specific types
05210 Colorectal cancer
111170519 (APC2)
05225 Hepatocellular carcinoma
111170519 (APC2)
05226 Gastric cancer
111170519 (APC2)
05217 Basal cell carcinoma
111170519 (APC2)
05213 Endometrial cancer
111170519 (APC2)
05224 Breast cancer
111170519 (APC2)
09164 Neurodegenerative disease
05010 Alzheimer disease
111170519 (APC2)
05022 Pathways of neurodegeneration - multiple diseases
111170519 (APC2)
09167 Endocrine and metabolic disease
04934 Cushing syndrome
111170519 (APC2)
09172 Infectious disease: viral
05165 Human papillomavirus infection
111170519 (APC2)
09180 Brite Hierarchies
09181 Protein families: metabolism
01009 Protein phosphatases and associated proteins [BR:
dle01009
]
111170519 (APC2)
09182 Protein families: genetic information processing
03036 Chromosome and associated proteins [BR:
dle03036
]
111170519 (APC2)
09183 Protein families: signaling and cellular processes
04812 Cytoskeleton proteins [BR:
dle04812
]
111170519 (APC2)
Protein phosphatases and associated proteins [BR:
dle01009
]
Protein serine/threonine phosphatases
Phosphoprotein phosphatases (PPPs)
Protein phosphatase-1
PP1-interacting proteins (PIPs)
111170519 (APC2)
Chromosome and associated proteins [BR:
dle03036
]
Eukaryotic type
Centrosome formation and ciliogenesis proteins
Other centriole associated proteins
111170519 (APC2)
Cytoskeleton proteins [BR:
dle04812
]
Eukaryotic cytoskeleton proteins
Microtubules
Tubulin-binding proteins
EB / APC
111170519 (APC2)
BRITE hierarchy
SSDB
Ortholog
Paralog
GFIT
Motif
Pfam:
APC_basic
APC_rep
APC_N_CC
APC_r
Suppressor_APC
Arm_APC_u3
Arm
SAMP
bZIP_1
Sec2p
HALZ
Wtap
APG6_N
Motif
Other DBs
NCBI-GeneID:
111170519
NCBI-ProteinID:
XP_030616532
LinkDB
All DBs
Position
Unknown
AA seq
2270 aa
AA seq
DB search
MGLLGLLSLLHSAFFGDQALQELKMTSSVAPYEQLVRQVEALKAENSHLRQELRDNSSHL
SKLETETSGMKEVLKHLQGKLEQEARVLVSSGQTEVLEQLKALQMDITSLYNLKFQPPAL
VPEPTARTPEGSPVHSSGPSKDSFGELSRATIQLLEELDRERCFLLNEIEKEEKEKLWYY
SQLQGLSKRLDELPHVETQFSMQMDLIRQQLEFEAQHIRSLMEERFGTSDEMVQRAQIRA
SRLEQIDKELLSAQDRVQQTEPQALLAVKSMPMDEDPEAEVPTHPEDGAPQPGNSKVEVV
FWLLSMLATRDQEDTARTLLAMSSSPESCVAMRRSGCLPLLLQILHGTEAGAGGRNGTPG
APGAKDARMRANAALHNIVFSQPDQGLARKEMRVLHVLEQIRAYCETCWDWLQARDGGPE
GSSAGGAPVPIEPQICQATCAVMKLSFDEEYRRAMNELGGLQAVAELLQVDYEMHKMTRD
PLNLALRRYAGMTLTNLTFGDVANKAALCARRGCMEAIVAQLASESEELHQVVSSILRNL
SWRADISSKKVLREVGSMTALMQCVLRASKVGTGRGASHVAGRHQQQVVSSPPWGKSSQM
DSPPPGSGSRPGASNPKDLAVAGCVTFFPPHPSGPRRQVLRDHNCLQTLLQHLTSHSLTI
VSNACGTLWNLSARSAGDQELLWDLGAVGMLRNLVHSKHKMIAMGSAAALRNLLAHRPAK
YQAAATAVSPSACAPSLYVRKQRALEAALDARHLAQALDHLEKQGLPEAEAASKKPLPPL
RHLDGLAQDYASDSGCFDDDDAPSLATAAAATAEPASPAVLPLFLGSPFLQGQALARAPP
ARRGGLESEKEAGGEAAVAARAKAKLALAVARIDRLVEDISALHTSSDDSFSLSSGDPGQ
EAPREGRAHSCSPCRGPEAGRREAGSRAHPLLRLKAAHASLSNDSLNSGSTSDGHCPREH
SQPCSLAALAEHCEGPLCGQARPSRLDLNLPSGQVEPKARDTAATDARVCTIKLSPTYQH
VPLLEGTARAGAGSLAPRARKQAWLPAEDLSKVPEKLAVEKAPLSLSRCSSLSSLSSAGR
PGPSEAGDLDDSDSSLEGLEEAGPSEAGLDGAWQGPGAASLPMAIPVPQRGRGLGVEDAT
PSSSSENCVQETPLVLSRCSSVSSLGSFESPSIASSVASDPCSGLGSGTVSPSELPDSPG
QTMPPSRSKTPPPAPVPPGEREVTQFSLQWESYVKRFLDIADCRERCRLPSELDAGSVRF
TVEKPDENFSCASSLSALALHELYVQKDVELRLLPPACPERSSAGGVGPGHRRRDEASGR
LEGPTSTDRDLELLRECLGGAVPARLRKVASALVPGHRTLPVPVYMLVPAPAREDESCTD
SAEGTPVTFSSATSLSEETLQGPPGDGGPAQGQKAAGRAAPTRQPAGHRHRAGGTGRSTE
QPRGAGRSRAGLELPLCRPPSACRDRDSSRPGQARGDGALQSLCLTTPTEEAVYCFYGND
SDEEPSAAVAVAPPRRTSAIPRAVKREYPAGGRKEVQAVPKVAPPKAAAPKVAPPARAQP
SLIADETPPCYSLSSSASSLSEPEPFEHAASRPRAREPGVTKDPGPGGRRDSAPRPRAEA
ELLRRCTGSAVPRRRPQVSGPGCRQSRAVQQEKRPAEGPREHSEEAAGSDHASDLDSVEW
RAIQEGANSIVTWLHQAAAAATHEASSESDSVLSFASGRSVGSTLQLPPHRKGRRPRAEG
QAGSAMRPEKRDRALAQRSSGLEKPRGTQKAASGVPAVLRGRTVIYMPSPTTRAQPKGAP
GPRNVPRKTGVPNPVQPAAPAKIPGPGQQRSRSLHRPGKISELAALSPPQRSATPPARLA
KTPSSSSSQTSPASQPLPRRSPPATQAAGTLPGPGASPATKTPARALLAKQHKTQKSPVR
IPFMQRPTRRGPPPLAKAAPEPGPRARGGRPGLVRVASARSSGSVASARSSGSEASDRSG
FRRQLTFIKESPGLLRRRRTELSAAEAATPAAQAGLPRRGRPALPAVFLCSSRCDELRAA
PRQAPAPQRPPTARPGLGERPPRRTSSESPSRLPVRTPAARPDTVKRYASLPHISVARGP
DAPVPVADNAPRSSTGEAAPGTTWRRIRDEDVPHILRSTLPPRALPLLGSSPEDGPTGPP
QRKTSDAVVQTEDFAATKTNSSTSPSLESWVTPQATTGGAPSLLLGSDVDGPGPAKAPAP
GPFVPASRHGSPSRSARVPPFNYVPSPMVVATTDSAVEKAPAPAPTGLLG
NT seq
6813 nt
NT seq
+upstream
nt +downstream
nt
atggggctcctggggctgctgagcctgctgcactcggccttcttcggggaccaggcgctg
caggagctgaagatgacgagctccgtggcgccctacgagcagctggtgcggcaggtggag
gccttgaaggccgagaacagtcacctgaggcaggagcttcgggataactcaagccacttg
tccaagctggagacagaaacgtcgggcatgaaggaggtcctgaagcacttgcagggcaag
ctggagcaggaggcccgtgtgctggtgtcctcggggcagaccgaggtgctggagcagctg
aaagccctgcagatggacatcaccagcctgtacaacctcaagttccagcccccggctctg
gtccctgagcccactgcccggacccccgagggaagcccggtgcacagctctgggccctcc
aaggacagctttggggagctgagccgggccaccatccagctgctggaggaactggaccgg
gaacggtgtttcctattgaatgagatcgagaaggaggagaaggagaagctctggtattac
tcgcagctgcagggcctatccaagcgcctggacgaactcccgcacgtggagacgcagttc
tcgatgcagatggatctgatccggcagcagctggagttcgaagcccagcacatccgctcg
ctgatggaggagcgcttcggcacctcggacgagatggtgcagcgggcgcagatccgtgct
tcccgcctggagcagatagacaaggaattgctgtcggcacaggaccgggtgcaacagacc
gagccccaggccttgctggcagtgaagtcgatgccgatggatgaggacccagaggccgag
gtccccacgcaccctgaggatggtgcccctcagccgggcaacagcaaggtggaggtggtc
ttctggctgctgtccatgctggcgacgcgtgaccaggaggacacggcacggacgctgctc
gccatgtccagctcacctgagagctgcgtggccatgcgccgctcgggctgcctgccgctg
ctgctgcagatcctgcacggcaccgaggcgggggctgggggtcgcaacgggaccccaggg
gcgccgggagccaaggatgcgcgcatgcgcgccaacgcagcgctgcacaacatcgtcttc
tcccagccggaccagggtctggcacgcaaggaaatgcgcgtcctgcacgtgctcgagcag
atccgtgcctactgtgagacctgctgggactggctgcaggcccgggacggtgggcccgag
ggcagcagcgccggcggcgccccggtccccatcgagccacagatctgccaggccacctgc
gccgtgatgaagctgtccttcgacgaggaataccgccgtgccatgaacgagctgggtggg
ctgcaggccgtggcggagttactgcaggtcgactatgagatgcacaagatgacccgggac
cctctcaacctggccctgcgccgatacgccggcatgaccctcaccaacctaaccttcggg
gacgtcgccaacaaggctgcactgtgcgcccgccggggctgcatggaggccatcgtggcc
cagctggcgtccgagagcgaggagctgcaccaggtggtgtccagcatcctgcgcaacctg
tcctggagggcggacatcagcagcaagaaggtgctgagggaggtgggcagcatgacggcc
ctgatgcagtgcgtcctgcgagcctccaaggtgggcactgggcggggtgccagccacgtg
gctgggaggcaccagcagcaggtggtgagttcgccaccttgggggaagtcaagccagatg
gacagcccgccgcctggcagcgggtccaggccgggggcttccaacccaaaagatcttgct
gtggccggttgtgttactttttttcctcctcacccctccggcccccgcaggcaggtgctg
cgggaccacaactgcctgcagacgctgctgcagcacctgacgtcgcacagcctgaccatc
gtgagcaacgcatgtggcacgctctggaacctgtcggcccgcagcgccggtgaccaggag
ctgctgtgggacctgggtgctgtgggcatgctgcgcaacctggtgcactccaagcacaag
atgatcgccatgggcagcgccgccgccctgcgcaacctgctggcccaccggcccgccaag
taccaggcggcggccaccgccgtctcccccagcgcctgtgcgcccagcctgtacgtgcgc
aagcagcgggcgctggaggctgcgctggacgcgcggcacctggctcaggcactcgaccac
ctggagaagcagggcctgcccgaggccgaggccgcctccaagaagccgctgccgcccctg
cgccacctggacggcctagcccaggactacgcttccgactcgggctgcttcgatgatgac
gacgcaccctccctggccacagccgctgccgccactgccgagcccgccagccccgccgtg
ctgcccctcttcctgggcagccccttcctgcagggccaggcgctggcccgtgccccgccc
gcccgccggggcggcctggagtccgagaaggaggccggcggggaggcagctgtggcagcc
agggccaaggccaagctggcactggcagtggcgcgcatcgaccggctggtggaggacatc
tcggctctgcacacctcgtctgacgacagcttcagcctcagctctggggatcccgggcag
gaggccccacgggagggccgcgcgcactcctgctccccttgccgggggcccgaggcgggg
cggcgagaggccggcagccgggctcacccgctgttgcggctcaaggcggcccatgccagc
ctctccaacgacagtcttaacagcggcagcaccagcgacgggcactgtccccgcgagcac
tcgcagccctgctcgctggccgcgctggccgagcattgcgagggacccctgtgtggccag
gcgcggcccagccggcttgacctcaacctgcccagcggccaggttgagcccaaggcccgg
gacaccgcggccacagatgcccgcgtgtgcaccatcaagctgtcacccacctaccagcat
gtgccactgcttgagggcaccgccagagcgggtgcggggtccctggcccccagggcccgg
aaacaggcctggctgcccgcagaggacctgagcaaggtgcccgagaagctggcggtggag
aaggcgcccctctccctatcccgctgcagctccctgtcctcactgtcctcggccggccgc
ccagggcccagtgaggccggggacctggacgacagcgactcgtccctggaggggctggag
gaggctggccccagcgaggccgggctggacggggcctggcaggggccgggcgccgcctcc
ctgcccatggccatcccggtgcctcagcggggccggggcctgggggtggaggacgccacg
ccgtccagctcatctgagaactgcgtacaggagacgccgctggtgctgagccgctgcagc
tcagtcagctcgctgggcagcttcgagagcccatccatcgccagctccgtcgccagcgat
ccatgcagcgggctgggcagcggcacagtcagccccagcgagctgcccgacagccccggg
cagaccatgccaccaagccgcagcaagacgcccccgccggcccccgtgccgccgggcgag
cgtgaggtcacccagtttagcctgcagtgggagagctacgtgaagcgcttcctggacatc
gccgactgccgggagcgctgccggctgccgtctgagctggacgcgggcagcgtgcgcttc
accgtggagaagcccgacgagaacttctcgtgtgcttccagcctgagcgcgctggccctg
catgagctctatgtgcagaaggacgtggagctgcggctgctgcccccggcctgccctgag
cgcagcagtgcgggaggcgtgggccccgggcaccgccggcgggacgaggccagcggccgc
ctcgaagggccaacatccaccgaccgggacctggagctgctgcgcgagtgcctgggtggg
gccgtgcccgcccggctccgcaaggtggcctcggcgctggtgcctggccaccgcaccctg
cccgtgcccgtctacatgctggtgcctgccccggcacgggaggatgagtcctgcaccgac
tcggccgagggcacgccggtcaccttctccagcgccacctccctcagcgaggagacactg
cagggaccccccggggatggcgggcctgcgcaggggcagaaggctgcgggccgtgccgcc
cccaccaggcagcccgctgggcaccggcacagggcggggggcacgggccggagcacagag
cagccccggggggctggcaggagccgcgcagggctggagctgcccctctgccggccccct
agcgcctgcagggacagggacagctcccgcccgggccaggcacgtggggacggggccctg
cagtctctgtgcctcacgacgcccaccgaggaggccgtgtactgcttctacggcaacgac
tcagacgaagagccgtccgcggcagtggcggtggcacccccgcggcggacatctgcgatc
ccccgcgcggtgaagagggagtacccggctggtggcaggaaggaggtgcaggccgtgccc
aaggtcgcgccgcccaaggctgctgcgcccaaggtcgcgccgcccgcccgggctcagccc
agcctcatcgctgatgagacgccaccatgctactccttgagctcctccgccagctccctg
agcgagcctgagccctttgagcacgcggccagccggccccgagcccgtgagccaggggtc
accaaggacccaggccccgggggcaggcgggacagcgccccccgcccgcgggccgaggca
gagctgctccggcgctgcactggctcagccgtgcccaggcgccggccccaggtgtctggc
ccagggtgccgccagtccagagcggtgcagcaggagaagaggccagcagaggggccccgg
gagcacagtgaggaggcagcgggctcggaccatgcctcagacctggacagcgtcgagtgg
cgcgccatccaggagggggccaactccatcgtcacgtggctacaccaggcagcggcggca
gccacccacgaggcctcctctgagtccgactccgttctgtcctttgcctcagggcggtcg
gtgggctccaccctgcagcttcccccgcacaggaagggtcgaaggccaagggcagagggt
caggcgggcagtgccatgcggccagagaaacgggacagggctctggcccagcgcagcagc
ggcctggagaagccacgtggcactcagaaggccgcgtctggggtgccagccgtgctccgg
ggacggacggtgatctacatgcccagcccaaccacccgggcccagcccaaaggtgcccct
gggccccgcaatgtgccgagaaagacgggagtcccaaatccagtgcagccagcagccccc
gccaaaatccctggccccgggcagcagcggtctcgaagcctgcaccgacccggcaagatc
tcggagctggcggcgctgagcccccctcagaggagtgccacgccacctgcccgcctcgcc
aagaccccctcgtcgagctcctcccagacctccccggcctcacagcctctgccgaggcgg
tcacccccggccacccaggccgcaggaaccctgcccggccccggggcctcccccgcaacc
aagactcccgcccgggccctgctggccaagcagcacaagacacagaagtcgcccgtgcgg
atccccttcatgcagagacccaccaggcgggggccgccacccctggccaaggcagccccg
gaaccaggtccaagggcccgagggggccgcccagggctcgtgcgtgtggcctccgcccgc
tccagcggcagcgtggcctccgcccgctccagcggcagcgaggcctccgaccgctccggt
ttccggaggcagctgaccttcatcaaggagtcgccgggcctgctgcgccgcagacgcacc
gaactgtccgccgccgaggccgccaccccggctgcccaggcaggcctgccccgccgcggc
cggcccgcgctacccgccgtcttcctatgctcctcgcgctgtgatgagctgcgggcggcc
ccccggcaggctcccgccccccagcggccacccacggcccggcccggcctgggcgagcgg
ccgccccggcgcaccagctctgagagcccgtctcgcctgcccgtccgcacgccagccgcc
cggcccgatacggtcaagcgctacgcctccctgcctcacatcagtgtggcccgcgggccc
gacgcccccgtgcctgtggcagacaacgcgccccgcagcagcaccggggaggccgcgccg
ggcaccacgtggcgtcgcatccgggacgaggacgttccgcacatcctgcggagcacgctg
cccccccgcgccctgcccctgctgggctcctcaccggaggacggccccacaggccctccg
cagcgcaagaccagcgacgccgtggtccagaccgaggacttcgcggctaccaagaccaac
tcgagcacgtccccgagcctggagagctgggtgaccccacaggccacgaccggcggcgcc
ccctccctcctcctcggcagcgacgtggacgggccgggtcccgccaaggcgcccgccccc
ggccccttcgtccccgccagccgacacggttcccccagccgctccgcccgcgtccccccc
ttcaactacgtgcccagccccatggtggtagccaccactgactctgccgtggagaaagcc
cccgcccccgcccctaccggcctcctgggatag
Delphinapterus leucas (beluga whale): 111183639
Help
Entry
111183639 CDS
T05885
Gene name
APC
Definition
(RefSeq) adenomatous polyposis coli protein isoform X1
KO
K02085
adenomatosis polyposis coli protein
Organism
dle
Delphinapterus leucas (beluga whale)
Pathway
dle04310
Wnt signaling pathway
dle04390
Hippo signaling pathway
dle04550
Signaling pathways regulating pluripotency of stem cells
dle04810
Regulation of actin cytoskeleton
dle04934
Cushing syndrome
dle05010
Alzheimer disease
dle05022
Pathways of neurodegeneration - multiple diseases
dle05165
Human papillomavirus infection
dle05200
Pathways in cancer
dle05206
MicroRNAs in cancer
dle05210
Colorectal cancer
dle05213
Endometrial cancer
dle05217
Basal cell carcinoma
dle05224
Breast cancer
dle05225
Hepatocellular carcinoma
dle05226
Gastric cancer
Brite
KEGG Orthology (KO) [BR:
dle00001
]
09130 Environmental Information Processing
09132 Signal transduction
04310 Wnt signaling pathway
111183639 (APC)
04390 Hippo signaling pathway
111183639 (APC)
09140 Cellular Processes
09144 Cellular community - eukaryotes
04550 Signaling pathways regulating pluripotency of stem cells
111183639 (APC)
09142 Cell motility
04810 Regulation of actin cytoskeleton
111183639 (APC)
09160 Human Diseases
09161 Cancer: overview
05200 Pathways in cancer
111183639 (APC)
05206 MicroRNAs in cancer
111183639 (APC)
09162 Cancer: specific types
05210 Colorectal cancer
111183639 (APC)
05225 Hepatocellular carcinoma
111183639 (APC)
05226 Gastric cancer
111183639 (APC)
05217 Basal cell carcinoma
111183639 (APC)
05213 Endometrial cancer
111183639 (APC)
05224 Breast cancer
111183639 (APC)
09164 Neurodegenerative disease
05010 Alzheimer disease
111183639 (APC)
05022 Pathways of neurodegeneration - multiple diseases
111183639 (APC)
09167 Endocrine and metabolic disease
04934 Cushing syndrome
111183639 (APC)
09172 Infectious disease: viral
05165 Human papillomavirus infection
111183639 (APC)
09180 Brite Hierarchies
09181 Protein families: metabolism
01009 Protein phosphatases and associated proteins [BR:
dle01009
]
111183639 (APC)
09182 Protein families: genetic information processing
03036 Chromosome and associated proteins [BR:
dle03036
]
111183639 (APC)
09183 Protein families: signaling and cellular processes
04812 Cytoskeleton proteins [BR:
dle04812
]
111183639 (APC)
Protein phosphatases and associated proteins [BR:
dle01009
]
Protein serine/threonine phosphatases
Phosphoprotein phosphatases (PPPs)
Protein phosphatase-1
PP1-interacting proteins (PIPs)
111183639 (APC)
Chromosome and associated proteins [BR:
dle03036
]
Eukaryotic type
Centrosome formation and ciliogenesis proteins
Other centriole associated proteins
111183639 (APC)
Cytoskeleton proteins [BR:
dle04812
]
Eukaryotic cytoskeleton proteins
Microtubules
Tubulin-binding proteins
EB / APC
111183639 (APC)
BRITE hierarchy
SSDB
Ortholog
Paralog
GFIT
Motif
Pfam:
Arm_APC_u3
APC_basic
EB1_binding
APC_r
APC_u5
APC_u14
APC_u15
APC_rep
APC_u9
SAMP
APC_N_CC
Arm
APC_u13
APC_15aa
Suppressor_APC
JIP_LZII
Motif
Other DBs
NCBI-GeneID:
111183639
NCBI-ProteinID:
XP_022446871
UniProt:
A0A2Y9PNP0
LinkDB
All DBs
Position
Unknown
AA seq
2864 aa
AA seq
DB search
MAAASYDQLLKQVEALKMENSNLRQELEDNSNHLTKLETEASNMKEVLKQLQGSIEDEAM
ASSGQIDLLERLKELNLDSSNFPGVKLRSKMSLRSYGSREGSVSSRSGECSPVPMGSFPR
RGFVNGSRENTGYLEELEKERSLLLADLDKEEKEKDWYYAQLQNLTKRIDSLPLTENFSL
QTDMTRRQLEYEARQIRVAMEEQLGTCQDMEKRAQRRITRIQQIEKDILRIRQLLQSQAT
EAERSSQSKHEAGSHEAERQNESQGVAEINMATSGSGQGSTARIDHETASVLSSSSTHSA
PRRLTSHLGTKVEMVYSLLSMLGTHDKDDMSRTLLAMSSSQDSCISMRQSGCLPLLIQLL
HGNDKDSVLLGNSRGSKEARARASAALHNIIHSQPDDKRGRREIRVLHLLEQIRAYCETC
WEWQEAHEQGMDQDKNPMPAPVEHQICPAVCVLMKLSFDEEHRHAMNELGRKATRGISSQ
ELGQGLSGGLQAIAELLQVDCEMYGLTNDHYSITLRRYAGMALTNLTFGDVANKATLCSM
KGCMRALVAQLKSESEDLQQVIASVLRNLSWRADVNSKKTLREVGSVKALMECALEVKKE
STLKSVLSALWNLSAHCTENKADICAVDGALAFLVGTLTYRSQTNTLAIIESGGGILRNV
SSLIATNEDHRQILRENNCLQTLLQHLKSHSLTIVSNACGTLWNLSARNPKDQEALWDMG
AVSMLKNLIHSKHKMIAMGSAAALRNLMANRPAKYKDANIMSPGSSLPSLHVRKQKALEA
ELDAQHLSETFDNIDNLSPKASHRSKQRHKQNLYGDYVFDTNRHDDNRSDNFNTGNMTVL
SPYLNTTVLPSSSSSRGSLDSSRSEKDRSLERERGISLVNYHPATENPGTSSKRGLQIST
TAAQIAKVMEEVSAIHTSQEDRSSGSTPELHCGTDERNALRRSSTTHTHANSYNFTKSEN
SNRTCPVPYAKVEYKRSSNDSLNSVSSSDGYGKRGQMKPVESYSEDEESKFCSYGQYPAD
LAHKIHSANHMDDNDGELDTPINYSLKYSDEQLNSGRQSPSQNERWARPKHIIEDEIKQN
EERQSRSQSTTYPVYPESTDDKHLKFQPHFGQQECVSPYRSRAANGSETNRVGSNHGINQ
NVNQSLCQEDDYEDDKPTNYSERYSEEEQHEEEERPTNYSIKYNEEKHHVDQPIDYSLKY
TTDIPSSQKPAFSFSKNSSGQSTKTEHISSSSENTATPSSNAKRQNQLHHSSAQSRSGQT
QKATSSSCKVPSINQETIQTYCVEDTPICFSRCSSLSSLSSAEDEIGCDQTTQEADSANT
LQIAEIKESSGTRSTEDSVSEVPTVSQHIRTKSSRLQASGLSSESTRHKAVEFSSGAKSP
SKSGAQTPKSPPEHYVQETPLMFSRCTSVSSLDSFESRSIASSVQSEPCSGMVSGIISPS
DLPDSPGQTMPPSRSKTPPPPPPQSAQTKQEVPKNKAPSAEKRESGPKQAAVNAAVQRVQ
VLPDADALLHFATESTPDGFSCSSSLSALSLDEPFIQKDVELRIMPPVQENDNGNETENE
QPEESNESQGKEAEKPTDSEKDLLDDSDDDDIEILEECIISAMPTRSSRKAKKPAQTSSK
LPPPVARKPSQLPVYKLLPSQNRLQAQKHVSFTPGDDMPRVYCVEGTPINFSTATSLSDL
TIESPPNELAAGEGVRAGTQSGEFEKRDTIPTEGRSTDEAQTGKASSVTVPELDDNKTEE
GDILAECINSAMPKGKSHKPFRVKKIMDQVQQASMSSSGTNKNQLDGKKKKPTSPVKPIP
QNTEYRTCVRKNTDSKNNLNAERNFSDNKDSKKQNLKNNSKDFNDKVPNNEDRVRGSFTF
DSPHHYTPIEGTPYCFSRNDSLSSLDFDDDDVDLSREKAELRKGKESKESEAKVTSHTEL
TSNQQSANKTQAVPKHPINRGQPKPVLQKQSTFPQPSKDIPDRGAATDEKLQNFAIENTP
VCFSRNSSLSSLSDIDQENNNNKENEPIKETEPPDSQGEPSKPQASGYAPKSFHVEDTPV
CFSRNSSLSSLSIDSEDDLLQECISSAMPKKKKPSRLKADNEKHSPRNMGGTLAEDLTLN
LKDIQRPDSEHGLSPDSENFDWKAIQEGANSIVSSLHQAAAAACLSRQASSDSDSILSLK
SGISLGSPFHLTPDQEEKPFASNKGPRILKPGEKSTLEAKKLESENKGIKGGKKVYKSLI
TGKIRSNSEVSSQMKQPLQTNMPSISRGRTMIHIPGVRNSSSSTSPVSKKGPPLKTPASK
SPSEGQAATTSPRGAKPSVKSELSPVTRQASQTPGSNKGPSRSGSRDSTPSRPAQQPLSR
PMQSPGRNSISPGRNGISPPNKLSQLPRTSSPSTASTKSSGSGKMSYTSPGRQMSQQNLT
KQTGLSKNVSSIPRSESASKGLSQMSTSNGSNKKVELSRMSSTKSSGSESDRSERPVLVR
QSTFIKEAPSPTLRRKLEESASFESLSPSSRPDSPSRSQAQTPILSPSLPDMSLSTHSSV
QAGGWRKLPPNLSPTIEYNDGRPVKRHDIARSHSESPSRLPINRSGTWKREHSKHSSSLP
RVSTWRRTGSSSSILSASSESSEKAKSEDEKHVNSTSGTKQTKENQVSTKGTWRKMKESE
ISPTNSTSQTTSSGAANGAESKTLIYQMAPAVSKTEDVWVRIEDCPINNPRSGRSPTGNT
PPVIDTVSEKGNPNTKDSKDNQGKQNVSNGSAPVRTMGLENRLNSFIQVDAPDQKGTEGK
PGQSHPVAASETNESSIAERTPFSSSSSSKHSSPSGTVAARVSPFNYNPSPRKSSADSTS
ARPSQIPTPVNNNTKKRDSKSDNTESSGTQSPKRHSGSYLVTSV
NT seq
8595 nt
NT seq
+upstream
nt +downstream
nt
atggctgcagcttcatatgatcagttgttaaagcaagttgaggcactgaagatggagaac
tcaaatcttcgacaagagctagaagataattccaatcatcttacaaaactggaaactgag
gcatctaatatgaaggaagtacttaaacaactacaaggaagtattgaagatgaagctatg
gcttcgtctggacagattgatttattagagcgtctcaaagaacttaacttagatagcagt
aattttcctggagtgaaactacggtcaaaaatgtccctccgttcttacggaagccgagaa
ggatctgtatctagtcgttcaggagagtgcagtcctgttcctatgggttcatttccgaga
agagggtttgtaaatggaagcagagaaaataccggttatttagaagaacttgaaaaagag
agatcattgcttcttgctgaccttgacaaagaagaaaaggaaaaagactggtattatgct
caacttcagaatctcactaaaagaatagatagtcttcctttaactgaaaatttttcctta
cagacagatatgaccagaaggcagttggaatatgaagcaaggcaaatcagagttgcaatg
gaagaacaactgggtacttgccaggatatggaaaaacgagcacagcgaagaataaccaga
attcaacaaatagagaaggacatacttcgtatacgacagcttttacagtcccaagcaaca
gaagcagagaggtcatctcagagcaagcatgaagccggctcacatgaagctgagcgacag
aatgaaagtcaaggagtggcagaaatcaacatggcaacttcgggtagtggtcagggttca
actgcacgaatagatcacgaaacagccagtgttttgagttctagcagcacacattctgct
cctcgaaggctgacaagtcatctgggaaccaaggtggaaatggtgtattcattgttgtca
atgcttggtactcatgataaggatgatatgtcgcgaactttgctagctatgtctagctcc
caagacagctgtatatccatgcgacagtctggatgtcttcctctcctcatccagctttta
catggcaatgacaaagactctgtgttgttgggaaattcccggggcagtaaagaggctcgg
gccagggccagtgcagcactccacaacatcattcactcacagcctgatgacaagagaggc
aggcgtgaaatccgagtccttcatcttttggaacagatacgagcttactgtgaaacctgt
tgggagtggcaggaagcccatgaacaaggcatggaccaggacaaaaatccaatgccagct
cctgttgaacatcaaatctgtcctgctgtgtgtgttctaatgaaactttcgtttgatgag
gagcatagacacgcgatgaatgaacttggtaggaaggctacccggggcatttcatcacag
gagctagggcaggggctttcagggggactacaggccattgcagaattattgcaagtggac
tgtgaaatgtatggacttactaatgaccactacagtattaccttaagacgatatgcagga
atggctttgacaaacttgactttcggagatgtagccaacaaggctacactatgctctatg
aaaggctgcatgagagcacttgtggcccaactaaaatctgaaagtgaggacttacagcag
gttattgcaagtgttttgaggaatctgtcttggagagcagatgtaaatagtaaaaagact
ttgcgtgaagttggaagtgtgaaagcattgatggaatgtgctttggaagtgaaaaaggaa
tcaaccctcaaaagcgtattgagtgccttatggaatttgtcagcacactgcactgagaat
aaagctgatatatgtgccgtagatggtgcgcttgcatttttggttggcactctcacttac
cggagccagacaaatactttagctattattgaaagtggaggtgggatattacggaatgtg
tccagcttgatagctacgaatgaggaccacaggcaaatcctaagagagaataattgctta
caaaccttattacaacacttgaaatctcacagtttgacaatagtcagtaatgcatgcgga
accttgtggaatctctcagcaagaaatcctaaagatcaggaagcattatgggacatggga
gcagtcagcatgctcaagaacctcattcattcaaagcacaagatgattgctatgggaagt
gctgcagctttaaggaatctcatggcaaatagacctgcaaagtataaggatgccaatatc
atgtctcctggttcaagcttgccttctcttcatgtcaggaaacaaaaagccctagaagca
gaattagatgctcagcatttatcagaaacttttgacaatattgacaatttaagtcccaag
gcatctcatcgtagtaagcagagacacaagcaaaatctctatggtgactatgtttttgac
accaatcgacatgatgataataggtcagacaattttaatactggaaacatgactgtcttg
tcaccatatttaaatactacagtgttgcccagctcctcttcatcaaggggaagtttagat
agttctcgttctgagaaagatagaagtttggagagagaacgaggtattagcctagtcaac
taccacccagcaacagaaaatccaggaacctcttcgaagcgaggtttgcagatttctacc
actgcagcccagattgccaaagtcatggaagaagtatcagccattcatacctcccaggaa
gacagaagttctgggtctaccccggaactacattgtggaacagatgagaggaatgcacta
agaagaagctctaccacccacacacatgcaaactcttacaacttcaccaagtcagaaaac
tcaaacaggacatgtccagtgccatatgccaaagtagaatacaagagatcttcaaatgat
agtttaaatagtgtcagcagtagtgatggttatggtaaaagaggtcaaatgaaaccagtt
gaatcctattctgaagatgaggaaagtaaattttgcagctatggtcagtatccagctgac
ctagcccataaaatacatagtgcaaatcatatggatgataatgatggagaactagataca
ccaataaattatagtcttaaatattctgatgaacagttgaactccggaaggcaaagtcct
tcacagaatgaaaggtgggcaagacccaaacatataatagaagatgaaataaaacaaaat
gaggaaagacaatcaaggagtcaaagcacaacttatcctgtatatcctgagagcactgat
gataaacacctcaagttccaaccacattttggacagcaagaatgtgtttccccatatagg
tcaagagcagccaatggttcagaaacaaatcgagtaggttctaatcatggaattaatcaa
aatgtaaatcagtctttgtgtcaggaagatgactatgaagatgataagccaaccaactat
agtgaacgttactctgaggaagagcaacatgaggaagaagagagaccaaccaattatagc
ataaaatataatgaagaaaaacatcatgtggatcagcctattgattatagtttaaaatat
accacagacattccttcttcacagaaaccagcattttcattctcaaagaattcatctgga
cagagcactaaaactgaacacatctcttcaagcagtgagaatacagccacaccttcatct
aatgccaagaggcagaatcaactccatcacagttcagcacagagcaggagtggtcagacc
caaaaagccacctcttcctcttgcaaagttccctctatcaaccaagaaacaatacagact
tactgtgtagaagataccccaatatgtttttcaagatgtagttcattatcatctttgtca
tcagctgaagatgaaatagggtgtgatcagacaacacaagaagcagattctgctaatacc
ctacaaatagcagaaataaaagaaagcagtggaactagatcaactgaagattctgtgagt
gaagttccaacagtgtcacagcacattagaaccaaatccagcagactccaggcttctggt
ttatcttcagaatcaaccaggcacaaagctgttgaattttcttcaggggccaaatctcca
tcaaagagtggtgctcagacacctaaaagtccaccagagcactacgttcaggagactcca
ctcatgtttagcaggtgtacttctgtcagttcacttgatagttttgagagtcgttcaatt
gccagctccgttcagagtgaaccctgcagtggaatggtgagtggcattataagccccagt
gaccttccagatagccctggacaaaccatgccgccaagcagaagtaaaacccctcctccc
cctcctcctcagtcagctcagactaagcaagaagtacctaaaaataaagcacctagtgct
gagaagagagaaagtggccctaagcaagctgctgtaaatgctgcagtacagagggtccag
gttcttccagatgctgatgctttgttacattttgccacagaaagtactcctgatggattt
tcttgttcatctagcctgagtgctctgagcctcgatgagccatttattcagaaagatgtg
gaattaagaataatgcctccggttcaggaaaatgacaatgggaatgaaacagaaaatgag
cagcctgaagaatcaaatgaaagccagggaaaagaggcggaaaaacccaccgattctgaa
aaagatctgttagatgattcagatgatgatgatattgaaatactagaagagtgtattatt
tctgccatgccaacaagatcttcacgcaaagccaaaaagccagcccagacgtcttccaaa
ttacctccacctgtggcaaggaaaccaagtcagctgcctgtgtacaaacttctgccatca
caaaacagattacaagcacaaaagcatgttagttttacaccaggagatgatatgcctcgg
gtgtattgtgtagaagggacacctataaacttttccacagctacatctctaagtgatcta
acgatagaatcccctccaaatgagttagctgctggagaaggtgttagagcaggaacacag
tcaggtgaatttgaaaaacgagacaccattcctacagaaggcagaagtacagatgaggct
caaacagggaaagcctcatctgtaactgtacctgaactggatgacaataaaacagaagaa
ggcgatattcttgcagaatgcattaattctgctatgcccaaaggaaaaagtcacaagcct
ttccgtgtgaaaaagataatggaccaggtccaacaagcatctatgtcttcatctggaact
aacaaaaatcaattagatggtaagaagaagaaacctacttcaccagtaaaacctatacca
caaaatactgaatacaggacatgtgtaaggaaaaatacagactcaaaaaataatttaaat
gctgaaagaaatttctcagacaacaaagattcaaagaaacagaacttgaaaaataattcc
aaggacttcaatgataaggtcccaaataatgaagatcgagtcagaggaagttttactttt
gattcacctcatcattacacacctattgaaggcactccatactgtttttcacgaaatgat
tctttgagttctctagattttgatgatgatgatgtggacctttccagggaaaaggctgaa
ttaagaaaggggaaggaaagtaaggaatcagaagctaaagttaccagccacacagaacta
acctcaaaccaacaatcagctaataagacacaagctgttccaaaacatccaataaatcga
ggtcagcctaaacccgtgctgcagaagcaatccacttttccccagccctccaaagatata
ccagacagaggggcagcaacagacgagaaattacagaattttgctattgaaaacactccg
gtttgcttttcccgaaattcctctctaagttctcttagtgacattgatcaagaaaacaac
aacaacaaggaaaatgaacctatcaaagagacagagccccctgactcacagggagaacca
agtaaacctcaggcgtcaggttatgctcctaaatcatttcacgttgaagatacccctgtt
tgtttctcaagaaacagttctctcagttctctcagtattgattctgaagatgacctgttg
caggaatgtataagttctgcaatgccaaaaaagaaaaagccttcaagactcaaggctgat
aatgaaaagcatagtcccagaaatatgggtggcacattagcagaagatttgacactcaat
ttgaaagatatacagagaccagattcagaacatggtttatcccctgattcagaaaatttc
gattggaaagctattcaggaaggtgcaaattccatagtaagtagtttacatcaagctgct
gctgccgcatgtttatctagacaagcttcgtctgattcagattccatcctttccctgaaa
tcaggaatctctctgggatcaccatttcatcttacacctgatcaagaggaaaaacccttt
gcaagtaataaaggcccacgaattctaaaacctggggagaaaagtacattggaagctaag
aagttagaatctgaaaataaaggaataaaaggagggaaaaaagtttataaaagtttgatt
actggaaaaattcgatctaattcggaagtttcaagccaaatgaaacaaccccttcaaaca
aacatgccttcaatctctcgaggtaggacaatgattcatattccaggagttcggaatagc
tcttcaagtacaagtccggtttctaaaaaaggcccaccccttaagactccagcctccaaa
agccctagtgaaggtcaggcagctaccacttcccccagaggagccaagccatcagtgaag
tcagaattaagccctgttacgaggcaggcgtcccagacacctgggtcaaataaagggcct
tctagatcaggatctagagattccactccttcaagacctgcccagcagccattaagtaga
cctatgcagtctccagggcgaaactcaatttctcctggtagaaatggaataagtcctccc
aacaaattatctcaactgccaaggacgtcatcccctagtactgcttcaactaagtcctcg
ggttctgggaaaatgtcttacacatctcctggcagacagatgagccaacagaacctcacc
aaacaaacgggtttatccaagaatgtcagtagtatcccaagaagtgaatctgcctccaaa
ggactaagtcaaatgagtactagcaatggatccaataaaaaggtagaactttctagaatg
tcttcaactaaatcaagtggaagtgaatctgataggtcagagagacctgtattagtacgc
cagtcaactttcatcaaagaagctccaagcccaaccctaaggagaaaattggaggaatcc
gcttcatttgaatctctttctccatcttctagaccagattctcccagtaggtcccaggca
cagactccaattttaagtccttcccttcctgatatgtcgctgtctacacattcatctgtt
caggctggtggatggcgaaaactcccacctaatctcagtcccaccatagagtataatgat
ggaagaccagtaaagcgccatgatatagcacgctctcattctgaaagtccttccagactt
cccatcaataggtcaggaacctggaaacgtgagcacagcaaacactcatcatcccttcct
cgagtaagcacttggaggagaactggaagttcatcctcaattctttctgcttcatcagaa
tctagtgaaaaagcaaaaagtgaggatgaaaaacatgtgaactctacttcaggaaccaaa
caaactaaagaaaaccaagtatccacaaaaggaacatggagaaaaatgaaagaaagtgaa
atttctcccaccaatagtacttctcagaccacttcctcaggtgctgcaaatggtgctgaa
tcaaagactctgatttatcaaatggcacctgctgtttctaaaacagaggatgtttgggtg
agaattgaggactgccccattaacaaccctagatctggaagatctccaacaggaaatact
cccccagtgattgacactgtttcagaaaagggaaacccaaacactaaagattcaaaagat
aatcaggggaaacaaaatgtgagcaatggtagtgctcctgtacgcaccatgggtttggaa
aaccgcctgaactcctttattcaggtagacgccccagaccaaaaaggaactgagggaaaa
ccgggacaaagtcatcctgtcgctgcatcagagactaatgaaagttctatagctgaacgt
accccatttagttctagcagctcaagcaagcacagttcaccgagtgggactgttgctgcc
agagtgagtccttttaattacaacccaagcccaaggaaaagcagcgcagatagcacttca
gcccgaccatctcagatcccaacgccagtgaataacaacacaaagaaacgagattcaaaa
agtgacaatacagaatccagtggaactcaaagtcctaaacgccattctgggtcttacctt
gtgacatctgtttaa
DBGET
integrated database retrieval system