KEGG   Theobroma cacao (cacao): 18588401
Entry
18588401          CDS       T02994                                 
Name
(RefSeq) beta-galactosidase
Organism
tcc  Theobroma cacao (cacao)
SSDB
Motif
Pfam: Glyco_hydro_35 BetaGal_gal-bd GHD Gal_Lectin Glyco_hydro_42 BetaGal_ABD2
Other DBs
NCBI-GeneID: 18588401
NCBI-ProteinID: XP_017983039
LinkDB
Position
9:complement(4652303..4659294)
AA seq 839 aa
MWNRDMLSRVTVFMLWLLFSSWVFSVSATVSYDSKAIIINGRRRILLSGSIHYPRSTPQM
WPDLIAKAKEGGLDVIQTYVFWNGHEPSPGKYYFDDRYDLVRFIKLVQQAGLYVHLRIGP
YVCAEWNFGGFPVWLKYVPGIVFRTDNGPFKAAMQKFTEKIVSMMKAEKLFQTQGGPIIM
SQIENEFGPVEWEIGAPGKAYTKWAAQMAVGLGTGVPWIMCKQDDAPDPVINTCNGFYCE
NFTPNAKYKPKMWTENWTGWFTEFGGAVPTRPAEDIAFSVARFIQNGGSFVNYYMYHGGT
NFGRTAGGPFIATSYDYDAPIDEYGLPREPKWGHLRDLHKAIKLSEPALVSADPTVTSLG
SNQEAHVFKAKSGACAAFLANYDTKYSVKVTFGNVQYDLPAWSISILPDCKTAVFNTARL
GAQSSQKKMETVNSAFSWQSYNEESPSADDQDATVKDGLLEQIYVTRDASDYLWYMTDVQ
IDPNEGFLTSGQDPSLTIWSAGHALHVFINGQLSGTAYGELDNPKLTFSKNVKLRAGINK
ISLLSIAVGLPNVGVHFETWNAGVLGPVTLKGLNEGSRDLSKQKWSYKIGLKGEALSLHT
VTGSSSVEWVKGSLLVKKQPMTWYKTTFNAPGGNEPLALDMSSMGKGQIWINGQSIGRHW
PGYIARGACGACDYAGTYSDKKCRTNCGEPSQRWYHVPRSWLNPSGNLMVVFEEWGGDPS
GISLVKRTTGSVCADIFEAQPTMKNWGMLASGKINRPKAHLWCPPGQKISEIKFASYGMP
EGTCGSFSEGSCHAHRSYDAFQKNCIGKQSCSVTVAPEVFGGDPCPDSMKKLSVEAACN
NT seq 2520 nt   +upstreamnt  +downstreamnt
atgtggaacagagacatgttgtcaagggtcaccgtgttcatgttatggctattgttttct
tcttgggttttttcagtttcagctactgtttcttatgacagtaaagctatcatcattaat
ggcaggagaaggattcttctttctggctccattcattaccccagaagcactccgcagatg
tggcctgatcttatagcaaaggctaaagaaggaggcttggatgttatacaaacttatgtt
ttctggaacggacacgagccttctcctggaaaatattattttgacgataggtatgatctg
gttcgatttattaagctggtgcaacaggctggactttatgttcatctccggattggtccc
tatgtttgtgctgaatggaactttgggggatttcctgtgtggctgaaatatgtccccggc
attgttttcaggacagacaatggacctttcaaggctgcaatgcaaaaattcacagagaag
atagtcagcatgatgaaagcagaaaagctgtttcagactcaaggaggtccaataattatg
tctcagattgaaaatgaatttggtcctgttgaatgggaaattggtgctccaggtaaagct
tacaccaaatgggctgcacaaatggcagtgggacttggcactggagtcccatggattatg
tgcaagcaagatgatgctcctgaccctgtgataaacacctgcaatggattctactgtgaa
aattttactcccaacgcgaaatacaaaccaaagatgtggacagagaactggactggctgg
tttacagagtttggtggtgctgtccctaccagacctgcagaagacatagcattttcagtt
gcacgattcattcagaatggtggttcatttgttaattattatatgtaccatggaggaacc
aattttgggcggacagctggtggtcccttcattgctaccagctatgactatgatgctcct
attgatgaatatgggctaccaagggaaccaaaatggggacatctgagagatttgcataaa
gccatcaaattaagtgaaccagctttagtttctgcagatcctaccgtgacttcacttgga
agtaatcaggaggctcacgtattcaaggcaaagtctggtgcatgtgctgcattccttgca
aactatgacacaaaatactctgtaaaagtaactttcggaaatgtgcaatatgacttacca
gcttggtccatcagcatccttcccgactgtaaaactgctgttttcaacactgccaggctt
ggtgcccaaagctcacaaaagaagatggaaactgtaaacagcgcattctcttggcaatca
tataatgaagaaagcccctctgctgatgatcaggatgcaactgtaaaagacgggctcttg
gaacagatatatgtcaccagagatgcttcagattatttgtggtacatgacagatgtacaa
atagatcctaatgaaggatttttgacaagtggacaagatccttctctgaccatttggtca
gcaggtcatgctttgcatgttttcattaatggtcaattatccgggactgcgtatggggaa
ttggacaatccaaaattaacattcagcaaaaatgtcaaactacgagctgggattaacaag
atttctttattaagcattgcagtgggacttccaaatgttggcgttcattttgagacatgg
aatgctggggttctaggtcctgttacattgaagggtctcaatgaggggtcaagagactta
tctaagcagaaatggtcttacaagattggtctaaaaggggaggccttaagccttcatacc
gttactggaagctcctctgttgaatgggtcaaaggatcgctattggtaaagaaacaacct
atgacttggtacaagacaacttttaatgcaccgggtggcaatgaaccattggctttagat
atgagtagcatgggaaaagggcaaatatggataaatggccagagcattggacgccactgg
cctggatatatagcacgtggtgcgtgtggtgcttgtgattatgctggaacttatagtgat
aagaaatgccgaactaattgtggagagccgtctcaaagatggtaccatgttccacgctca
tggctgaacccaagtggaaacctcatggttgtgtttgaagaatggggtggtgatccatct
ggaatttctttggtcaaaagaacaaccggaagtgtttgtgctgatatttttgaagcgcaa
ccaacaatgaagaattggggaatgctagcttctggcaaaatcaatcgacccaaagcccat
ttgtggtgtcctcctgggcagaaaatttctgaaataaagtttgctagttatggaatgccc
gaggggacttgtggaagctttagtgagggaagctgccatgcccacaggtcatatgatgcg
tttcaaaagaattgcattggaaaacaatcatgttcggtaactgtggctccagaagttttt
ggaggagatccatgtccagatagcatgaagaagctctcagttgaagctgcctgcaactga

DBGET integrated database retrieval system