KEGG   Theobroma cacao (cacao): 18604054
Entry
18604054          CDS       T02994                                 
Name
(RefSeq) DNA mismatch repair protein MSH2
  KO
K08735  DNA mismatch repair protein MSH2
Organism
tcc  Theobroma cacao (cacao)
Pathway
tcc03430  Mismatch repair
Brite
KEGG Orthology (KO) [BR:tcc00001]
 09120 Genetic Information Processing
  09124 Replication and repair
   03430 Mismatch repair
    18604054
 09180 Brite Hierarchies
  09182 Protein families: genetic information processing
   03400 DNA repair and recombination proteins [BR:tcc03400]
    18604054
DNA repair and recombination proteins [BR:tcc03400]
 Eukaryotic type
  SSBR (single strand breaks repair)
   MMR (mismatch excision repair)
    Mismatch and loop recognition factors
     18604054
Motif
Pfam: MutS_V MutS_III MutS_IV MutS_II MutS_I
Other DBs
NCBI-GeneID: 18604054
NCBI-ProteinID: XP_017973885
UniProt: A0AB32W4E4
Position
3:1745010..1753015
AA seq 942 aa
MDENFDERNKLPELKLDAKQAQGFLSFFKTLPNDARAVRFFDRRDYYTAHGENATFIAKT
YYRTTTALRQLGSGSDGLSSVTVSKNMFETIARDLLLERTDHTLELYEGSGSHWRLMKSG
SPGNLGSFEDVLFANNEMQDTPVVVALLPNFRENGCTIGFSYVDLTKRVLGLAEFLDDSH
FTNTESALVALGCKECLLPIESGKASECRTLNDALTRCGVMVTERKKTEFKARDLVQDLG
RLIKGSIEPVRDLVSGFEFAPAALGALLSYAELLADEGNYGNYSIRRYNLGSYMRLDSAA
MRALNVLESRTDANKNFSLFGLMNRTCTAGMGKRLLHMWLKQPLLDVSEINSRLDLVQAF
VEDTELRQALRQHLKRISDIERLMRNIEKTRAGLQHVVKLYQSSIRIPYIKSALEKYDGQ
FSSLIKERYLDPFELFTDDDHLNKFISLVETSVDLDQLENGEYMISPSYDDALAALKNEQ
ESLELQIHNLHKQTAIDLDLPVDKALKLDKGTQFGHVFRITKKEEPKVRKKLSTQFIILE
TRKDGVKFTSTKLKKLGDQYQKILEEYKNCQKELVNRVVQTTATFSEVFEPLAGLLSELD
VLLSFADLASSCPTPYTRPEITPADVGDIVLEGSRHPCVEAQDWVNFIPNDCRLVRGKSW
FQIITGPNMGGKSTFIRQVGVNILMAQVGSFVPCEKASISVRDCIFARVGAGDCQLRGVS
TFMQEMLETASILKGATDKSLIIIDELGRGTSTYDGFGLAWAICEHIVEVIKAPTLFATH
FHELTALAHENVNDEPQAKQIVGVANYHVSAHIDSSSRKLTMLYKVEPGACDQSFGIHVA
EFANFPESVISLAREKAAELEDFSPTSIISSDARQEEGSKRKRECDPIDMSRGAAKAHKF
LKDFADLPLESMDLKQALQQVNKLRGDLEKDAVNCNWLRQFL
NT seq 2829 nt
atggatgaaaattttgatgaacgaaacaagcttccagagctcaaactagatgctaagcag
gctcaagggtttctctctttcttcaaaaccctacccaatgatgcaagggcagttcggttt
tttgatcgccgggattattatactgctcatggtgaaaatgcaacctttattgcaaagaca
tattaccgcactactactgctctccggcaactgggtagcggctctgatggcctttcaagt
gtaactgttagtaaaaacatgtttgaaacaattgctcgtgatcttctcctggagagaaca
gaccacactctggagctctatgaaggcagtggctcccattggaggttaatgaaaagtggc
agtcctgggaatctgggcagttttgaagatgttctgtttgccaacaatgagatgcaggac
acacctgttgttgttgcattgcttcctaacttccgtgaaaatgggtgcactattgggttc
agttatgttgatttaacgaagagggtacttggattggctgaatttcttgatgatagtcac
tttacaaatacagagtcggctttggttgctctcggttgcaaggaatgccttttgcccata
gagagtggaaaagccagtgaatgtagaactctcaatgatgctttgaccagatgtggtgtt
atggtaactgagagaaagaaaactgagtttaaagcaagggatctggttcaggatcttggc
agactaatcaaaggttccattgaaccagttcgagacttggtttctggatttgaatttgca
cctgctgctttaggagccttactatcttatgcagaactactggcagatgaaggcaattat
ggaaattatagcatccggagatacaatcttggcagctacatgagattagattctgctgct
atgagggcattgaatgtcctagaaagcagaactgatgcaaacaaaaattttagtttgttt
ggtcttatgaatagaacctgtaccgctgggatgggtaagcggttgcttcatatgtggcta
aaacagcctttgttagatgtaagtgagataaactcaaggctagatttggtacaagctttt
gtggaggataccgagcttcgccaagctttgaggcagcatctgaaaagaatttcagatatt
gagcgacttatgcgcaatattgaaaagacaagagctggtttgcagcatgttgtaaaactt
tatcagtcaagtataagaattccctacattaaaagtgccctggaaaaatatgatggacag
ttttcatccttgatcaaggaaagatatttggatccttttgagctcttcactgacgacgat
catttgaacaagttcatttctcttgttgaaacttctgtcgacctagatcaacttgaaaat
ggggaatacatgatttcacctagttatgatgatgccctagctgcactaaaaaatgagcag
gagtcactagagctccaaatacacaacttacataaacaaactgctattgatcttgatctg
ccagtagacaaggcattaaagttagataagggcacacagtttggacatgttttcagaatt
acaaagaaagaagagccaaaagtaagaaaaaagctctccacccaatttattattcttgaa
actcgaaaggatggagtaaaattcactagcacaaagcttaaaaagttgggggaccagtac
caaaagatacttgaggagtataagaactgtcaaaaagaactagtcaaccgagtggttcaa
actacagcaactttctctgaggtgtttgagcccttagctgggttgctctccgaattggat
gtcttgcttagttttgctgatttagcttctagttgccctaccccatacacaagacctgaa
attactccagcggatgtaggagatattgtattagaaggaagtagacatccctgtgtggag
gcgcaagactgggtgaattttataccaaatgattgtagacttgtaagaggaaagagctgg
ttccagatcatcactgggcctaatatgggtggaaaatcaacattcatccggcaggttggt
gtcaacattctgatggcacaagtaggttcttttgttccttgtgaaaaagctagcatttct
gtccgagactgcatttttgcccgtgttggtgctggtgactgccaactacgtggagtttct
acctttatgcaagaaatgcttgaaactgcatcaatattgaaaggagctactgacaagtca
ttgataatcattgatgagttggggcgaggaacatcaacctatgatggatttggtttagca
tgggccatatgcgagcatattgttgaagtgatcaaagcacctactttgttcgctacccac
ttccatgaactgactgcattagctcatgaaaatgtcaatgatgagccacaggcaaaacag
attgttggtgtggcaaactatcatgttagtgctcacattgactcatcaagtcgcaaattg
acaatgctgtacaaggttgagccaggtgcctgtgatcaaagttttggtatccatgtagca
gaatttgccaactttcctgaaagtgttatatcccttgcaagagaaaaggctgctgaattg
gaagatttctcgccaacttcaatcatttccagtgatgctagacaagaggaaggttctaaa
aggaagcgagagtgtgatcctattgacatgtctagaggtgctgcaaaggctcacaagttc
ttgaaggactttgctgatttgccattagagtctatggacctgaagcaggctctgcaacaa
gtaaacaagctaaggggtgacttagaaaaggatgcagtaaactgtaactggctccggcaa
ttcctttag
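
The lengths given in the AA seq and NT seq headers can be cross-checked: a coding sequence that ends in a stop codon should span three nucleotides per residue plus three for the stop. The following is a minimal Python sketch of that check, together with parsing of the Position field. The function names are hypothetical and not part of any KEGG tooling, and the position format is assumed to be `chromosome:start..end` as it appears in this entry.

```python
def cds_length_consistent(nt_len, aa_len):
    """Return True if nt_len equals 3 nt per residue plus one 3-nt stop codon."""
    return nt_len == 3 * (aa_len + 1)


def parse_position(position):
    """Split 'chrom:start..end' into (chrom, start, end).

    Assumption: a simple span with no 'complement(...)' or 'join(...)'
    wrappers, as in this entry.
    """
    chrom, span = position.split(":", 1)
    start, end = span.split("..", 1)
    return chrom, int(start), int(end)


# Values copied from the Position, AA seq and NT seq fields of this entry.
chrom, start, end = parse_position("3:1745010..1753015")
span_len = end - start + 1  # coordinates are treated as inclusive
print(cds_length_consistent(2829, 942), chrom, span_len)
```

For this entry the check holds (2829 = 3 × 943). The genomic span comes to 8006 bp, longer than the 2829 nt coding sequence, a difference that presumably reflects introns and untranslated regions.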
