KEGG   Theobroma cacao (cacao): 18597540
Entry
18597540          CDS       T02994                                 
Name
(RefSeq) uncharacterized LOC18597540
  KO
K06100  symplekin
Organism
tcc  Theobroma cacao (cacao)
Pathway
tcc03015  mRNA surveillance pathway
Brite
KEGG Orthology (KO) [BR:tcc00001]
 09120 Genetic Information Processing
  09122 Translation
   03015 mRNA surveillance pathway
    18597540
 09180 Brite Hierarchies
  09182 Protein families: genetic information processing
   03021 Transcription machinery [BR:tcc03021]
    18597540
   03019 Messenger RNA biogenesis [BR:tcc03019]
    18597540
Transcription machinery [BR:tcc03021]
 Eukaryotic type
  RNA polymerase II system
   Other transcription-related factors
    Transcription termination factor
     18597540
Messenger RNA biogenesis [BR:tcc03019]
 Eukaryotic type
  mRNA processing factors
   3' end processing
    Cleavage stimulation factor (CSTF) complex
     18597540
    U7-dependent histone pre-mRNA processing factors
     18597540
SSDB
Motif
Pfam: Symplekin_C SYMPK_PTA1_N HEAT_2 HEAT_EZ
Other DBs
NCBI-GeneID: 18597540
NCBI-ProteinID: XP_007026695
UniProt: A0AB32V1F5
LinkDB
Position
5:complement(731626..742172)
AA seq 1336 aa
MVGIMNPVSREKLASLFNSVKLAVDLASKLDLSHQLKQTLLEEDAAALSEFLPRVFDLYS
DPSGPVRKLATEIIGEIGVKNLDFVPEIAPFLITVLEDATPAVARQSIACSIDLFRLTLE
KIAIQGLYSSELDSDLEASWSWMLKLKEKIYSIAFQPGSGGIRLVALKFVEAVILLYTPD
PTGSPEAPPDEGTPVEFNATWLCGGHPLLNVGDLSIEASQQLGLLLDQLRFPIVKSLTNS
VIVVLINSLSGIAKKRPAYYGRILSVLLGLDSPSVVIKGVHVYGAHHALKNALLSCLKCT
HPSAAPWRDRVLGALREMKAGGLAEPALNQVLKTNGSVEEGKDDSSVIKEEKPLVRARDA
AGSNMGRKRSVTEDSSDLAENDDVPGKRVRSTPSVSEESTKELNRNTTTSQGDICSTQPT
INKGAVDTGPVQQLVAMFGALVAQGEKAVGSLGILISSISADLLAEVVMANMRNLPPDHP
HTDGDDELLENMSIVGSDTQAKYPPSFLADVVSLSSTFPPIASLLNSQLSVSNKIVKTEG
EEEVDVVAGPNNAVAYAGMAHEAEHALLATDLPVSSDIVLPGKEKIDLPPPSDIHDVGYL
ESEIPGLDSSVRTDGLSDTQAASSLVSTDLEDASQEQVTSFGGRSPLHVLPSISTDRSEE
LSPKAAVMDSNSLISSTATSVVSSYIALPKMSAPVVNLSDDQKDDLQKLAFIRIIEAYKQ
IALSGSLQVCFSLLAYLGVELLSELDLQKLLREHVLSDYINHQGHELTLRVLYRLFGEAE
EESDFFSCTTAASAYETFLLAVAETLRDSFPPSDKSLSKLLGEAPRLPKSVLNLLECLCS
PGISEKAENESQSGDRVTQGLSTVWSLILLRPPIRDVCLKIALKSAVHHLEEVRMKAIRL
VANKLYPLSSIAQQIEDFAREMLLSVVNGDGIERTDAEGSITEPHKESDSEKPSNEHQSM
SSIGKDISADVHQSETSQSVSSLSVPEAQQSMSLYFALCTKKHSLFRQIFVIYKSASKAV
KQAIHRHIPILVRTMGSSSDLLEIISDPPSGSESLLMQVLHTLTDGTVPSAELMFTIKKL
FDSKLKDVEILIPVLPFLPRDEVLLLFPHLVNLPLDKFQAAVTRLLQGSSHSAPALSPAE
VLIAIHGIDPERDGIPLKKVTDACNACFEQRQIFTQQVLAKVLNQLVEQIPLPLLFMRTV
LQAIGAFPALVDFIMEILSRLVSKQIWKYPKLWVGFLKCALLTKPQSFSVLLQLPPPQLE
NALNRTAALKAPLVAHASQQNIRTSLPRSILAVLGLSLDSQNSSQAQTSQAHTGDTSNSD
KDAVAVEKSKESSSAS
NT seq 4011 nt   +upstreamnt  +downstreamnt
atggtggggatcatgaatccagtttcgagagaaaagctcgcgagtttgttcaactcagtg
aagctagctgtagacttagcttcgaagctggatctttctcatcaattgaagcaaactttg
ttggaggaagacgccgctgccctctccgagttccttcctcgcgtcttcgacctttactct
gacccgtccggtccggttcgcaagctcgccaccgagattattggtgaaattggagtgaaa
aacctcgattttgtacctgaaattgcaccgtttctgataactgttttggaagatgctaca
cctgctgttgctcgacaatctattgcttgcagcattgatttgtttcgtcttacactcgaa
aaaattgcaattcagggtctatactctagtgaattggacagtgatcttgaagcatcatgg
tcatggatgttaaagctaaaggagaagatatactctatagcttttcagccaggaagtggt
gggataagattggtggcactgaagtttgttgaagcagttattctgctttatactccagat
cctactggctctccagaggcccctcctgatgaaggaactcctgtagaatttaatgcaact
tggctttgtgggggccatcctttactcaatgttggggatttgtcgattgaagctagccaa
cagttgggtttgttgcttgatcagcttagatttccaatagtaaaatctctcaccaactcg
gtgattgttgtgcttattaacagtctttcaggtattgcaaagaaacggcctgcatattat
ggacgtattctatcagttttgcttggtttggattccccaagtgttgttatcaaaggggtt
catgtctatggagcacatcatgctttaaaaaatgccttactctcctgcttgaaatgtaca
cacccaagtgctgcgccgtggagggatcgtgtacttggtgccttgagagagatgaaagct
ggaggactagcagagccagctctaaaccaagttcttaaaactaatggaagtgtggaagag
ggaaaagatgattcttcggttattaaggaagaaaaacccttggttagagcacgtgatgct
gctggtagcaatatggggaggaaaagatctgtaactgaggacagtagtgatttggctgaa
aatgatgatgtgcctggcaaacgtgtcaggtcaacacctagtgtctcagaagagtcaaca
aaagagttaaacaggaatactactacgtctcagggtgacatttgttcaactcaaccaacc
attaataaaggagctgttgatactggaccggtccagcaacttgttgccatgtttggtgcc
ttggttgctcaaggagaaaaagctgtgggatctttggggattcttatttcaagcatatct
gctgacttgctggctgaggtagtgatggctaacatgcgtaaccttcctcccgatcatcct
cacactgatggggatgatgaattgctggagaacatgagcatagttggaagtgacactcaa
gccaagtatccaccatcattcttagctgatgttgtttcgttgtcgagtactttcccccca
atagcctcactgctaaactcccagctgtctgtttccaataaaatagtgaaaacagaaggg
gaggaagaagttgatgttgtggctggccctaataatgctgttgcatatgctggcatggct
cacgaagccgaacatgctcttttggctactgatttaccagtttcttctgacattgttttg
cctggaaaggagaagattgatctacctcctccatctgatatccatgatgtggggtatctt
gaaagtgagatacctggcttggattcttctgttcgtactgatggattgtcagacacccaa
gctgcttcttcattggtctctactgatctagaagatgctagtcaagagcaagttactagt
tttggtggaaggtcaccactgcatgtgcttccatcaatttcaacagataggtctgaggag
cttagtccaaaagcagctgttatggattccaacagcctgatttcctcaacagcaacttct
gttgtttcgtcttatattgccttgcccaagatgtcagcacctgttgtcaatctttctgat
gatcagaaggatgatttgcaaaagttggcttttatacgtatcattgaggcatataagcaa
atagctctatctggaagtttgcaagtttgcttttctctgcttgcttatctaggagtcgag
ttgctgtccgagttagacctacagaaactgctacgagaacacgttttgtcagattatata
aatcaccagggacacgagttgacattgcgtgtcctctacaggttatttggcgaggcagaa
gaggaaagcgatttcttctcatgtacaactgctgcttctgcatatgaaacattccttctg
gctgttgccgaaactcttagggactcttttccaccttcagacaaatctttaagtaaactt
cttggtgaagctcctcgcctgccaaagtcagttttgaatttattagagtgtttgtgttct
cctggaatctcagagaaagctgagaatgagtcacaaagcggagatagagttactcaaggc
ctcagcactgtatggagcctaattctactgagacctcccattcgtgatgtgtgcttgaaa
attgctttgaagagtgcagttcatcacttggaggaagtccgaatgaaagcaatacgtctg
gtagcaaataagctttatcctttatcatccattgctcaacaaatagaagattttgcaagg
gaaatgctgctctctgtagtaaatggcgatggtatagaaagaacagatgctgaaggatcc
attactgaaccacacaaggaatctgattcagaaaagccatcaaatgagcatcagtcaatg
agttccattggtaaagacatctctgctgatgttcatcagtcagaaacatctcagagtgtg
tcatccctttctgttcctgaggctcaacagagcatgtcactttattttgctctgtgtaca
aagaagcactctctctttcgccaaatatttgttatctacaagagtgcatcaaaggcagtt
aagcaggcaatccatcgtcatattcccatactagttcgtaccatgggctcgtcatctgac
ctccttgaaattatttcagatcctcctagtggaagtgagagtcttcttatgcaggttttg
catacactgacagatgggacagttccttctgcagaattaatgttcaccattaagaagtta
tttgattcaaaactaaaggatgttgaaattctaattccagtattgccattcctaccaaga
gatgaggttttgctgctctttccacatcttgtaaatcttccgctggataagttccaagct
gcagttacccggttgctacagggatcttctcattctgctccagcgctctctccagccgaa
gtgttaatcgctatccatggtattgatcctgaaagagatggaattcccttaaagaaggtc
acggatgcatgtaatgcttgttttgagcaacggcagatattcacccagcaagttcttgcg
aaggtcttaaatcaattggttgagcaaattcctcttcctttgttgtttatgcgtacagta
ttgcaagccattggtgcttttcctgcactggtggactttataatggagatcctttctcgt
cttgtaagcaagcagatatggaagtatccaaagttgtgggtaggatttttgaaatgtgca
ttattgacaaagcctcaatctttcagcgtgttgcttcagctacctccaccgcagctggaa
aatgcactgaatagaactgcagcactcaaagctcctttggttgctcatgctagccaacag
aatatccgaacttcacttccaaggtctatactggcagttttgggactttctctggactct
cagaactctagtcaggcacaaacaagtcaggctcacaccggagatactagtaactcagac
aaggatgcagtggcagtggagaaatctaaagaatcatctagtgctagctga

DBGET integrated database retrieval system