KEGG   Chlorocebus sabaeus (green monkey): 103247372
Entry
103247372         CDS       T04361                                 
Symbol
THOC2
Name
(RefSeq) THO complex 2
  KO
K12879  THO complex subunit 2
Organism
csab  Chlorocebus sabaeus (green monkey)
Pathway
csab03013  Nucleocytoplasmic transport
csab03040  Spliceosome
Brite
KEGG Orthology (KO) [BR:csab00001]
 09120 Genetic Information Processing
  09121 Transcription
   03040 Spliceosome
    103247372 (THOC2)
  09122 Translation
   03013 Nucleocytoplasmic transport
    103247372 (THOC2)
 09180 Brite Hierarchies
  09182 Protein families: genetic information processing
   03019 Messenger RNA biogenesis [BR:csab03019]
    103247372 (THOC2)
   03041 Spliceosome [BR:csab03041]
    103247372 (THOC2)
Messenger RNA biogenesis [BR:csab03019]
 Eukaryotic type
  mRNA surveillance and transport factors
   Transport factors
    TREX complex
     103247372 (THOC2)
Spliceosome [BR:csab03041]
 Complex C
  Other components
   EJC/TREX
    103247372 (THOC2)
 Other splicing related proteins
  Spliceosome associated proteins (SAPs)
   TREX complex
    103247372 (THOC2)
SSDB
Motif
Pfam: Tho2 THOC2_N Thoc2
Other DBs
NCBI-GeneID: 103247372
NCBI-ProteinID: XP_008017667
Ensembl: ENSCSAG00000018332
LinkDB
Position
X
AA seq 1642 aa
MAAAAVVVPAEWIKNWEKSGRGEFLHLCRILSENKSHDSSTYRDFQQALYELSYHVIKGN
LKHEQASNVLNDISEFREDMPSILADVFCILDIETNCLEEKSKRDYFTQLVLACLYLVSD
TVLKERLDPETLESLGLIKQSQQFNQKSVKIKTKLFYKQQKFNLLREENEGYAKLIAELG
QDLSGSITSDLILENIKSLIGCFNLDPNRVLDVILEVFECRPEHDDFFISLLESYMSMCE
PQTLCHILGFKFKFYQEPNGETPSSLYRVAAVLLQFNLIDLDDLYVHLLPADNCIMDEHK
REIAEAKQIVRKLTMVVLSSEKMDEREKEKEKEEEKVEKPPDNQKLGLLEALLKIGDWQH
AQNIMDQMPPYYAASHKLIALAICKLIHITIEPLYRRVGVPKGAKGSPVNALQNKRAPKQ
AESFEDLRRDVFNMFCYLGPHLSHDPILFAKVVRIGKSFMKEFQSDGSKQEDKEKTEVIL
SCLLSITDQVLLPSLSLMDCNACMSEELWGMFKTFPYQHRYRLYGQWKNETYNSHPLLVK
VKAQTIDRAKYIMKRLTKENVKPSGRQIGKLSHSNPTILFDYILSQIQKYDNLITPVVDS
LKYLTSLNYDVLAYCIIEALANPEKERMKHDDTTISSWLQSLASFCGAVFRKYPIDLAGL
LQYVANQLKAGKSFDLLILKEVVQKMAGIEITEEMTMEQLEAMTGGEQLKAEGGYFGQIR
NTKKSSQRLKDALLDHDLALPLCLLMAQQRNGVIFQEGGEKHLKLVGKLYDQCHDTLVQF
GGFLASNLSTEDYIKRVPSIDVLCNEFHTPHDAAFFLSRPMYAHHISSKYDELKKSEKGS
KQQHKVHKYITSCEMVMAPVHEAVVSLHVSKVWDDISPQFYATFWSLTMYDLAVPHTSYE
REVNKLKVQMKAIDDNQEMPPNKKKKEKERCTALQDKLLEEEKKQMEHVQRVLQRLKLEK
DNWLLAKSTKNETITKFLQLCIFPRCIFSAIDAVYCARFVELVHQQKTPNFSTLLCYDRV
FSDIIYTVASCTENEASRYGRFLCCMLETVTRWHSDRATYEKECGNYPGFLTILRATGFD
GGNKADQLDYENFRHVVHKWHYKLTKASVHCLETGEYTHIRNILIVLTKILPWYPKVLNL
GQALERRVHKICQEEKEKRPDLYALAMGVEVYLLPSGDYGRLCDHVPTGCYSGQLKSRKS
YMIPENEFHHKDPPPRNAVASVQNGPGGGPSSSSIGSASKSDESSTEETDKSRERSQCGV
KAVNKASSTTPKGNSSNGNSGSNSNKAVKENDKEKGKEKEKEKKEKTPATTPEARVLGKD
GKEKPKEERPNKDEKARETKERTPKSDKEKEKFKKEEKAKDEKFKTTVPNAESKSTQERE
REKEPSRERDIAKEMKSKENVKGGEKTPVSGSLKSPVPRSDIPEPEREQKRRKIDTHPSP
SHSSTVKVTAILPKVPLGSENYASSPVISIHFLQDSLIELKESSAKLYINHTPPPLSKSK
EREMDKKDLDKSRERSREREKKDEKDRKERKRDHSNNDREVPPDLTKRRKEENGTMGVSK
HKSESPCESPYPNEKDKEKNKSKSSGKEKGSDSFKSEKMDKISSGGKKESRHDKEKIEKK
EKRDSSGGKEEKKHHKSSDKHR
NT seq 4929 nt   +upstreamnt  +downstreamnt
atggcggccgcggctgtggtggttcccgcagagtggataaagaactgggagaaatcaggg
agaggcgaatttttgcatttatgtcggattctcagtgaaaataaaagccatgatagttcg
acgtatagagatttccagcaagctctctatgagttgtcatatcatgtaattaaaggaaat
ctaaagcatgaacaggcatctaatgttcttaatgacattagtgaatttcgtgaggatatg
ccctccattcttgctgatgtattctgcatattagatattgagacaaattgtttagaagaa
aaaagcaagagagactattttacacagttggtattagcatgtttgtatttagtttcagac
acagttttaaaggaacgcctggatccagaaacactggaatcattagggcttatcaaacaa
tcacagcaattcaatcaaaagtcagttaaaatcaagacaaaactcttttataagcagcaa
aaattcaatttgttaagagaagagaatgaaggttatgccaagctgattgccgaattgggg
caagatttatctggaagtattactagtgacttaatcttagaaaatatcaaatctttaata
ggatgctttaatctggatcccaatagagttttggatgtcattttagaagtgtttgaatgc
aggccagaacacgacgacttctttatatctttgttagaatcttacatgagtatgtgtgaa
ccgcaaacactgtgtcatattcttgggttcaaattcaagttttaccaggaaccaaatggc
gagacaccatcatctttatacagagttgcagcagtacttctgcaatttaatcttattgat
ttagatgatctttatgtacatcttcttccagctgataattgcattatggatgaacacaaa
cgagaaattgcagaagctaagcaaattgtcagaaagcttacgatggttgtgttgtcttct
gaaaaaatggatgagcgagagaaagaaaaggaaaaagaagaggagaaagtagagaagcca
cctgataaccaaaaacttggcttgttggaagccttattaaagatcggtgattggcaacat
gcacagaacattatggatcagatgcctccatactatgcagcttcacacaagctaatagcc
cttgctatttgcaagctcattcatataactattgagcctctctaccgaagagttggtgtt
cctaaaggtgctaaaggctcacctgttaatgctttgcaaaacaagagggcaccaaaacaa
gctgagagctttgaagatttgaggagagacgtgttcaatatgttctgttaccttggtcct
cacctttctcacgatcccattttatttgcaaaagtggtgcgcataggcaagtcatttatg
aaggagtttcagtctgatggaagcaaacaagaagataaagaaaaaacggaagttatcctt
agctgtttgcttagcattactgaccaggtactacttccatctctttctttgatggactgc
aatgcttgtatgtctgaggaactatggggaatgtttaaaacatttccatatcagcataga
taccgtctgtatggccagtggaagaatgaaacttataacagtcacccacttttagtaaaa
gttaaagctcaaacaatagacagagccaaatatatcatgaagcgcctaaccaaggaaaat
gtgaagccttctggaagacaaattgggaagttgagccacagcaatccaaccattttgttt
gattatatcttgtcacaaatacaaaagtatgataacttaataacacctgtagtagattca
ttgaaatacctcacttcattgaattatgatgtcttggcctattgtatcattgaagcttta
gctaatccagaaaaggaaagaatgaaacatgatgacacaaccatctcaagctggcttcag
agtctggctagtttctgtggtgcagtttttcgtaaatatccaattgatcttgctggtctt
cttcagtatgttgccaatcagctaaaggcgggcaaaagttttgacctgcttatattgaaa
gaagtggtgcaaaaaatggcaggaatagaaattacagaggaaatgacaatggagcaacta
gaggctatgactggtggagagcagctaaaagctgagggtggttattttggtcagatcaga
aacactaaaaaatcctctcagagattaaaggatgctctattggaccatgatcttgccctt
cctctctgtctgcttatggctcagcagagaaatggggtaatctttcaggaaggtggagag
aaacatttaaaacttgtgggaaagctctatgaccagtgtcatgataccctggtgcagttt
ggtgggtttttagcatctaatctgagcacagaagattatataaagcgagtgccttcaatt
gatgtactctgtaatgaatttcatacaccccatgatgcagcatttttcctgtctaggcca
atgtatgcgcatcatatttcgtcaaagtatgatgaacttaaaaaatcagaaaagggaagt
aaacagcaacataaagttcataagtacattacatcatgtgagatggtgatggcacctgtc
catgaagcagtggtctccttacatgtttccaaagtctgggatgacatcagccctcaattc
tatgccacattctggtcattgacaatgtatgaccttgcagttccacacaccagctatgaa
cgagaagtcaataaacttaaagtccagatgaaagcaattgatgacaatcaggaaatgccc
ccaaataaaaagaaaaaagagaaggagcgctgtactgcccttcaggacaagcttcttgaa
gaagaaaagaaacagatggaacatgtacagagagttctacagagattgaaactggaaaag
gacaactggcttttagcaaaatctaccaaaaatgagaccatcacaaaatttctacagctg
tgtatatttcctcgatgtattttttcagcaattgatgccgtttactgtgctcgttttgtt
gaattggtacatcaacagaaaactccaaatttttccacacttctttgctatgatcgagtt
ttctctgacataatttacacagttgcaagctgtactgaaaatgaagccagtcgatatgga
aggtttctgtgctgcatgttagagactgtgaccaggtggcatagtgatagagccacatat
gaaaaggaatgtggaaactatccaggattccttaccatattacgggcaactggttttgat
ggtggaaataaggctgatcaattagactatgaaaattttcgacatgttgtacataaatgg
cattacaaactaaccaaggcatcggtacattgccttgaaacaggcgaatatactcacatc
aggaatatcttgattgtgctaacaaaaatacttccttggtacccaaaagttttgaatctg
ggtcaagctttggaaagaagagtacacaaaatctgccaagaagaaaaagagaagaggcca
gatctatatgcattggctatgggtgttgaagtttatcttctgccttcaggagattatggg
agattatgcgatcatgttcccacaggctgctactctgggcagttgaaaagtagaaagtca
tacatgatacctgaaaatgagtttcatcacaaagacccccctccgaggaatgcagttgcc
agtgtgcaaaatgggcctggtggtgggccttcatcatcatcaataggaagtgcatctaaa
tcggatgaaagcagtactgaggagactgataaatcaagggagagatctcagtgtggtgtg
aaagctgttaataaagcttctagcaccacacctaaagggaattcaagcaatggaaatagt
ggctctaacagcaacaaagctgttaaagaaaatgacaaagaaaaagggaaagagaaagaa
aaagagaaaaaagaaaagactccagctactactccagaggccagggtacttggtaaagat
ggtaaagaaaaaccaaaggaagagcggccaaataaagatgaaaaagcaagagagaccaag
gaaagaacgccaaagtctgacaaagagaaagaaaaattcaagaaggaagaaaaagctaaa
gatgagaaatttaagaccactgtccccaatgcagaatcaaaatcaactcaagaaagggaa
agagagaaggagccatccagagaaagagatatagcaaaggaaatgaaatcaaaggaaaat
gttaaaggaggagaaaaaacgccagtttctgggtccttgaaatcacctgttccccgatca
gatattccagagcctgaaagggaacaaaaacgccgcaaaattgatacccacccttctcca
tcacattcctccacagtaaaggttacagccatacttcccaaagttcctctgggttctgag
aactatgccagctcacctgtcatctccattcattttctacaggacagtctcatcgaactc
aaggaatcttcagcaaagctctacattaatcatactcctccaccactgtccaagagtaag
gagagagaaatggacaagaaagatttggacaagtcaagggaaagatccagagaaagagag
aaaaaagatgaaaaggacaggaaagagcggaaaagggatcactcaaacaacgaccgtgaa
gtgccaccggacttaaccaagcgacgtaaagaggagaatggaacaatgggggtttcaaaa
cataaaagtgaaagtccttgtgaatctccttatccaaatgagaaagacaaggaaaaaaat
aagtcaaaatcttcaggcaaagaaaaaggcagtgattcatttaaatctgagaagatggat
aaaatctcctccggtggcaaaaaggagtccaggcatgataaagaaaagatagaaaagaaa
gagaaacgggacagttcaggaggaaaggaagagaagaaacatcataagtcctcggacaag
cacagataa

DBGET integrated database retrieval system