KEGG   Solea solea (common sole): 131458073
Entry
131458073         CDS       T10853                                 
Symbol
thoc2
Name
(RefSeq) THO complex subunit 2 isoform X1
  KO
K12879  THO complex subunit 2
Organism
ssoe  Solea solea (common sole)
Pathway
ssoe03013  Nucleocytoplasmic transport
ssoe03040  Spliceosome
Brite
KEGG Orthology (KO) [BR:ssoe00001]
 09120 Genetic Information Processing
  09121 Transcription
   03040 Spliceosome
    131458073 (thoc2)
  09122 Translation
   03013 Nucleocytoplasmic transport
    131458073 (thoc2)
 09180 Brite Hierarchies
  09182 Protein families: genetic information processing
   03019 Messenger RNA biogenesis [BR:ssoe03019]
    131458073 (thoc2)
   03041 Spliceosome [BR:ssoe03041]
    131458073 (thoc2)
Messenger RNA biogenesis [BR:ssoe03019]
 Eukaryotic type
  mRNA surveillance and transport factors
   Transport factors
    TREX complex
     131458073 (thoc2)
Spliceosome [BR:ssoe03041]
 Complex C
  Other components
   EJC/TREX
    131458073 (thoc2)
 Other splicing related proteins
  Spliceosome associated proteins (SAPs)
   TREX complex
    131458073 (thoc2)
SSDB
Motif
Pfam: Tho2 THOC2_N Thoc2
Other DBs
NCBI-GeneID: 131458073
NCBI-ProteinID: XP_058482756
LinkDB
Position
4:complement(7174830..7196356)
AA seq 1564 aa
MATLIIPGEWFKNWDKSGKHEFVQLCKERTEKTDHGSEVKADVQAALYELCWQVVRGNLK
LDHVASVLGDMMELRDDMPSILADVFSILDLETGALEEKNKRDNYTQLVGACLFFIPDAI
LKERLDPETLESLGLIKQALQFNQKIVKIKTKLFYKQQKFNLLREENEGYAKLITELGQD
LSGNITSHSVLENIKSLIGCFNLDPNRVLDIILEVYQSRSDQDEFFLSLIKSYMCEPLTL
CHILGFKFKFYQEPNEETPKSLYHIAAALLHHNLIELEDLYVHLMPVDATIIEEHKGVIS
DAKQIARKLVMVVLPSEKSEDKEKEKEKEEDKNEKPPDNQKLGLLEALLRIGDWQHAQSI
MDQMPSFYATSHRAIALALCQLLHLTVEPLYRRAGVPKGARGCVLRPLRNKQAPRLAESF
EDLRRDTFSMLCYLGPHLSHDPILFAKIVRLGKGFMKEYQNDARSDVKEKEILLSCFLCI
ADQVLLPSLSLMECNACMSEELWGLFKLFPYQHRYRLYGQWKNETYSSHPLLVKVKAQTV
ERARYIMKRLTKENVKQSGRQIGKLSHSNPTILFDYMLSQIQWYDNLIVPVVDSLKYLTS
LNYDVLAYCIIEALANPEKEKMKHDDTTISSWLQSLASLCGAVFRKYPIELAGLLQYVTN
QLKAGKSFDLLILKEVVQKMAGIEITDEMTSEQLEAMTGGEQLKAEGGYFGQIRNTKKSS
QRLKDALLDHELALPLCLLMAQQRNGVVFLEGGEKHLKLVGHLYDQCHDTLVQFGGFLAS
NLSTEDYIKRVPSIDILCNQFHAPHDAAFFLSRPMYAHQILSKYDELKKAEKGNRQQQKV
HKYVAACEQVMTPVHEAVMSLHSTRVWDDLRPQFYATFWSLTMYDLAVPHSAYEREVNKL
KVQIKAIEDNQEIPMNKKKKEKERCTALQEKLQEEEKKQLEHVQRVLYRLKLEKDNWLLA
KSTKNETITKFLQLCLFPRCIFSSIDAVYCARFVELVHQQKTPNFCTLLCYDRVFSAIIY
TVASCTENESHRYGRFLCCMLETVTRWHSDRAIYEKECVNYPGFLTIFRSSGFDGGNKAD
QLDYVNFRHVVHKWHYMLTKASVHCLETGDYTHIRNILIVLTKILPCYPKVLNLGQALEC
RVHKICLEEKDKRPDLYALAMGYSGRLKSQKVHMVPENEFHDKEQPARSATPASQQNGPG
NMGKPAASTSKTEEGTSEDGDRGKDKSQGTTKPVNKANSAAAKVTTSNGNGALNSTKASK
ERDDKEKSGKEKKEKKEKTPGSTPEAKGDNRREKQRDERAGKDERVVREGKEKTPKADRE
KAKVEEKSSKDDKAKAGNGEPMEPSRERDAIKESKSKEKGDRSTVAGSIKSTIPRAESAE
SEREHKRRKLESHASPSHSSTLKDNSNEPKEPTSKHHITYNSVDRSKSRERETEKKDSEN
ARGRSKEKKEEKDRKERKRDHIVSDRDTSQESKRRKDENGTNSSKNSKSTSPSCDSPLSG
EKEKSKRSKSSSKEKTVSVKPERTSSGGKKESRHDKEKSEKKEKRESSGGKEEKKHHKSS
DKHR
NT seq 4695 nt   +upstreamnt  +downstreamnt
atggcgacactcatcatccccggtgaatggttcaaaaactgggacaaatcgggaaaacac
gaatttgtacaactctgcaaggaacgtacagagaaaacagatcatgggagtgaggttaaa
gcagatgtacaagctgccttatatgagctctgctggcaagttgtacgagggaacctgaag
ttggaccatgtagccagcgtccttggggacatgatggaactccgagatgacatgccatca
attttagcagatgtgttcagtatactagatttggaaactggtgcactggaagagaaaaac
aagcgtgataattacacacagctggtcggagcgtgtttgttttttattccagatgctatc
ctgaaggagaggttggatccagaaacccttgaatctcttggactcataaaacaagccctt
cagttcaatcagaagattgtaaaaatcaagactaaactcttttacaagcaacagaagttt
aacttgctaagggaagagaatgaaggctatgccaaacttatcacagagcttggccaagac
ctctcaggcaacatcaccagccattctgtcctcgaaaacatcaagtccctcataggatgt
ttcaacttggatcctaatcgtgttttggacataatcctggaggtgtatcagagtcgatct
gatcaagatgagttcttcctatctctcatcaagtcctacatgtgtgagcccctcacattg
tgccacatcttgggctttaagttcaaattttatcaggagcccaatgaggaaacccctaag
tccctttaccacattgctgctgctctgcttcaccacaacttgatagagctggaggatcta
tatgtgcatcttatgcccgtggatgctaccatcatagaagaacacaaaggagtcatctca
gacgcgaagcagattgcccgcaagttggtcatggttgtgttgccctcagaaaagagtgaa
gacaaagaaaaggagaaagagaaggaagaggacaaaaatgagaagccacctgataaccag
aagcttggtcttttggaagcactactcagaattggagactggcaacacgcccagagtatt
atggaccagatgccttccttctatgctacatctcacagggcaattgcactggcactatgc
cagcttctgcacctaactgtggaacctctttacagaagagctggtgtcccaaaaggagct
aggggatgtgtactccgcccactaaggaacaagcaagcccctaggcttgcagagagcttt
gaggaccttcgcagggacacattcagtatgctctgctacctgggacctcacctctcccat
gaccctatcctctttgccaaaattgtgcgcctgggcaagggattcatgaaagagtaccaa
aatgatgccaggtcagatgtcaaagaaaaggaaatactactgagttgtttcctgtgcatt
gcagaccaggtgctactcccttcactctccctcatggagtgcaatgcttgcatgtctgag
gaattgtggggtctcttcaaactgtttccttaccagcacaggtaccggttgtatggacaa
tggaagaacgagacatattccagtcatccacttctggttaaagtcaaagctcagactgtg
gaaagggccagatacattatgaagcgattgaccaaagagaatgtgaaacaatctggaaga
cagattggcaagctgagccatagcaatcctaccatcctctttgattatatgctctctcag
atccagtggtacgacaacctcattgttccagtggtggactcattgaaatacctcacatcc
ctcaactatgatgtcttggcttattgcataatcgaagctctagccaatccagagaaggag
aagatgaaacacgatgacactaccatctcctcatggctccagagtctggcaagtctgtgt
ggagctgtgttcagaaaatacccaattgaattggctggccttcttcagtacgtcactaac
cagctaaaagcaggcaagagttttgacctgcttatcctaaaagaggttgtgcagaaaatg
gctggcatagagatcacagatgagatgacctcagagcagttagaggccatgaccggaggg
gagcaacttaaagctgagggtggctattttggccagatcaggaacactaagaagtcatca
cagcgtctgaaggatgccctactggaccacgaactggccttaccactgtgtctactcatg
gcccagcaacggaacggtgtggttttcttagaaggtggagaaaagcacctcaaacttgtt
ggccatctctacgatcagtgtcatgacacattggtgcagtttggtggctttctggcctcc
aacctcagcacagaggactacatcaagcgtgttccctctattgacatcctctgtaaccag
ttccacgctccacatgatgctgcgtttttcctgtctcggccaatgtatgcccatcagatt
ttgtccaagtacgatgagctgaagaaagcagagaaaggtaaccggcagcagcagaaggtg
cataagtatgttgcagcctgtgaacaggtgatgactccagtgcacgaggccgtgatgtct
ctccattctaccagggtctgggatgatctccgccctcagttctatgccaccttctggtcc
ctcaccatgtacgacctggctgtgcctcattctgcctatgaacgtgaggtcaacaaactg
aaagtccagatcaaagccatcgaagacaaccaagagatacccatgaataagaaaaagaag
gagaaggaacgctgtactgccctgcaggagaaactgcaggaggaagagaaaaaacagttg
gaacatgtacaaagggttctgtaccgcctcaaactggagaaggacaactggttgttggcc
aaatccacaaagaatgagaccattacaaagttcctgcagctctgtttgttcccacgctgc
atcttctcttccatcgatgctgtttactgcgcccgctttgtcgaactggtccatcagcaa
aagacgcccaacttctgcactcttttgtgctatgacagggttttctctgctatcatatac
accgtggccagctgtacggagaatgagtctcaccgctacggacgtttcctctgctgcatg
ctggagacggtgacccgttggcacagcgaccgcgcaatctatgaaaaggaatgtgtgaat
tacccaggcttcctgactatcttcagatcatcaggctttgatggaggaaacaaagcagat
cagctcgactatgtgaacttccggcatgtggtgcacaaatggcactacatgctgactaaa
gcttctgttcactgcctggagaccggagattacacgcacatccgaaatattctcatcgtg
ctgaccaagatcctcccctgctaccccaaggttctgaacttgggtcaagcgcttgaatgc
cgcgtccacaagatctgcctcgaggagaaggacaagagacccgatctctatgccttggca
atgggttattcaggtcggttgaaaagccagaaggtgcacatggtccctgagaatgaattt
cacgacaaggagcagccagcacgcagtgccacccctgccagtcaacagaatggccccggc
aacatgggcaagcctgccgccagcacgagcaaaactgaggaggggacgtcagaggatggt
gatcggggaaaggacaaatctcaggggaccacgaagccagtgaataaagccaacagtgca
gcagccaaagtcaccaccagcaacgggaacggtgctctcaacagcaccaaagccagtaaa
gaacgggacgacaaagagaagagtgggaaagaaaaaaaagagaaaaaggaaaagacgcca
ggcagcacccctgaggccaagggtgacaaccgtcgggagaagcaaagagacgagagggct
ggaaaagatgagcgggtggtacgtgagggtaaggagaagacccccaaggcagacagggag
aaagcaaaggtcgaggagaagagcagcaaagatgacaaggccaaagccggcaatggggag
ccgatggagccatccagggagcgcgatgccatcaaggagtccaagagcaaggagaaggga
gacaggagtacagtggccggatccatcaagtcaacaattcccagagcggagtcagccgag
tctgagagggaacacaaaagacgaaagctcgagagtcacgcttctccatcccactcctcg
acacttaaggacaatagcaacgaacccaaggagcccacatccaagcatcacatcacctac
aattcagtagaccgatccaagagcagagagagggagacggagaagaaagattcagagaac
gcacggggccgatctaaagagaagaaggaagaaaaggatcggaaagaaaggaaacgagat
catatagtcagtgaccgtgacacaagccaagagtccaaacgtagaaaggatgagaatgga
accaattcctcaaaaaacagcaagagcacaagtccttcctgtgactcgccactttcaggt
gaaaaggagaagagcaaaagatccaaatcttccagtaaggaaaagactgtgtctgtgaaa
cctgagcgaacgtcttctggtggcaaaaaggagtccagacatgataaagagaaatctgag
aagaaggagaagagggaaagcagtggaggaaaggaagagaaaaagcaccataaatcgtct
gacaagcacagataa

DBGET integrated database retrieval system