KEGG   Schistosoma haematobium (urinary blood fluke): MS3_00008099
Entry
MS3_00008099      CDS       T05004                                 
Symbol
THOC2_1
Name
(RefSeq) THO complex subunit 2
  KO
K12879  THO complex subunit 2
Organism
shx  Schistosoma haematobium (urinary blood fluke)
Pathway
shx03013  Nucleocytoplasmic transport
shx03040  Spliceosome
Brite
KEGG Orthology (KO) [BR:shx00001]
 09120 Genetic Information Processing
  09121 Transcription
   03040 Spliceosome
    MS3_00008099 (THOC2_1)
  09122 Translation
   03013 Nucleocytoplasmic transport
    MS3_00008099 (THOC2_1)
 09180 Brite Hierarchies
  09182 Protein families: genetic information processing
   03019 Messenger RNA biogenesis [BR:shx03019]
    MS3_00008099 (THOC2_1)
   03041 Spliceosome [BR:shx03041]
    MS3_00008099 (THOC2_1)
Messenger RNA biogenesis [BR:shx03019]
 Eukaryotic type
  mRNA surveillance and transport factors
   Transport factors
    TREX complex
     MS3_00008099 (THOC2_1)
Spliceosome [BR:shx03041]
 Complex C
  Other components
   EJC/TREX
    MS3_00008099 (THOC2_1)
 Other splicing related proteins
  Spliceosome associated proteins (SAPs)
   TREX complex
    MS3_00008099 (THOC2_1)
SSDB
Motif
Pfam: Tho2 THOC2_N Thoc2 YlqD VMAP-M1 ABC_tran_CTD
Other DBs
NCBI-GeneID: 24595283
NCBI-ProteinID: XP_051067032
UniProt: A0A922IPC7
LinkDB
Position
4:complement(41372983..41434685)
AA seq 1512 aa
MTNILGHKYHFTQEPGVNTPESLYKVSAFLIWKKLIDLDVLYGHLTPSDADIQQSHNRQI
NLAKSYRPQPPASLSYSATSAINTSLNTAINCDLIRVDGLLSQDSQLGGSGTTSVSGTLM
TNDWTGPRTSGALTSRSAEELMQLEENPDVNLSKTNSHDVSEVGMIDISIDQDGLYENNQ
KLELCAAMVRLGDWTNAQRLLERLPGYWATTYEPLTRDICNLLHCLIEPLYNKACPLPAC
LQRNRFRPRIESFKALNASINLHGIDEKSSVDCEIHEDSRTITNNNNNNNDCILLSPVHD
FVGLAKYVIPIAIYLGPHMSYDLILIVKFCRLGQIYLSEQQQNLRSDKIGIVYQGFFNLL
DEVLLPSLSLVDANCCLAEEIWQMLRFLPYEHRYRLYGQWKHFNVQTEPSLIRRRAQVMC
YAKAIMKRLSKENVKPMGRHLGKLSHSNPGLLFDHVLHTVQLFTNLINPVVEALKYVSSL
GYDMLTFCIIEALATDQSKLEDLQLSQGLQALSLFTGLLCRKYQFDLAGLLQYVLNQLKV
GKSYDLAILREILHRMSGIDISEEMTEEQLESMSGGELLLQEGGYYAQIRNTRRNAGRLK
DALVEHNLIMPFIFLMAQQRDAIIFLDDPERHCKLAGRLYDQCQGTLVQFITFLSLQLTR
DEMQTQCLNIDRMMGEFHVPADTAFCFHRNLFEQKVARLFESKEQAETMKSGEKSKSFSK
TIFSAASLHVCNEIAAAIRPLYPARVWDELGIVFFITFWSLQSSDLVVPESAYQRQIQQL
KEQIQQIDTPASGWNSTKKKREIERLENLIERLTNEQAEREEHVTRVRAWLMTERDNWFQ
TRLATKTDTITQFLQLCIYPRVCFTATDAIYAAQFMHVLHQLKTARFSTLICLDRIFNDI
TLPTSMCTENEAHRYGRFLCAVLELVMRWHASEEVFNQECGQYPGFVTVFRKTYQGLDAN
TKPDQLQYENYRHVVHKWHYRITKAIVACLESGNYVQIRNALIVLTRILPQYPKITQFGS
AVERRVNKLKDEEKDRRPDLKALAFSYAGLMRPRKVKWVSEEEFHLKETKPIRSETNTNY
SSNHQSINSGGNTNRTVSNPDNSSGTTVNSTSYGHNPRGHSGSNQLTSFSNAPSNINNTS
CGITNTNTSSSGSGGNNTIFVTSISNSSSSSATLEPPHISVSVRRTSTQPPPSNQTSKIP
SNSTVAPSGSVLETRSRPQTVNASITAVESSKCNPSGQLSGSVSLSSVTAVSNSSSSNNL
SNNSPSIPTTTQSKLRSSQRFESSLPPPDDDVASVPGPQSHSSKRRRMEAASANLTSPNG
CILSVTTSKASVNTVDEPRESHKSSTRKRSTPTTIPGSAPITPETAIQNLRYTNAGIVSN
TVPSVNSCSGGSSGTGSRSTTSPGENVYPVNSSLRHHHHHQSRQSSHQRASSGNSTESRE
PSLENERNLIASSSAGLVKKAKKAHHHHHHQRSSLDSMNMESNLNNNINNNPGSITIYST
SESQGPSRHSRR
NT seq 4539 nt   +upstreamnt  +downstreamnt
atgactaatatattgggacacaaatatcattttacacaagaacccggtgtcaatacacca
gagtcattgtacaaagtttcagcatttttaatttggaaaaagttaattgacttggatgta
ttatacggacacttaactccatccgatgcggatattcagcaatctcataatcgtcaaatt
aacttggctaaatcttatcgtccacaaccgccggcaagtttatcttattcagcaacttct
gcgatcaatacctcgttgaatactgcaatcaactgtgatttaattcgtgttgatggtcta
ttgagccaggattctcaacttggtggtagtggtactacttctgtttctgggacattgatg
acgaacgattggactggtcctaggacgtcaggagcgctaacttctcgctctgcagaagag
ctaatgcaattggaggagaatcccgatgtaaatttaagtaaaacgaattcacatgatgtt
agtgaagttggtatgatagatataagtattgatcaagatggattgtatgaaaataaccaa
aaattggagttatgtgctgccatggttcgtttaggcgattggacaaatgctcaacgtctg
ttagaacgcctacctggttactgggcaactacatatgaacctttaacacgagacatatgc
aatttgttgcattgtttaattgaacctttgtacaataaagcttgccctttacctgcctgt
ttacaacgtaatcgttttcgaccgagaatcgagtcattcaaagcattaaatgcatctatt
aatttacatggtattgatgagaaatcgtctgttgattgtgaaatacatgaagacagtcgg
accattacaaataataataacaataataatgattgtattctattatcacctgttcatgat
tttgttggccttgccaagtatgtaataccaatagcaatttatttaggcccgcatatgtca
tatgatttaatattaatagtgaagttttgtcgtcttggtcaaatatatctttctgaacag
caacaaaaccttcggtctgacaagattggaattgtttatcaaggattttttaatttatta
gatgaagtgcttcttccgtcgttgagtttggttgatgctaattgttgtttggctgaagag
atttggcagatgttacgatttttgccttatgaacatcgatatcggttatatggacaatgg
aaacatttcaacgttcaaactgaacctagcctaatacgtcggcgggctcaagtaatgtgc
tatgctaaggcgattatgaaacgattgagcaaggaaaacgttaaaccaatgggtagacat
cttggaaaattgtcacacagtaatccaggtttattatttgatcacgtcttgcatacagtt
caactgtttacaaatcttataaatcctgttgttgaagcattgaaatatgttagcagttta
ggctatgatatgctaactttttgtattattgaagcacttgccacagatcaatcgaaactg
gaagatttacaattaagtcaaggtttacaagcattgtcgttgtttactggattgttgtgc
agaaagtatcaattcgatttggctggtttgttacaatatgtactgaatcaattaaaagtt
ggtaaaagttatgatttggcaatattacgtgaaattctccatcgtatgtctggtatcgat
atctcggaagaaatgacggaagaacaattagaatccatgtcaggtggtgaattactattg
caagaaggtggttattatgcacagatacgtaacacacgtcgtaatgctggacgtcttaaa
gatgccttggttgaacataatttgatcatgccatttatttttttaatggctcaacagcgt
gatgctatcatatttctagatgatccagaacgtcactgtaaattagctgggcgtttatat
gatcagtgtcaaggtactcttgtacaattcataacatttcttagtcttcaattaactcgt
gatgaaatgcaaactcagtgtttaaatattgacagaatgatgggtgaatttcatgttcca
gctgatactgccttttgttttcatcgtaatctgtttgaacaaaaagttgctcgtttattt
gaaagcaaagaacaagcagaaacaatgaaatctggtgaaaaatcaaagtcattcagcaag
acaattttctcagctgcttctttacatgtgtgcaatgaaatagctgcagctatacgtcca
ttatatccagctcgtgtatgggatgaattaggcattgtgttctttatcacattttggagc
ttacagtcaagtgatttagttgtccccgaatccgcctatcaacgacaaatacaacaatta
aaagagcagattcaacaaattgatacgccggctagtggatggaattcaacaaaaaagaaa
cgtgagatagaacgtttagaaaatttaattgaacgtctaaccaatgaacaagctgaacgg
gaagaacatgtaacacgtgtacgtgcttggcttatgactgaacgtgataattggtttcaa
acacgattagctacaaaaactgatactattacacaattcttacaattatgtatatatcct
agagtgtgtttcaccgcaactgatgctatctatgcggcacaatttatgcatgttttgcat
caattaaagactgcacgattctctacgctgatttgtttggatcgaatttttaatgatata
acgttgccaaccagtatgtgtacagaaaatgaagcgcatcgttacggtcgatttttatgt
gcagtcttagaacttgtgatgcgttggcatgccagtgaagaagtgttcaatcaagaatgt
ggtcaatatcctggttttgttactgtcttcagaaagacatatcaaggtttagatgcgaat
actaaaccagaccaattgcagtatgagaattatcgtcatgttgtacataaatggcattat
cgtattacaaaagcaattgttgcttgtctagaatctggtaattatgttcaaattcgcaac
gcattaattgtattaacacgaattctacctcaatatccgaaaatcacacagttcggaagt
gctgttgaacgacgtgtaaataaactaaaagatgaagagaaagatcgacgtcctgattta
aaggctttagcattcagttatgctggcctaatgcgtccacgtaaagttaaatgggttagt
gaggaagaatttcatttgaaagaaactaaaccaattcgctcagagacgaatacaaattat
tcaagtaatcatcaaagcatcaacagtggtggtaacactaatcgtactgtatcaaatcct
gacaatagttccggaactactgtcaactcaactagttacggtcataatccacgtggacat
tccggttcaaatcaattgaccagtttttcgaatgcgccttcaaatataaacaacactagt
tgtggtatcaccaatactaataccagtagtagtggtagtggaggcaacaacacaattttt
gttactagcatcagtaattcatctagtagttctgcgacacttgaacctccacatatttca
gtatctgttcgacgtacttcaacacaaccaccaccatcaaatcaaactagtaaaatccca
tccaactcaactgttgctccaagcggtagtgttctggaaacacggtcacgacctcaaaca
gtgaatgcgtctataacagccgtagagtcttcgaaatgcaatccttctggtcaactatcc
gggtctgtgtcactgtcctcggtaacagctgtgagcaattctagtagttcgaataatttg
agcaataattcaccttccattccaacaactacccaaagtaaattacgatctagtcaacgt
tttgagtcatctctaccacctccagacgatgatgttgcttctgtaccgggaccacaatca
cattcatcgaaacgacgcagaatggaagcggccagcgcgaatttgactagtcctaatggg
tgcattttaagtgtgacaacatccaaggcatcagtgaacacagttgacgaaccacgtgag
tcacataaatcttccacacgtaaacgttcaactcccacaactattccaggaagtgcaccg
atcactccagaaactgctatacaaaatttacgctacacaaatgccggtatagtttccaac
actgttcccagtgttaattcttgcagcggcggtagtagtggtactggtagtcgttcaaca
acaagtcctggtgaaaatgtttaccctgttaattcttctctacgtcatcatcatcatcat
caatctagacagtcgagtcatcaaagagcatcatcaggtaattctactgaaagtagagaa
ccaagcttagaaaatgaacgaaatttaattgcttccagttcagcaggtttggtaaagaaa
gcaaagaaagctcatcatcaccaccatcatcaacgttcgtcattagattcaatgaatatg
gaatctaatcttaacaataatattaataataatccgggttctattactatatattcaact
tcagaatcccaaggaccatctcgacattcacgtagataa

DBGET integrated database retrieval system