KEGG   Thalassiosira pseudonana: THAPSDRAFT_28004
Entry
THAPSDRAFT_28004  CDS       T01078                                 
Name
(RefSeq) predicted protein
  KO
K12867  pre-mRNA-splicing factor SYF1
Organism
tps  Thalassiosira pseudonana
Pathway
tps03040  Spliceosome
Brite
KEGG Orthology (KO) [BR:tps00001]
 09120 Genetic Information Processing
  09121 Transcription
   03040 Spliceosome
    THAPSDRAFT_28004
 09180 Brite Hierarchies
  09182 Protein families: genetic information processing
   03019 Messenger RNA biogenesis [BR:tps03019]
    THAPSDRAFT_28004
   03041 Spliceosome [BR:tps03041]
    THAPSDRAFT_28004
Messenger RNA biogenesis [BR:tps03019]
 Eukaryotic type
  mRNA surveillance and transport factors
   Transport factors
    Other transport factors
     THAPSDRAFT_28004
Spliceosome [BR:tps03041]
 Complex B
  Other components
   Prp19-related factors
    THAPSDRAFT_28004
 Complex C
    THAPSDRAFT_28004
SSDB
Motif
Pfam: HAT_Syf1_CNRKL1_C HAT_Syf1_M HAT_Syf1_CNRKL1_N HAT_PRP39_C HAT_PRP39_N TPR_14 TPR_8 TPR_CcmH_CycH TPR_15 TPR_2 TPR_16 Suf ARM_TT21_C TPR_12 TPR_MalT ARM_TT21 Mad3_BUB1_I RPN7 TPR_9
Other DBs
NCBI-GeneID: 7443740
NCBI-ProteinID: XP_002289912
JGI: 28004
UniProt: B8C032
LinkDB
Position
4:complement(1723234..1726375)
AA seq 832 aa
MIGERSVSLLPGSYKLWMKHLSFCLSLLDHSLPVYLLSSSSSHNHYKLTQSAFERALVRL
HKMPKLWLMYAAFVSLYDPLRDPTTVRRVYDRALVALPASQHERVWEEIICWVTGILPST
ALRILRRHALCFDTTFREDLATLCITRYKRYGEGASLLLQLLNNENASGSSTTFLSPNGT
TRHELWLRFADVCTSHPNEAKQQKQQKNIQVISHRLGEMEGTLWTRLAEYHVRAGDFELA
RSVYEEALDAITRVRDFSLVFDAYVRFEEGDLDILLGDNSLQDENAESSADVELAISRAE
HLTSRRPLLLNRVLLRQNPNNVGEWIKRSQLYLDLGEVDMAASALEEALKSVNSGKAVNG
SPSTIVLTLIDVHENKQKDLEAARNVLERICYNNEYTFTDTDDLAQCHSAWVELELRQEN
WDMALNLARRAVSSNTGGQKRGFKAVRGLSRSLRLWNLLFDLEESLGTVQTTKDAYDRSL
ELKVATPSHVLNYANFLKDKKYFEESFAAYERGLGLFPFPHAGATLLWKNYLTNFLERYE
GSKTPRVRELFDRCLADCPPEESPEFYLQYGEYEETHGLTKRALGVYERMCNAVPAAENY
TAFRLYIAKAIKYLGVTSARPIYERAISALEDKPAASICLEYAKMETGLRETDRARTVLV
FGAQLADPRRDPDYWNAWHEFEVSHGNEETFREMLRVKRSVQAAFSTVNYNAAEMGSGAP
KVDTLTEESALEMIAEREGIETEKQPVVGGFVQAKRTADMADLGEVERRAARLRQATGAQ
NVSAGDDEIDIDDVDEEGEEPQQAPRTSNVGGVSTKAVPAAVFGGITTCGDN
NT seq 2499 nt   +upstreamnt  +downstreamnt
atgattggtgaacgttccgtcagtttgttaccggggagttacaaattatggatgaagcat
ttatcattctgtttgagtttattggatcattcgttaccggtatacctcctctcttcttcc
tcctcacacaatcactacaagctgactcaatcggcattcgagagggcattggtccgtcta
cacaagatgccaaaactgtggctcatgtatgcggcgttcgtatcgttgtatgatccgctt
cgggatccaacgactgtgcgaagggtgtacgatcgagcgttggtggcacttccggcgagt
caacatgagagggtttgggaggagattatatgttgggttacggggatattgccttcgacg
gcacttcgtatcctccgccgtcatgcattgtgctttgataccacatttcgtgaagatttg
gcaactttgtgtattactcggtacaaacgatacggggagggggcatcgttgttgttgcaa
cttctcaacaatgagaatgcaagtgggtcatcgactaccttcctaagtccaaatggaacc
acaagacacgagttgtggttacgttttgcagatgtgtgtactagtcatcctaatgaggca
aagcagcagaagcaacagaaaaatattcaagtcattagtcatcgtttgggtgaaatggag
gggacgctgtggacacgacttgcggaatatcacgttcgtgcaggagactttgagctagca
cgttcagtgtacgaagaagcgttggatgcgatcactcgtgttcgtgatttcagtttagtg
tttgatgcctacgtgaggtttgaagagggggatttagatattcttcttggtgacaattct
ctccaagacgagaatgctgaatcatctgcggatgttgagcttgccatctcccgtgctgag
cacctgacttctcgtcggccactgcttctcaatcgtgtcctgttacgtcagaatccaaac
aacgtcggtgaatggatcaagcgatcgcaactttatcttgacttgggtgaagtggacatg
gctgcatcagcattggaagaggcactgaagtcggtgaattctggaaaggcagtaaacgga
tcaccatctacaatcgttctgacgttgattgatgttcacgagaacaagcaaaaggattta
gaggcagctcgcaatgttttggaaaggatatgttacaataatgagtacacgtttactgat
acggatgacttggcccagtgtcattctgcgtgggttgaattggagcttcgacaagagaac
tgggacatggcgttgaacttggcgagacgagctgtgtcgagcaacactggtgggcagaag
agaggattcaaggctgttcgcggtctttctcgcagccttcgcctttggaatttgctattt
gatttagaagagtctttgggaacagtgcagactaccaaggatgcatacgatcgttctttg
gagttgaaggtggcaactccttctcacgttctcaattatgccaacttcttgaaagacaaa
aagtatttcgaagagtcctttgcggcatacgaaagaggcctcgggttgtttccgtttccg
cacgctggagcaacgctactgtggaagaattacctcacaaactttctggagcgatatgaa
ggatccaaaacgcctcgtgtacgtgagctctttgaccgttgccttgccgactgtcctccg
gaagagtcgcctgagttctatcttcagtacggggagtacgaagaaacacatgggctgact
aaacgtgctctaggagtatacgaacggatgtgcaacgcggtgcctgctgccgaaaattac
actgccttccgtctgtacattgccaaggctattaagtaccttggcgttacctctgcgaga
ccaatttacgaacgtgccatctctgcattagaagataagccggctgcgagcatttgtttg
gaatacgccaaaatggagacgggattacgagagacggacagagctaggaccgtgttggtg
tttggtgcacaactagctgatcctcgtcgcgatccagactattggaatgcatggcacgag
tttgaagtatcacacggcaatgaggagacattccgtgagatgctgagagtcaaacgcagc
gtacaggcagcgttctctactgtcaattacaacgccgctgaaatgggatccggtgctcca
aaggtggatacactaacggaagagagtgctttggagatgattgctgaacgagagggtatt
gaaacggagaaacagccagtcgttggtggattcgttcaagcaaagagaactgcagatatg
gcagatttgggagaagttgagaggagagcagcaagattgcgtcaggcaactggagcccaa
aatgtgagcgcaggagatgacgagattgatattgacgacgttgacgaagaaggagaagaa
ccgcaacaggcaccaaggacgtcaaatgttggaggagtcagcacaaaagctgttccagct
gcggtcttcggcggaatcaccacctgcggagacaactga

DBGET integrated database retrieval system