Entry |
|
Symbol |
Dsim_GD19586
|
Name |
(GenBank) uncharacterized protein
|
KO |
|
Organism |
|
Pathway |
|
Brite |
KEGG Orthology (KO) [BR:dsi00001]
09120 Genetic Information Processing
09122 Translation
03015 mRNA surveillance pathway
Dsimw501_GD19586 (Dsim_GD19586)
09180 Brite Hierarchies
09182 Protein families: genetic information processing
03021 Transcription machinery [BR:dsi03021]
Dsimw501_GD19586 (Dsim_GD19586)
03019 Messenger RNA biogenesis [BR:dsi03019]
Dsimw501_GD19586 (Dsim_GD19586)
Transcription machinery [BR:dsi03021]
Eukaryotic type
RNA polymerase II system
Other transcription-related factors
Transcription termination factor
Dsimw501_GD19586 (Dsim_GD19586)
Messenger RNA biogenesis [BR:dsi03019]
Eukaryotic type
mRNA processing factors
3' end processing
Cleavage stimulation factor (CSTF) complex
Dsimw501_GD19586 (Dsim_GD19586)
U7-dependent histone pre-mRNA processing factors
Dsimw501_GD19586 (Dsim_GD19586)
|
SSDB |
|
Motif |
|
Other DBs |
|
LinkDB |
|
Position |
3R
|
AA seq |
1155 aa
MDSIIGRGQFVSETANLFTDEKTATARAKVVDWCNELVIASPSTKCELLAKVQETVLGSC
SELAEEFLESVLSLAHDSNMEVRKQVVVFIEQVCKVKVELLPHVINVVSMLLRDNSAQVI
KRVIQACGSIYKNGLQYLCSLMEPGDSAEQAWNILSLIKAQILDMIDNENDGIRTNAIKF
LEGVVVLQSFADEDSLKRDGDFSLGDVPDHCTLFRRQKLQEEGNNILDILLQFHGTTHIS
SVNLIACTSSLCTIAKMRPMFMGAVVDAFKQLNANLPPTLTDSQVSSVRKSLKMQLQTLM
KNRGAFEFASTIRGMLVDLGSSTNEIQKLIPKMDKQEMARRQKRILENAAQSLAKRARLA
SEQQDQQQREMELDTEELERQKQKSTRVNEKFLAEHFRNPETVVALVLEFLPSLPTEVPQ
KFLQEYTPIREMSIQQQVTNISRMFGEQLSERRLGPGAATFSREPPMRVKKVQPIESTLT
AMEVDEDAVQKLSEEELQRKEEATKKLRETMERAKGEQTVIEKMKERAKTLKLQEITKPL
PRNLKEKFLTDAVRRILNSERQCIKGGVSSKRRKLVTVIAATFPDNVRYGIMEFILEDIK
QRIDLAFSWLFEEYSLLQGFTRHTYVKTENRPDHAYNELLNKLIFGIGERCDHKDKIILI
RRVYLEAPILPEVSIGHLVQLSLDDEFSQHGLELIKDLAVLRPPRKNRFVRALLNFSVHE
RVDLRDRAQAHLVSLYHVHKILPSRIDEFALEWLKFIEQESPPAAVFSQDFGRPSEEPAW
REDTTKVCFGLAFTLLPYKPEVYLQKICQVFVSTSAELKRTILRSLDIPIKKMGVESPTL
LQLIEDCPKGMETLVIRIIYILTERVPSPHEELVRRVRDLYQNKVKDVRVMIPVLSGLTR
SELIAVLPKLIKLNPAVVKEVFNRLLGIGAEFAHQTMAMSPTDILVALHTIDTSVCDLKA
IVKATSLCLAERDLYTQEVLMAVLQQLVEVIPLPTLMMRTTIQSLTLYPRLANFVMNLLQ
RLIIKQVWRQKVIWEGFLKTVQRLKPQSMPILLHLPPAQLVDALHQCPDLRPALSEYAES
MQDEPMNGSGITQQVLDIISGKSVDVFVTDESGGYISAEHIKKEAPDPSEISVISTVPVP
AKPSQGLGTSRTALS |
NT seq |
3468 nt +upstreamnt +downstreamnt
atggatagcataattggacgcggccagtttgtctcggagacggccaatctgttcacggac
gagaagacagctacggcaagagctaaggtggtcgattggtgcaatgagctggtcatcgca
tcaccttcgacaaagtgcgaactgctggctaaggtgcaggagactgtgcttggatcctgc
tcggagctggccgaagaattcctggagtccgtgctgtctttggcccacgactcaaacatg
gaggttcgcaagcaggtagtcgtcttcatagagcaagtttgtaaagtgaaggtggagcta
ctgccccatgtcattaacgtcgtgtctatgctgcttagggataactcggctcaggtgatc
aaaagagtgatccaggcctgcggcagcatctacaagaatggattgcagtacttgtgcagc
ctcatggagcccggcgacagtgcggagcaggcgtggaacatcctcagcttaataaaggcg
caaatactggacatgatcgacaatgaaaacgacggcatacgtacaaatgccattaagttc
ttggagggcgtcgttgtcctgcagagctttgctgacgaggacagtctgaaacgagatgga
gacttctcgctgggcgatgtccccgatcactgcacactattccgccgtcaaaagttgcag
gaggagggcaacaatatcttagatatcttgcttcagttccacggaacaacgcatatttcc
tcagtgaacttgattgcctgcacaagcagcttgtgcacgattgccaaaatgcgacccatg
ttcatgggcgccgtggtggatgcctttaagcagctgaatgccaacctgccgcccaccctc
actgactcgcaagtaagctccgtgcgcaagagcctgaagatgcagctgcagacattaatg
aagaaccgcggtgcctttgagtttgcaagcactatccggggtatgttagttgacctgggg
tcctccacgaatgagatccagaagcttattcccaaaatggacaaacaggaaatggctcgc
agacaaaaacgcatccttgaaaatgctgcgcaaagtcttgcaaagcgagcgcggttggcc
agtgagcagcaggatcagcagcagcgagaaatggagctagacactgaggaattggaacgg
cagaaacagaagtccacacgagttaacgaaaagtttctggcagagcatttccgtaatcca
gaaactgttgtggccctggtgctagagttcttgcccagtctgcccactgaggtgccacaa
aagttccttcaggaatacacacccattcgcgaaatgtctattcagcagcaagtaaccaat
atctccagaatgtttggcgaacaactatctgaaaggcgcctaggcccaggtgccgcaacc
ttcagtcgagagccgccaatgcgggttaagaaagtacaaccgattgaatcaacactaact
gcaatggaagtggatgaggatgctgttcaaaaactgagtgaggaagagctccaacggaag
gaggaagcaaccaagaagctgcgcgagaccatggagcgcgctaagggcgaacagactgtt
attgaaaagatgaaggagcgcgccaagacgctgaagctgcaggagatcaccaagcccctg
ccacgcaacctaaaagagaaattcctaactgatgcagtgcgccgcatcctcaactcggag
cggcagtgcatcaagggaggcgtgtcatcgaagcgccgtaagctggtcaccgtgattgct
gcgacatttcccgataacgtgcgttatggcataatggagttcatcctggaggatataaaa
cagcgtatcgatttggcattttcgtggctctttgaggagtactctctgctgcaaggattt
acaaggcacacttatgttaagacagagaacagacccgatcacgcctacaacgagttgctg
aataagctcatctttggcattggagagcgttgcgatcacaaggataagattattctgatt
cgtagagtttatctagaagcacccattcttccggaagtctcaatcggacatctagttcaa
ttaagcctggacgacgagttctcccagcacggcctggagctgatcaaggaccttgcagtg
ctcaggccgccgcgaaagaatcgcttcgttagagcgttgctcaacttttctgtgcacgag
cgagtggatctgcgggatcgggcacaggcccatctagtcagtctgtaccatgtgcacaaa
atactaccgtctcggatagacgaattcgctctggagtggcttaaatttatcgaacaggag
tctccgccggcagctgttttctcgcaggactttggtcgcccaagcgaggagcctgcttgg
cgggaagacactaccaaggtgtgctttggtttggcctttaccctgctgccctataaaccg
gaagtttacttacagaaaatttgtcaagtgtttgtttccacatctgccgaactgaaacgc
acgatcttgcgcagtctagatatccccataaaaaagatgggtgtggaaagcccgacgttg
ctgcaactgatcgaggattgccctaagggcatggagaccttagtgatccgtataatttac
atactgacggagcgagtgccatccccccatgaagaattggtacggcgggttagggatctc
taccagaacaaggtcaaggatgtgcgcgttatgatacccgttctaagtggcttgacccga
tccgagttaattgccgttctgcctaagctgattaaacttaatcccgctgtggtcaaggag
gttttcaatcgattgcttggtatcggtgccgaattcgcccatcagacgatggccatgtcc
cccactgatattctcgtggctttgcacaccatcgatacgagtgtctgcgaccttaaagcc
atcgtcaaggctacatccctctgtttggctgaaagagatctgtacacacaggaggtactt
atggctgtgctgcaacagcttgtagaggttattccgctgcctactctgatgatgcgcaca
acaattcagagtctaaccctgtacccgcgtctggctaactttgtaatgaacttgctgcag
cgtctgataatcaagcaggtctggcgacaaaaagttatctgggagggcttcctaaagact
gtgcagcgcctgaaaccgcagtcaatgccgattctgctgcatcttcctcccgcccaattg
gtggacgcactgcatcagtgccctgatttaagaccagcattgtcggaatacgccgagagc
atgcaggatgaaccgatgaatggtagtggcattacccagcaggttctggacatcatctcc
ggtaaatcagtggatgtgttcgtgacggacgaaagcggcgggtacatcagcgccgagcat
attaaaaaagaggcaccggatccatcagaaattagtgtaatttccacagtgccagtaccg
gcaaaaccatcacaaggacttggaacatcccgtaccgcactgtcctga |