KEGG   Antrostomus carolinensis (chuck-will's-widow): 104527839
Entry
104527839         CDS       T08541                                 
Name
(RefSeq) pre-mRNA 3' end processing protein WDR33
  KO
K15542  polyadenylation factor subunit 2
Organism
acar  Antrostomus carolinensis (chuck-will's-widow)
Pathway
acar03015  mRNA surveillance pathway
Brite
KEGG Orthology (KO) [BR:acar00001]
 09120 Genetic Information Processing
  09122 Translation
   03015 mRNA surveillance pathway
    104527839
 09180 Brite Hierarchies
  09182 Protein families: genetic information processing
   03019 Messenger RNA biogenesis [BR:acar03019]
    104527839
Messenger RNA biogenesis [BR:acar03019]
 Eukaryotic type
  mRNA processing factors
   3' end processing
    Cleavage and polyadenylation specificity factor (CPSF) complex
     104527839
SSDB
Motif
Pfam: Beta-prop_THOC3 Beta-prop_WDR5 WD40 Beta-prop_WDR3_1st WD40_CDC20-Fz WD40_Prp19 Beta-prop_WDR3_2nd WD40_WDHD1_1st Beta-prop_EML Beta-prop_CAF1B_HIR1 Beta-prop_EML_2 EIF3I WDR55 WD40_Gbeta Beta-prop_TEP1_2nd Beta-prop_Aladin Beta-prop_IFT140_1st Beta-prop_WDR36-Utp21_1st Beta-prop_EIPR1 Collagen Beta-prop_WDR90_POC16_2nd Beta-prop_WDR36-Utp21_2nd ANAPC4_WD40 WD40_MABP1-WDR62_2nd Beta-prop_SCAP Beta-prop_IFT122_1st Beta-prop_WDR75_1st Beta-prop_RIG_2nd WD40_RFWD3 Beta-prop_DCAF4 eIF2A
Other DBs
NCBI-GeneID: 104527839
NCBI-ProteinID: XP_010169960
LinkDB
Position
Unknown
AA seq 1081 aa
HGADVKCVDWHPTKGLVVSGSKDSQQPIKFWDPKTGQSLATLHAHKNTVMEVKLNLNGNW
LLTASRDHLCKLFDIRNLKEELQVFRGHKKEATAVAWHPVHEGLFASGGSDGSLLFWHVG
VEKEVGGMEMAHEGMIWSLAWHPLGHILCSGSNDHTSKFWTRNRPGDKMRDRYNLNLLPG
MSEDGVEYAPGDDLEPNSLAVIPGMGIPEQLKIAMEQEQMGKDESNDIEMTIPGLDWGME
EVMQKDQKKVPQKKVPYAKPIPAQFQQAWMQNKVPLPPPPEPLNDRKEDIKLEEKKKTQA
EIEQEMAALQYTNPQLLEQLKIERLAQKQAEQVQPPPPGGSLHGPQPFPGQGPMSQMPQG
FQQPLPPQQMPMNMPQMGPPGPQGQFRPPGPQGQMGPQGPLHQGAAGPQGFMGPQGPPGP
QGPPQGMPRPQDLHGHQGMQRHPGPHGPMGPPGPQGNAGPQGHLGPQGLPGSQTHLGPQG
PPGPQGHLGPQGPPGGQGMQGPPGPRGMQGPLPHGMQGAPGSQGMQGPISQGPLMGLNPR
GMQGPPGPRDNQGPNPQGMMMGHPQEMRGPHMQSGLLGHGPQEMRGLQGPPPQGTMLGPP
QELRGPPGPQGQQGPPQGSLVGPQGNMQGPQGQPNPSRGPHPSQGPLPFQQQKTPLLGDG
PRPPFNQDGQNPGPPPLIPGLGQQGGQGRLGPHNQGPGLNKGDSRGPPNHHMGPLSDRRH
DQNSGGPDHGPERGLFRGGQEWGDGRDSRGMPDRRGPHPDFHDDFDRPDDFRDDFHPDKR
FGHRLREFEGRGGPPLQDEKWRRGGPGPPFPPDHREFEGGGSNRGPPGAWEGRRPSDDRY
PRDPEDARFRGRRDESFRRGGPSRHEGRGPRGRDGFPGPEDFGPDDAFDSPDENSRGRDH
GGRGRGRGALRGGRKGLLPTPDEFPRFEGGRKPESWDGNREPGPRPDHPPHDGHSPANRE
RSSSLQGMDMASLPPRKRPWHDGPGTSDHREMDPPGGPPEERGKGRGNSGPSQRAPKSGR
SSSLDGDHHDGFHRDEAFGGGPAGGNNPSRGGRSGSNWGRGNNMNSGQSRRGASRGGGRG
R
NT seq 3248 nt   +upstreamnt  +downstreamnt
gacatggagcggatgtgaaatgtgttgactggcatccaacaaaggggttagttgtatcag
gaagtaaagatagccaacagcccatcaagttctgggatccaaagactggccagagccttg
ccacactacatgctcataaaaacacagtgatggaggtgaagctgaatctgaatggcaact
ggctgctaacggcttctcgtgaccatctctgtaagctctttgacattcgtaatctgaaag
aagaacttcaggtcttcagaggtcataagaaggaggccacagctgtggcctggcatcctg
ttcatgaagggctgttcgccagtggaggatctgatggctctttgttgttctggcacgttg
gggttgagaaggaggttggtggaatggaaatggcccatgaaggaatgatctggagtctgg
cgtggcatccccttggacacattctctgttcaggctcaaatgatcacaccagcaagtttt
ggactagaaatcgtcctggagataaaatgcgagatcgttacaacttgaatctgctccctg
ggatgtcagaggatggtgtggaatatgctccgggagatgatctggaacccaatagtcttg
cagttattcctgggatgggaatacctgagcaattgaaaatagccatggagcaagagcaga
tggggaaagatgagtccaatgatattgaaatgacaatcccaggactggactggggaatgg
aggaagtaatgcaaaaggaccaaaagaaggtgccacagaaaaaagtgccctatgcaaaac
caataccagcacagtttcagcaggcctggatgcagaataaagttcctttgcctcctccac
ctgagccattgaatgatagaaaggaagatattaagctggaggagaaaaagaagacacagg
cagagattgaacaggaaatggcagcacttcagtacacaaacccacaactcttagagcaat
tgaagattgagagacttgcacaaaagcaagctgagcaagtgcagccaccacctccaggag
gctctcttcacggaccccagccttttccagggcaaggcccaatgtcacagatgcctcaag
ggtttcagcagccccttccacctcaacagatgccaatgaatatgcctcagatgggacctc
ctggtccacaaggacagttcagacctccgggaccccaaggacaaatgggacctcagggcc
cattacaccaaggagctgcaggtccacaagggttcatgggaccacaaggacctccaggcc
cccaaggacctccgcaaggaatgccacggcctcaggatttacatggacatcaaggaatgc
agagacatcctggtcctcatgggccaatgggacctccaggtccacagggtaatgctggcc
cacaaggccacttaggaccacagggtctgcctgggtctcaaactcatttaggaccccagg
gtccacctggcccacaaggtcacctgggtcctcaaggtccccctggtggtcaaggaatgc
aagggccacctggtccccgaggaatgcaaggacctcttcctcatggcatgcaaggagctc
caggatctcaagggatgcagggccctatatctcaagggcctttgatgggacttaatccaa
gagggatgcagggtccaccaggacccagagataaccagggacctaatccacaagggatga
tgatgggtcatccacaagaaatgagaggtcctcatatgcaaagtggtttactaggacatg
gtccccaggagatgaggggcctccagggacccccacctcaaggtacaatgttaggaccgc
cacaagaattgcgtgggccaccaggtccgcaaggccagcagggacctccacaaggctctt
tagtggggcctcaaggaaacatgcaaggacctcagggacaaccgaatccttcaagggggc
cgcacccttcccagggccctctgcctttccagcagcagaaaacacctctgttgggtgatg
ggcctcgtcccccgtttaatcaggatggacagaacccaggtcctccaccactgataccag
gtctggggcagcagggaggacaaggtcgtcttggccctcacaaccaaggacctggtctta
ataaaggtgactcccgtggtccaccaaaccatcacatgggccctctctcagatagaaggc
acgaccagaacagtggtggtcctgatcatgggcctgagagagggttgtttcgaggtggcc
aggagtggggtgatggtagagacagcaggggcatgccagatagacgaggacctcacccag
atttccatgatgactttgatcgaccagatgacttccgtgacgacttccatccagataaga
gattcggccatcgattacgggaatttgaggggagaggaggcccaccattgcaagatgaaa
aatggagacgaggtggacctgggcctccctttcctcctgatcacagggaatttgaaggag
ggggttcaaaccgtgggcctcctggtgcttgggaaggacggagaccttcagatgatagat
atccacgggatcctgaggacgcacggttccgaggaagacgggatgagagcttcagaagag
gaggaccctcgagacatgagggccggggacccagggggagagatggctttcctggccctg
aagattttgggccagatgacgcttttgactctccagatgaaaactcaagaggaagggacc
atggtggtcgggggcgaggtcgtggtgctcttagaggaggccggaagggtttacttccca
ctccagatgaatttccacgctttgaaggagggcggaaaccagaatcctgggatggaaaca
gagaaccaggtcctcgtccagatcaccctcctcatgatgggcattctccagctaacagag
aacgttcttcttcgctccagggcatggacatggcctcattgccaccacgtaaacgtccct
ggcatgatggaccagggacctctgaccacagagaaatggatcctccaggtggaccaccag
aggagaggggcaaaggacgaggaaattcaggaccctcacagagagcaccaaaatctggga
ggtctagttctttagatggagaccatcacgatggattccacagggatgaagcttttggtg
gaggccctgcaggaggcaataacccttctcgaggaggaaggagcggaagtaattggggaa
gaggaaataacatgaactctggtcaatcaagaagaggagcatctaggggtggtggaagag
gccgatag

DBGET integrated database retrieval system