KEGG   Viruses: 1724740
Entry
1724740           CDS       T40000                                 
Symbol
gag-pro-pol
Name
(RefSeq) Human T-cell leukemia virus type I; Pr gag-pro-pol
KO
K23454  Deltaretrovirus gag-pro-pol polyprotein
Virus
11908  Human T-cell leukemia virus type I
Motif
Pfam: Gag_p19 RVT_1 Gag_p24_C rve IN_DBD_C RVP Integrase_Zn Gag_p24 RNase_H zf-CCHC zf-CCHC_5 zf-CCHC_2 DUF3293 zf-CCHC_3
Other DBs
NCBI-GeneID: 1724740
NCBI-ProteinID: NP_057860
RS: NC_001436
UniProt: P14078
Position
NC_001436:join(450..1718,1718..2245,2245..4836)
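The Position field lists three segments whose boundaries share a nucleotide (1718 and 2245). That pattern is consistent with a polyprotein read through two -1 ribosomal frameshifts, where the shared base is read twice. As a minimal sketch, the segment arithmetic can be checked as below; the helper names `parse_join` and `spliced_length` are hypothetical and not part of any KEGG tool.

```python
import re

def parse_join(location: str) -> list[tuple[int, int]]:
    """Extract 1-based inclusive (start, end) pairs from a join(...) location string."""
    # Only 'a..b' ranges are matched, so the accession digits before ':' are ignored.
    return [(int(a), int(b)) for a, b in re.findall(r"(\d+)\.\.(\d+)", location)]

def spliced_length(segments: list[tuple[int, int]]) -> int:
    """Sum segment lengths; a base shared by two segments is counted once per segment."""
    return sum(end - start + 1 for start, end in segments)

location = "NC_001436:join(450..1718,1718..2245,2245..4836)"
segments = parse_join(location)
total = spliced_length(segments)
print(segments, total)
```

The three segments are 1269, 528, and 2592 nt long, so the total is 4389 nt, which matches the length given in the NT seq header.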
AA seq 1462 aa
MGQIFSRSASPIPRPPRGLAAHHWLNFLQAAYRLEPGPSSYDFHQLKKFLKIALETPVWI
CPINYSLLASLLPKGYPGRVNEILHILIQTQAQIPSRPAPPPPSSSTHDPPDSDPQIPPP
YVEPTAPQVLPVMHPHGAPPNHRPWQMKDLQAIKQEVSQAAPGSPQFMQTIRLAVQQFDP
TAKDLQDLLQYLCSSLVASLHHQQLDSLISEAETRGITGYNPLAGPLRVQANNPQQQGLR
REYQQLWLAAFAALPGSAKDPSWASILQGLEEPYHAFVERLNIALDNGLPEGTPKDPILR
SLAYSNANKECQKLLQARGHTNSPLGDMLRACQAWTPKDKTKVLVVQPKKPPPNQPCFRC
GKAGHWSRDCTQPRPPPGPCPLCQDPTHWKRDCPRLKPTIPEPEPEEDALLLDLPADIPH
PKNLHRGGGLTSPPTLQQVLPNQDPTSILPVIPLDPARRPVIKAQIDTQTSHPKTIEALL
DTGADMTVLPIALFSSNTPLKNTSVLGAGGQTQDHFKLTSLPVLIRLPFRTTPIVLTSCL
VDTKNNWAIIGRDALQQCQGVLYLPEAKRPPVILPIQAPAVLGLEHLPRPPEISQFPLNP
ERLQALQHLVRKALEAGHIEPYTGPGNNPVFPVKKANGTWRFIHDLRATNSLTIDLSSSS
PGPPDLSSLPTTLAHLQTIDLKDAFFQIPLPKQFQPYFAFTVPQQCNYGPGTRYAWRVLP
QGFKNSPTLFEMQLAHILQPIRQAFPQCTILQYMDDILLASPSHADLQLLSEATMASLIS
HGLPVSENKTQQTPGTIKFLGQIISPNHLTYDAVPKVPIRSRWALPELQALLGEIQWVSK
GTPTLRQPLHSLYCALQRHTDPRDQIYLNPSQVQSLVQLRQALSQNCRSRLVQTLPLLGA
IMLTLTGTTTVVFQSKQQWPLVWLHAPLPHTSQCPWGQLLASAVLLLDKYTLQSYGLLCQ
TIHHNISTQTFNQFIQTSDHPSVPILLHHSHRFKNLGAQTGELWNTFLKTTAPLAPVKAL
MPVFTLSPVIINTAPCLFSDGSTSQAAYILWDKHILSQRSFPLPPPHKSAQRAELLGLLH
GLSSARSWRCLNIFLDSKYLYHYLRTLALGTFQGRSSQAPFQALLPRLLSRKVVYLHHVR
SHTNLPDPISRLNALTDALLITPVLQLSPADLHSFTHCGQTALTLQGATTTEASNILRSC
HACRKNNPQHQMPQGHIRRGLLPNHIWQGDITHFKYKNTLYRLHVWVDTFSGAISATQKR
KETSSEAISSLLQAIAYLGKPSYINTDNGPAYISQDFLNMCTSLAIRHTTHVPYNPTSSG
LVERSNGILKTLLYKYFTDKPDLPMDNALSIALWTINHLNVLTNCHKTRWQLHHSPRLQP
IPETHSLSNKQTHWYYFKLPGLNSRQWKGPQEALQEAAGAALIPVSASSAQWIPWRLLKR
AACPRPVGGPADPKEKDHQHHG
NT seq 4389 nt
atgggccaaatcttttcccgtagcgctagccctattccgcggccgccccgggggctggcc
gctcatcactggcttaacttcctccaggcggcatatcgcctagaacccggtccctccagt
tacgatttccaccagttaaaaaaatttcttaaaatagctttagaaacaccggtctggatc
tgccccattaactactccctcctagccagcctactcccaaaaggataccccggccgggtg
aatgaaattttacacatactcatccaaacccaagcccagatcccgtcccgccccgcgccg
ccgccgccgtcatcctccacccacgaccccccggattctgacccacaaatcccccctccc
tatgttgagcctacagccccccaagtccttccagtcatgcacccacatggtgcccctccc
aaccaccgcccatggcaaatgaaagacctacaggccattaagcaagaagtctcccaagcg
gcccctggaagcccccagtttatgcagaccatccggcttgcggtgcagcagtttgacccc
actgccaaagacctccaagacctcctgcagtacctttgctcctccctcgtggcttccctc
catcaccagcagctagatagccttatatcagaggccgaaactcgaggtattacaggttat
aaccccttagccggtcccctccgtgtccaagccaacaatccacaacaacaaggattaagg
cgagaataccagcaactctggctcgccgccttcgccgccctgccagggagtgccaaagac
ccttcctgggcctctatcctccaaggcctggaggagccttaccacgccttcgtagaacgc
ctcaacatagctcttgacaatgggctgccagaaggcacgcccaaagaccccattttacgt
tccttagcctactctaatgcaaacaaagaatgccaaaaattactacaggcccgagggcac
actaatagccctctaggagatatgttgcgggcttgtcaggcctggacccccaaagacaaa
accaaagtgttagttgtccagcctaaaaaaccccccccaaatcagccgtgcttccggtgc
gggaaagcaggccactggagtcgggactgcactcagcctcgtcctccccctgggccatgc
cccctatgtcaagatccaactcactggaagcgagactgcccccgcctaaagcccactatc
ccagaaccagagccagaggaggatgccctcctattagatctccccgccgacatcccacac
ccaaaaaacctccatagggggggaggtttaacctccccccccacattacagcaagtcctt
cctaaccaagacccaacatctattctgccagttataccgttagatcccgcccgtcggccc
gtaattaaagcccagattgacacccagaccagccacccaaagactatcgaagctctacta
gatacaggagcagacatgacagtccttccgatagccttgttctcaagtaatactcccctc
aaaaacacatccgtgttaggggcagggggccaaacccaagatcactttaagctcacctcc
cttcctgtgctaatacgcctccctttccggacgacgcctattgttttaacatcttgccta
gttgataccaaaaacaactgggccatcataggtcgtgatgccttacaacaatgccaaggc
gtcctgtacctccctgaggcaaaaaggccgcctgtaatcttgccaatacaggcgccagct
gtccttgggctagaacacctcccaaggccccccgaaatcagccagttccctttaaaccca
gaacgcctccaggccttgcaacacttggtccggaaggccctggaggcaggccatatcgaa
ccctacaccgggccaggaaataacccagtattcccagttaaaaaagccaatggaacctgg
cgattcatccacgacctgcgggccactaactctctaaccatagatctctcatcatcttcc
cccgggccccctgacttgtccagcctgccaactacactagcccacttacaaactatagac
cttaaagacgcctttttccaaatccccctacctaaacagttccagccctactttgctttc
actgtcccacagcagtgtaactacggccccggcactagatacgcctggagagtactaccc
caagggtttaaaaatagtcccaccctgttcgaaatgcagctggcccatatcctgcagccc
attcggcaagccttcccccaatgcactattcttcagtacatggatgacattctcctggca
agcccctcccatgcggacctgcaactactctcagaggccacaatggcttccctaatctcc
catgggttgcctgtgtccgaaaacaaaacccagcaaacccctggaacaattaagttccta
gggcaaataatttcacctaatcacctcacttatgatgcagtccccaaggtacctatacgg
tcccgctgggcgctacctgaacttcaagccctacttggcgagattcagtgggtctccaaa
ggaactcctaccttacgccagccccttcacagtctctactgtgccttacaaaggcatact
gatccccgagaccaaatatatttaaatccttctcaagttcaatcattagtgcagctgcgg
caggccctgtcacagaactgccgcagtagactagtccaaaccctgcccctcctaggggct
attatgctgaccctcactggcaccaccactgtggtgttccagtccaagcagcagtggcca
cttgtctggctacatgcccccctaccccacactagccagtgcccctgggggcagctactt
gcctcagctgtgttattactcgacaaatacaccttgcaatcctatggactactctgccaa
accatacatcataacatctccacccaaaccttcaaccaattcattcaaacatctgaccac
cccagtgttcctatcttactccaccacagtcaccgattcaaaaatttaggtgcccagact
ggagaactttggaacacttttcttaaaacaactgccccattggctcctgtgaaagccctt
atgccagtgtttactctttcccctgtgatcataaacaccgccccttgcctgttttcagac
ggatccacctcccaggcagcctatattctctgggacaagcatatattgtcacaaagatca
ttcccccttccgccaccgcacaagtcggcccaacgggccgaacttctcggacttttgcat
ggcctctccagcgcccgttcgtggcgctgtctcaacatatttctagactccaagtatctt
tatcattaccttcggacccttgccctaggcaccttccaaggcaggtcctctcaggccccc
tttcaggccctcctgccccgcttactatcgcgtaaggtcgtctatttgcaccacgttcgc
agccataccaatctacctgatcccatctccaggctcaacgctctcacagatgccctacta
atcacccctgtcctgcagctctctcctgcagacctacacagtttcacccattgcggacag
acggccctcacactgcaaggggcaaccacaactgaggcctccaatatcctgcgctcttgc
cacgcctgccgcaaaaataacccacaacatcagatgcctcaaggacacatccgccgtggc
ctactccctaaccacatctggcaaggcgacattacccatttcaaatataaaaatacactg
tatcgccttcatgtatgggtagacaccttttcaggagccatctcagctacccaaaagaga
aaagaaacaagctcagaagctatttcctctttgctccaggccattgcctatctaggcaag
cctagctacataaacacagacaatggccctgcctatatttcccaagacttcctcaatatg
tgtacctcccttgctattcgccatactacccatgtcccctacaatccaaccagctccgga
cttgtagaacgctctaatggcattcttaaaaccctattatataagtactttactgacaaa
cccgacctacctatggataatgctctatccatagccctatggacaatcaaccacctaaat
gtattaaccaactgccacaaaacccgatggcagcttcaccactccccccgactccagccg
atcccagagacacattccctcagcaataaacaaacccattggtattatttcaagcttcct
ggtcttaatagccgccagtggaaaggaccacaggaggctcttcaagaagctgccggcgct
gctctcatcccggtaagcgctagttctgcccagtggatcccgtggaggctcctcaagcga
gctgcatgcccaagacccgtcggaggccccgccgatcccaaagaaaaagaccaccaacac
catgggtaa
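The two sequence headers in this entry (1462 aa and 4389 nt) can be cross-checked with a small calculation, sketched here under the assumption that the nucleotide sequence includes the terminal stop codon. The function name `protein_length` is hypothetical.

```python
STOP_CODONS = {"taa", "tag", "tga"}

def protein_length(nt_length: int, last_codon: str) -> int:
    """Return the expected amino-acid count for a coding sequence of nt_length bases."""
    if nt_length % 3 != 0:
        raise ValueError("coding sequence length is not a multiple of 3")
    codons = nt_length // 3
    # A terminal stop codon is part of the CDS but does not encode a residue.
    return codons - 1 if last_codon.lower() in STOP_CODONS else codons

# The final line of the NT seq block ends in 'taa'.
aa_count = protein_length(4389, "taa")
print(aa_count)
```

With 4389 nt there are 1463 codons; the last one (`taa`) is a stop, leaving 1462 residues, which agrees with the AA seq header.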