KEGG   Giardia lamblia: GL50803_008741
Entry
GL50803_008741    CDS       T01047                                 
Name
(RefSeq) Dipeptidyl-peptidase I
  KO
K01275  cathepsin C [EC:3.4.14.1]
Organism
gla  Giardia lamblia
Pathway
gla04142  Lysosome
Brite
KEGG Orthology (KO) [BR:gla00001]
 09140 Cellular Processes
  09141 Transport and catabolism
   04142 Lysosome
    GL50803_008741
 09180 Brite Hierarchies
  09181 Protein families: metabolism
   01002 Peptidases and inhibitors [BR:gla01002]
    GL50803_008741
  09182 Protein families: genetic information processing
   03110 Chaperones and folding catalysts [BR:gla03110]
    GL50803_008741
  09183 Protein families: signaling and cellular processes
   04147 Exosome [BR:gla04147]
    GL50803_008741
Enzymes [BR:gla01000]
 3. Hydrolases
  3.4  Acting on peptide bonds (peptidases)
   3.4.14  Dipeptidyl-peptidases and tripeptidyl-peptidases
    3.4.14.1  dipeptidyl-peptidase I
     GL50803_008741
Peptidases and inhibitors [BR:gla01002]
 Cysteine peptidases
  Family C1: papain family
   GL50803_008741
Chaperones and folding catalysts [BR:gla03110]
 Intramolecular chaperones
  Papain family
   GL50803_008741
Exosome [BR:gla04147]
 Exosomal proteins
  Exosomal proteins of other body fluids (saliva and urine)
   GL50803_008741
SSDB
Motif
Pfam: Peptidase_C1 CathepsinC_exc Peptidase_C1_2
Other DBs
NCBI-GeneID: 5702642
NCBI-ProteinID: XP_001709718
UniProt: A8B4X1
LinkDB
Position
4:complement(960168..961853)
AA seq 561 aa
MAGLLLCAIFIWLASADTPCWCANDQVLGTWKIESTGFKYTITNDRTSCPASIRVKETRM
ITLLSPNVAVDEDTGASGTWSQVYSQAIQINIGDLKYLYFLAWEDVPDSSTVHSMCYKSQ
PEMGWAVKQGMMRRYRACIRATNVKPLLTTVDNYYEPSGPGPVNPTLIRRKLDVSVPTGD
MYIQMKKNGDVTNIPAGFRQSFSHSLFPTGAAAENSNSSYRGDKLPKNFDWRSVNGKSYV
PEPFDQGHCGSCYTAATVWAMTARVMVASEDEDKLGATRRLSVQHALDCNQYAQGCSGGF
AEMVVKFAEEFGILTENSYYISYLSGDGVERPCKAGKFLEGDRYFFTAGLPLGGYTGAVT
DPEEIKWEVYRHGPLPVSVYAGNDLFKNCSPYGPSSRAAYTDDDDVSANDKAKRHYFAEY
LDHLIFIIGWEEDENGVAYWRIQNSWGADWCDGGTSRIALGRNEYGIETAPVPFYWWRNG
RVYYDEGLVAVSLAEIIILPVAIVVLAGLIIGLSFVIAIMKKNRRLKYQRLRDEQTTHYE
FTSTALDSTVPESTTDPSSYQ
NT seq 1686 nt   +upstreamnt  +downstreamnt
atggcaggtctgcttctctgcgccatctttatctggcttgccagtgcagacactccttgc
tggtgcgctaatgaccaagttctcgggacatggaagatagaatccactggctttaagtac
acaataacgaatgatagaacaagttgtcctgcaagtatacgtgtaaaagagaccagaatg
ataacgctcttgtcacccaacgttgcagtcgatgaggacacaggagccagcgggacctgg
tcgcaagtttactctcaggccatacagattaacattggcgacttgaagtatctctacttt
ttggcctgggaggatgtgccagactcttctactgttcattccatgtgctacaagtcccag
ccagagatggggtgggccgtgaaacagggcatgatgcggcgctacagggcctgcatacgc
gccaccaatgtcaagcctctcctgaccacagttgacaattattatgagccgtctggcccg
gggccggtcaaccccactcttattagaagaaaactagacgtgtccgtcccaacgggagac
atgtacatccaaatgaaaaagaacggtgacgtcactaatataccagcaggcttccgacag
tccttttcacatagcttgttccccacgggtgcagctgccgagaatagtaactcatcgtac
agaggagataagctccccaagaacttcgattggagatctgtcaatggtaagtcatatgtc
cctgaaccatttgaccagggccactgcgggtcgtgctatacagctgccaccgtctgggcc
atgacagcacgtgtcatggttgctagtgaggacgaggacaagctcggggcgaccagacga
ctctctgttcagcacgcccttgactgtaaccagtatgctcaagggtgcagtggcggcttt
gccgagatggtagttaagtttgcagaagagttcgggatccttacagagaacagctattac
atatcgtacctatctggagacggtgtagagcgaccgtgcaaggcaggtaagttcctagaa
ggggaccgctacttcttcactgcaggccttcctcttgggggctatacaggcgccgttaca
gatcctgaagaaatcaagtgggaagtctatcgtcatggtcccttaccagtctccgtttat
gcaggtaatgatttgtttaagaactgctcgccgtacgggccatcatctagggctgcctat
actgatgatgatgacgtcagtgcaaacgataaggccaagcgtcactactttgcggagtat
cttgaccatcttatctttatcataggttgggaggaagacgagaacggtgttgcttactgg
agaattcaaaattcctggggtgcagattggtgtgacggcggtacttctcggattgcattg
ggccgaaatgaatatgggattgagacagcgccggttccattttattggtggcgaaatgga
agggtctattacgacgagggattggtggctgtgtcactagcagagatcatcattcttcca
gtagccattgtcgtcctggctggactgattattggcctatccttcgtcattgccataatg
aagaagaatagacgccttaagtatcagaggctgcgggatgagcagaccactcactatgag
ttcacatcaacagctctagactcaacggttccagaatcaacgactgatccatcctcttat
cagtga

DBGET integrated database retrieval system