KOALA logo


Query Data Input

KEGG2 logo New service by KEGG for
metagenome annotation

BlastKOALA GhostKOALA Annotate Sequence Pathogen Checker

KOALA (KEGG Orthology And Links Annotation) is KEGG's internal annotation tool for K number assignment of KEGG GENES using SSEARCH computation. BlastKOALA and GhostKOALA assign K numbers to the user's sequence data by BLAST and GHOSTX searches, respectively, against a nonredundant set of KEGG GENES. Annotate Sequence in KEGG Mapper and Pathogen Checker in KEGG Pathogen are special interfaces to the BlastKOALA server and can be executed in an interactive mode.    See Step-by-step Instructions.

Reference: Kanehisa, M., Sato, Y., and Morishima, K. (2016) BlastKOALA and GhostKOALA: KEGG tools for functional characterization of genome and metagenome sequences. J. Mol. Biol. 428, 726-731. [pubmed] [pdf]

GhostKOALA accepts a larger dataset and is suitable for annotating metagenomes

Upload query amino acid sequences in FASTA format
Enter FASTA sequences

Or upload file:
Your query data consisting of multiple amino acid sequences will be given K numbers by GhostKOALA.
The file size of up to 300 MB (one million sequences with average length of 300 or three million sequences with average length of 100) may be uploaded.

Enter KEGG GENES database file to be searched
genus_prokaryotes + family_eukaryotes
genus_prokaryotes + family_eukaryotes + viruses

The database files for GhostKOALA are somewhat different from those for BlastKOALA. For each group of KEGG organisms at the genus or family level, a nonredundant dataset is generated by taking all protein-coding genes from the representative genome and additional genes from the other genomes with two criteria. One is the same as in BlastKOALA, different K numbers, and the other is unique to GhostKOALA, different CD-HIT clusters, which are computed with 50% identity cutoff. In addition, the database file for viruses is created by CD-HIT wth 90% identitity cutoff from the viruses category of KEGG GENES. These additions are meant for analyzing taxonomic compositions of metagenomes.

Enter your email address

An email will be sent to you for confirmation of your input data. You will have to click on the link in the email to initiate your job. When the job is finished, you will receive another email for browsing the result and performing KEGG Mapper analysis. You cannot request another job until the current one is finished or canceled.    Notice: Your email address will not be used for any other purpose.

Last updated: March 4, 2016
[ KEGG | Kanehisa Laboratories ]
[ Kyoto University Bioinformtics Center ]