KOALA logo


Query Data Input

KEGG2 logo Automatic annotation and
KEGG mapping service


KOALA job status 2022/08/18 16:50:51 (GMT+9)
Number of jobs in the queue1320
Submission of last completed job2022/08/18 12:25:552022/08/18 16:17:582022/08/18 15:51:51

KOALA (KEGG Orthology And Links Annotation) is KEGG's internal annotation tool for K number assignment of KEGG GENES using SSEARCH computation. BlastKOALA and GhostKOALA assign K numbers to the user's sequence data by BLAST and GHOSTX searches, respectively, against a nonredundant set of KEGG GENES. KofamKOALA is a new member of the KOALA family available at GenomeNet using the HMM profile search, rather than the sequence similarity search, for K number assignment.    See Step-by-step Instructions.

Reference: Kanehisa, M., Sato, Y., and Morishima, K. (2016) BlastKOALA and GhostKOALA: KEGG tools for functional characterization of genome and metagenome sequences. J. Mol. Biol. 428, 726-731. [pubmed] [pdf]

GhostKOALA accepts a larger dataset and is suitable for annotating metagenomes

Upload query amino acid sequences in FASTA format
Enter FASTA sequences

Or upload file:
Your query data consisting of multiple amino acid sequences will be given K numbers by GhostKOALA.
The file size of up to 300 MB with the limit of 500,000 sequences may be uploaded.

Enter KEGG GENES database file to be searched
genus_prokaryotes + family_eukaryotes
genus_prokaryotes + family_eukaryotes + viruses

The database files for GhostKOALA are somewhat different from those for BlastKOALA. For each group of KEGG organisms at the genus or family level, a nonredundant dataset is generated by taking all protein-coding genes from the representative genome and additional genes from the other genomes with two criteria. One is the same as in BlastKOALA, different K numbers, and the other is unique to GhostKOALA, different CD-HIT clusters, which are computed with 50% identity cutoff. In addition, the database file for viruses is created by CD-HIT wth 90% identitity cutoff from the viruses category of KEGG GENES. These additions are meant for analyzing taxonomic compositions of metagenomes.

Enter your email address

An email will be sent to you for confirmation of your input data. You will have to click on the link in the email to initiate your job. When the job is finished, you will receive another email for browsing the result and performing KEGG Mapper analysis. You cannot request another job until the current one is finished or canceled.    Notice: Your email address will not be used for any other purpose.

Last updated: May 15, 2019
[ KEGG | Kanehisa Laboratories ]
[ Kyoto University Bioinformtics Center ]