
BlastKOALA

Query Data Input

Automatic annotation and KEGG mapping service

Due to the problems caused by the new job submission system, BlastKOALA/GhostKOALA jobs are not currently accepted.


KOALA (KEGG Orthology And Links Annotation) is KEGG's internal annotation tool for K number assignment of KEGG GENES using SSEARCH computation. BlastKOALA and GhostKOALA assign K numbers to the user's sequence data by BLAST and GHOSTX searches, respectively, against a nonredundant set of KEGG GENES. KofamKOALA is a new member of the KOALA family, available at GenomeNet, that assigns K numbers by HMM profile search rather than by sequence similarity search. See Step-by-step Instructions.

Reference: Kanehisa, M., Sato, Y., and Morishima, K. (2016) BlastKOALA and GhostKOALA: KEGG tools for functional characterization of genome and metagenome sequences. J. Mol. Biol. 428, 726-731.



BlastKOALA accepts a smaller dataset and is suitable for annotating high-quality genomes

Upload query amino acid sequences in FASTA format
Enter FASTA sequences

Or upload file:
Your query data consisting of multiple amino acid sequences will be given K numbers by BlastKOALA.
Up to ten thousand sequences may be uploaded (see table below).
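
Before uploading, it can help to count the sequences in the query locally. The following is a minimal sketch, not part of BlastKOALA itself: the function names are hypothetical, and the default limit of 10000 is taken from the "ten thousand" figure above (the actual limit depends on the database selected).

```python
def count_fasta_records(text):
    """Count FASTA records by counting header lines (lines starting with '>')."""
    return sum(1 for line in text.splitlines() if line.startswith(">"))


def check_query_size(text, limit=10000):
    """Return the number of sequences, or raise ValueError if the query
    is empty or exceeds the assumed limit (hypothetical helper)."""
    n = count_fasta_records(text)
    if n == 0:
        raise ValueError("no FASTA header lines found")
    if n > limit:
        raise ValueError(f"{n} sequences exceed the limit of {limit}")
    return n


if __name__ == "__main__":
    example = ">seq1\nMKVLAAGIVG\n>seq2\nMSTNPKPQRK\n"
    print(check_query_size(example))
```

This only checks the record count; it does not verify that the sequences are amino acid sequences.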


Enter taxonomy group of your genome
Not known   Prokaryotes   Bacteria   Archaea
Eukaryotes   Animals   Plants   Fungi   Protists
Taxonomy ID

Taxonomy group information is used in the scoring scheme for K number assignment. Enter NCBI taxonomy ID of your genome, if known, at the species, genus, family or any other level, which will be converted to an appropriate KEGG taxonomy group currently defined. Alternatively, just select a more generic group name shown.


Enter KEGG GENES database file to be searched
family_eukaryotes
genus_eukaryotes
genus_prokaryotes
species_prokaryotes
family_eukaryotes + genus_prokaryotes
DB size (2019/11/24) and query data limit

                          eukaryotes               prokaryotes
                          DB size    query limit   DB size    query limit
organism                  9743948    -             18916537   -
species                   9666116    -             13405245   5000
genus                     7679787    7500          4794866    10000
family                    5194253    10000
family_euk + genus_prok   9989119    5000
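
The selectable databases and their query limits can be expressed as a small lookup. This is a sketch under assumptions: the numbers are read from the table above as of 2019/11/24, and the dictionary and function names are hypothetical rather than anything BlastKOALA exposes.

```python
# (DB size, query limit) per selectable database, as read from the table.
DATABASES = {
    "family_eukaryotes": (5194253, 10000),
    "genus_eukaryotes": (7679787, 7500),
    "genus_prokaryotes": (4794866, 10000),
    "species_prokaryotes": (13405245, 5000),
    "family_eukaryotes + genus_prokaryotes": (9989119, 5000),
}


def query_limit(db_name):
    """Return the maximum number of query sequences for a database."""
    return DATABASES[db_name][1]


def db_size(db_name):
    """Return the listed size of a database."""
    return DATABASES[db_name][0]


if __name__ == "__main__":
    combined = "family_eukaryotes + genus_prokaryotes"
    # The combined size equals the sum of its two component databases.
    print(db_size(combined) == db_size("family_eukaryotes") + db_size("genus_prokaryotes"))
    print(query_limit("genus_eukaryotes"))
```

Note that larger databases tend to carry smaller query limits, so the choice of database constrains how many sequences can be submitted in one job.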

The database files are generated from KEGG GENES as a collection of representative genomes by removing similar organisms at the species, genus or family level. When multiple members are present in each species/genus/family group, the first genome is taken as a representative genome. When the other members in the group contain different K numbers that are not present in the representative genome, those genes are added as if they are present in additional chromosomes or plasmids.
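
The selection rule described above can be sketched as follows. This is an illustration only, not the actual KEGG procedure: the function name, the data layout, and the identifiers in the example are hypothetical, and the sketch assumes that a K number contributed by one non-representative member is not added again from a later member.

```python
def build_representative_set(members):
    """Build the gene set for one species/genus/family group.

    members: ordered list of (genome_id, genes), where genes is a list of
    (gene_id, k_number) pairs and k_number is None for unannotated genes.
    Returns a list of (genome_id, gene_id, k_number) tuples.
    """
    if not members:
        return []

    # The first genome in the group is the representative; keep all its genes.
    rep_id, rep_genes = members[0]
    selected = [(rep_id, gene_id, k) for gene_id, k in rep_genes]
    seen_k = {k for _, k in rep_genes if k is not None}

    # From the other members, add only genes whose K number is not yet present.
    for genome_id, genes in members[1:]:
        for gene_id, k in genes:
            if k is not None and k not in seen_k:
                selected.append((genome_id, gene_id, k))
                seen_k.add(k)
    return selected


if __name__ == "__main__":
    group = [
        ("genomeA", [("a1", "K00001"), ("a2", None)]),
        ("genomeB", [("b1", "K00001"), ("b2", "K00002")]),
    ]
    for row in build_representative_set(group):
        print(row)
```

In the example, genomeB contributes only gene b2, because K00001 is already covered by the representative genomeA.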


Enter your email address

An email will be sent to you for confirmation of your input data. You will have to click on the link in the email to initiate your job. When the job is finished, you will receive another email for browsing the result and performing KEGG Mapper analysis. You cannot request another job until the current one is finished or canceled.

Notice: Your email address will not be used for any other purpose.


Last updated: May 15, 2019
[ KEGG | Kanehisa Laboratories ]
[ Kyoto University Bioinformatics Center ]