KOALA logo


Query Data Input

KEGG2 logo New service by KEGG for
genome annotation

BlastKOALA GhostKOALA Annotate Sequence Pathogen Checker

KOALA (KEGG Orthology And Links Annotation) is KEGG's internal annotation tool for K number assignment of KEGG GENES using SSEARCH computation. BlastKOALA and GhostKOALA assign K numbers to the user's sequence data by BLAST and GHOSTX searches, respectively, against a nonredundant set of KEGG GENES. Annotate Sequence in KEGG Mapper and Pathogen Checker in KEGG Pathogen are special interfaces to the BlastKOALA server and can be executed in an interactive mode.    See Step-by-step Instructions.

Reference: Kanehisa, M., Sato, Y., and Morishima, K. (2016) BlastKOALA and GhostKOALA: KEGG tools for functional characterization of genome and metagenome sequences. J. Mol. Biol. 428, 726-731. [pubmed] [pdf]

BlastKOALA accepts a smaller dataset and is suitable for annotating high-quality genomes

Upload query amino acid sequences in FASTA format
Enter FASTA sequences

Or upload file:
Your query data consisting of multiple amino acid sequences will be given K numbers by BlastKOALA.
Up to ten thousand sequences may be uploaded (see table below).

Enter taxonomy group of your genome
Not known   Prokaryotes   Bacteria   Archaea
Eukaryotes   Animals   Plants   Fungi   Protists
Taxonomy ID

Taxonomy group information is used in the scoring scheme for K number assignment. Enter NCBI taxonomy ID of your genome, if known, at the species, genus, family or any other level, which will be converted to an appropriate KEGG taxonomy group currently defined. Alternatively, just select a more generic group name shown.

Enter KEGG GENES database file to be searched
family_eukaryotes + genus_prokaryotes
DB size (2017/1/15) and query data limit
family_euk + genus_prok66926465000

The database files are generated from KEGG GENES as a collection of representative genomes by removing similar organisms at the species, genus or family level. When multiple members are present in each species/genus/family group, the first genome is taken as a representative genome. When the other members in the group contain different K numbers that are not present in the representative genome, those genes are added as if they are present in additional chromosomes or plasmids.

Enter your email address

An email will be sent to you for confirmation of your input data. You will have to click on the link in the email to initiate your job. When the job is finished, you will receive another email for browsing the result and performing KEGG Mapper analysis. You cannot request another job until the current one is finished or canceled.    Notice: Your email address will not be used for any other purpose.

Last updated: March 4, 2016
[ KEGG | Kanehisa Laboratories ]
[ Kyoto University Bioinformtics Center ]