Evaluation of 16S rRNA Databases for Taxonomic Assignments Using Mock Community

Genomics Inform. 2018 Dec;16(4):e24. doi: 10.5808/GI.2018.16.4.e24. Epub 2018 Dec 28.

Abstract

Taxonomy identification is fundamental to all microbiology studies. Particularly in metagenomics, which identify the composition of microorganisms using thousands of sequences, its importance is even greater. Identification is inevitably affected by the choice of database. This study was conducted to evaluate the accuracy of three widely used 16S databases, Greengenes, Silva, and EzBioCloud, and to suggest basic guidelines for selecting reference databases. Using public mock community data, each database was used to assign taxonomy and to test its accuracy. We showed that EzBioCloud performs well compared to other existing databases.

Keywords: 16S rRNA database; accuracy; evaluation; identification; taxonomy.