LEMMI: a continuous benchmarking platform for metagenomics classifiers

Mathieu Seppey; Mosè Manni; Evgeny M Zdobnov

doi:10.1101/gr.260398.119

LEMMI: a continuous benchmarking platform for metagenomics classifiers

Genome Res. 2020 Aug;30(8):1208-1216. doi: 10.1101/gr.260398.119. Epub 2020 Jul 2.

Authors

Mathieu Seppey¹, Mosè Manni¹, Evgeny M Zdobnov¹

Affiliation

¹ Department of Genetic Medicine and Development, University of Geneva Medical School and Swiss Institute of Bioinformatics, 1211 Geneva, Switzerland.

Abstract

Studies of microbiomes are booming, along with the diversity of computational approaches to make sense out of the sequencing data and the volumes of accumulated microbial genotypes. A swift evaluation of newly published methods and their improvements against established tools is necessary to reduce the time between the methods' release and their adoption in microbiome analyses. The LEMMI platform offers a novel approach for benchmarking software dedicated to metagenome composition assessments based on read classification. It enables the integration of newly published methods in an independent and centralized benchmark designed to be continuously open to new submissions. This allows developers to be proactive regarding comparative evaluations and guarantees that any promising methods can be assessed side by side with established tools quickly after their release. Moreover, LEMMI enforces an effective distribution through software containers to ensure long-term availability of all methods. Here, we detail the LEMMI workflow and discuss the performances of some previously unevaluated tools. We see this platform eventually as a community-driven effort in which method developers can showcase novel approaches and get unbiased benchmarks for publications, and users can make informed choices and obtain standardized and easy-to-use tools.

Publication types

Research Support, Non-U.S. Gov't

MeSH terms

Algorithms
Bacteria / classification*
Bacteria / genetics*
Benchmarking / methods
Computational Biology / methods*
Genome, Bacterial / genetics*
High-Throughput Nucleotide Sequencing / methods
Metagenome / genetics
Metagenomics / methods*
Microbiota / genetics
Sequence Analysis, DNA / methods
Software