Large-scale quality assessment of prokaryotic genomes with metashot/prok-quality

F1000Res. 2021 Aug 17:10:822. doi: 10.12688/f1000research.54418.1. eCollection 2021.

Abstract

Metagenomic sequencing allows large-scale identification and genomic characterization. Binning is the process of recovering genomes from complex mixtures of sequence fragments (metagenome contigs) of unknown bacteria and archaeal species. Assessing the quality of genomes recovered from metagenomes requires the use of complex pipelines involving many independent steps, often difficult to reproduce and maintain. A comprehensive, automated and easy-to-use computational workflow for the quality assessment of draft prokaryotic genomes, based on container technology, would greatly improve reproducibility and reusability of published results. We present metashot/prok-quality, a container-enabled Nextflow pipeline for quality assessment and genome dereplication. The metashot/prok-quality tool produces genome quality reports that are compliant with the Minimum Information about a Metagenome-Assembled Genome (MIMAG) standard, and can run out-of-the-box on any platform that supports Nextflow, Docker or Singularity, including computing clusters or batch infrastructures in the cloud. metashot/prok-quality is part of the metashot collection of analysis pipelines. Workflow and documentation are available under GPL3 licence on GitHub.

Keywords: MAG; MIMAG; completeness; contamination; dereplication; docker; genome quality; metagenome-assembled genome; nextflow.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Archaea / genetics
  • Metagenome*
  • Metagenomics* / methods
  • Prokaryotic Cells
  • Reproducibility of Results

Grants and funding

This work was supported by the Autonomous Province of Trento (Accordo di Programma).