STRONG: metagenomics strain resolution on assembly graphs

Genome Biol. 2021 Jul 26;22(1):214. doi: 10.1186/s13059-021-02419-7.

Abstract

We introduce STrain Resolution ON assembly Graphs (STRONG), which identifies strains de novo, from multiple metagenome samples. STRONG performs coassembly, and binning into metagenome assembled genomes (MAGs), and stores the coassembly graph prior to variant simplification. This enables the subgraphs and their unitig per-sample coverages, for individual single-copy core genes (SCGs) in each MAG, to be extracted. A Bayesian algorithm, BayesPaths, determines the number of strains present, their haplotypes or sequences on the SCGs, and abundances. STRONG is validated using synthetic communities and for a real anaerobic digestor time series generates haplotypes that match those observed from long Nanopore reads.

Keywords: Assembly graph; Bayesian; Metagenome; Microbial community; Microbiome; Strains.

Publication types

  • Research Support, N.I.H., Intramural
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms*
  • Bayes Theorem
  • Contig Mapping
  • Genome, Bacterial*
  • Haplotypes
  • Metagenome*
  • Metagenomics / methods
  • Microbial Consortia / genetics*
  • Sequence Analysis, DNA
  • Software*