Critical evaluation of short, long, and hybrid assembly for contextual analysis of antibiotic resistance genes in complex environmental metagenomes

Sci Rep. 2021 Feb 12;11(1):3753. doi: 10.1038/s41598-021-83081-8.

Abstract

In the fight to limit the global spread of antibiotic resistance, the assembly of environmental metagenomes has the potential to provide rich contextual information (e.g., taxonomic hosts, carriage on mobile genetic elements) about antibiotic resistance genes (ARGs) in the environment. However, computational challenges associated with assembly can impact the accuracy of downstream analyses. This work critically evaluates the impact of assembly leveraging short reads, nanopore MinION long reads, and a combination of the two (hybrid) on ARG contextualization for ten environmental metagenomes using seven prominent assemblers (IDBA-UD, MEGAHIT, Canu, Flye, OPERA-MS, metaSPAdes, and hybridSPAdes). While short-read and hybrid assemblies produced similar patterns of ARG contextualization, raw or assembled long nanopore reads produced distinct patterns. Based on an in silico spike-in experiment using real and simulated reads, we show that low- to intermediate-coverage species are more likely to be incorporated into chimeric contigs across all assemblers and sequencing technologies, while more abundant species produce assemblies with a greater frequency of inversions and insertions/deletions (indels). In sum, our analyses support hybrid assembly as a valuable technique for boosting the reliability and accuracy of assembly-based analyses of ARGs and neighboring genes at environmentally relevant coverages, provided that sufficient short-read sequencing depth is achieved.
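
The abstract does not say how coverage levels were defined or set in the spike-in experiment. As a rough illustration of what "coverage" means for a spiked-in genome, the sketch below assumes the standard relation (mean depth = total sequenced bases / genome length). The function names and all numbers are hypothetical and are not taken from the study.

```python
import math


def expected_coverage(n_reads: int, mean_read_len: float, genome_size: int) -> float:
    """Mean depth of coverage: total sequenced bases divided by genome length."""
    if genome_size <= 0:
        raise ValueError("genome_size must be positive")
    return n_reads * mean_read_len / genome_size


def reads_for_coverage(target_cov: float, mean_read_len: float, genome_size: int) -> int:
    """Smallest number of reads whose expected coverage reaches target_cov."""
    if mean_read_len <= 0:
        raise ValueError("mean_read_len must be positive")
    return math.ceil(target_cov * genome_size / mean_read_len)


# Illustrative values only (assumptions, not figures from the paper):
# a 5 Mb genome spiked in at 10x with 150 bp short reads or 5 kb long reads.
genome_size = 5_000_000
short_reads_needed = reads_for_coverage(10, 150, genome_size)
long_reads_needed = reads_for_coverage(10, 5_000, genome_size)

print(short_reads_needed, long_reads_needed)
```

Under these assumed values, far fewer long reads than short reads are needed to reach the same nominal depth. That is one reason coverage is usually reported separately for each sequencing technology.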

Publication types

  • Research Support, Non-U.S. Gov't
  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Anti-Bacterial Agents / pharmacology
  • Computational Biology / methods
  • Drug Resistance, Microbial / genetics*
  • Environmental Microbiology
  • Environmental Monitoring / methods
  • Genome, Bacterial / drug effects
  • High-Throughput Nucleotide Sequencing / methods
  • Metagenome / drug effects
  • Metagenome / genetics*
  • Metagenomics / methods
  • Reproducibility of Results
  • Sequence Analysis, DNA / methods
  • Wastewater / analysis*

Substances

  • Anti-Bacterial Agents
  • Waste Water