High-resolution microbiome analysis enabled by linking of 16S rRNA gene sequences with adjacent genomic contexts

Microb Genom. 2021 Sep;7(9):000624. doi: 10.1099/mgen.0.000624.

Abstract

Sequence-based characterization of bacterial communities has long been a hostage of limitations of both 16S rRNA gene and whole metagenome sequencing. Neither approach is universally applicable, and the main efforts to resolve constraints have been devoted to improvement of computational prediction tools. Here, we present semi-targeted 16S rRNA sequencing (st16S-seq), a method designed for sequencing V1-V2 regions of the 16S rRNA gene along with the genomic locus upstream of the gene. By in silico analysis of 13 570 bacterial genome assemblies, we show that genome-linked 16S rRNA sequencing is superior to individual hypervariable regions or full-length gene sequences in terms of classification accuracy and identification of gene copy numbers. Using mock communities and soil samples we experimentally validate st16S-seq and benchmark it against the established microbial classification techniques. We show that st16S-seq delivers accurate estimation of 16S rRNA gene copy numbers, enables taxonomic resolution at the species level and closely approximates community structures obtainable by whole metagenome sequencing.

Keywords: 16S rRNA; high-throughput microbiome profiling; semi-targeted sequencing; targeted DNA sequencing.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Bacteria / classification
  • Bacteria / genetics
  • Base Sequence
  • Computational Biology / methods
  • DNA, Bacterial / genetics
  • Genome, Bacterial*
  • Genomics*
  • High-Throughput Nucleotide Sequencing / methods
  • Metagenome
  • Microbiota / genetics*
  • Phylogeny
  • RNA, Ribosomal, 16S / genetics*
  • Sequence Analysis, DNA

Substances

  • DNA, Bacterial
  • RNA, Ribosomal, 16S