16S rRNA Gene Copy Number Normalization Does Not Provide More Reliable Conclusions in Metataxonomic Surveys

Microb Ecol. 2021 Feb;81(2):535-539. doi: 10.1007/s00248-020-01586-7. Epub 2020 Aug 29.

Abstract

Sequencing 16S rRNA gene amplicons is the gold standard to uncover the composition of prokaryotic communities. The presence of multiple copies of this gene makes the community abundance data distorted and gene copy normalization (GCN) necessary for correction. Even though GCN of 16S data provided a picture closer to the metagenome before, it should also be compared with communities of known composition due to the fact that library preparation is prone to methodological biases. Here, we process 16S rRNA gene amplicon data from eleven simple mock communities with DADA2 and estimate the impact of GCN. In all cases, the mock community composition derived from the 16S sequencing differs from those expected, and GCN fails to improve the classification for most of the analysed communities. Our approach provides empirical evidence that GCN does not improve the 16S target sequencing analyses in real scenarios. We therefore question the use of GCN for metataxonomic surveys until a more comprehensive catalogue of copy numbers becomes available.

Keywords: 16S rRNA; Gene; Metataxonomic surveys.

MeSH terms

  • Gene Dosage
  • Gene Library
  • Metagenome / genetics
  • Metagenomics / standards*
  • Microbiota / genetics*
  • RNA, Ribosomal, 16S / genetics*

Substances

  • RNA, Ribosomal, 16S