The effect of taxonomic classification by full-length 16S rRNA sequencing with a synthetic long-read technology

Sci Rep. 2021 Jan 18;11(1):1727. doi: 10.1038/s41598-020-80826-9.

Abstract

Characterizing the microbial communities inhabiting specimens is one of the primary objectives of microbiome studies. A short-read sequencing platform for reading partial regions of the 16S rRNA gene is most commonly used by reducing the cost burden of next-generation sequencing (NGS), but misclassification at the species level due to its length being too short to consider sequence similarity remains a challenge. Loop Genomics recently proposed a new 16S full-length-based synthetic long-read sequencing technology (sFL16S). We compared a 16S full-length-based synthetic long-read (sFL16S) and V3-V4 short-read (V3V4) methods using 24 human GUT microbiota samples. Our comparison analyses of sFL16S and V3V4 sequencing data showed that they were highly similar at all classification resolutions except the species level. At the species level, we confirmed that sFL16S showed better resolutions than V3V4 in analyses of alpha-diversity, relative abundance frequency and identification accuracy. Furthermore, we demonstrated that sFL16S could overcome the microbial misidentification caused by different sequence similarity in each 16S variable region through comparison the identification accuracy of Bifidobacterium, Bacteroides, and Alistipes strains classified from both methods. Therefore, this study suggests that the new sFL16S method is a suitable tool to overcome the weakness of the V3V4 method.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Bacteria / classification*
  • Bacteria / genetics
  • Feces / microbiology
  • Gastrointestinal Microbiome / genetics*
  • High-Throughput Nucleotide Sequencing / methods
  • Humans
  • Metagenome
  • Microbiota / genetics
  • Phylogeny
  • RNA, Ribosomal, 16S / genetics*
  • Sequence Analysis, DNA / methods

Substances

  • RNA, Ribosomal, 16S