The use of different 16S rRNA gene variable regions in biogeographical studies

Environ Microbiol Rep. 2023 Jun;15(3):216-228. doi: 10.1111/1758-2229.13145. Epub 2023 Feb 21.

Abstract

16S rRNA gene amplicon sequencing is routinely used in environmental surveys to identify microbial diversity and composition of the samples of interest. The dominant sequencing technology of the past decade (Illumina) is based on the sequencing of 16S rRNA hypervariable regions. Online sequence data repositories, which represent an invaluable resource for investigating microbial distributional patterns across spatial, environmental or temporal scales, contain amplicon datasets from diverse 16S rRNA gene variable regions. However, the utility of these sequence datasets is potentially reduced by the use of different 16S rRNA gene amplified regions. By comparing 10 Antarctic soil samples sequenced for five different 16S rRNA amplicons, we explore whether sequence data derived from diverse 16S rRNA variable regions can be validly used as a resource for biogeographical studies. Patterns of shared and unique taxa differed among samples as a result of variable taxonomic resolutions of the assessed 16S rRNA variable regions. However, our analyses also suggest that the use of multi-primer datasets for biogeographical studies of the domain Bacteria is a valid approach to explore bacterial biogeographical patterns due to the preservation of bacterial taxonomic and diversity patterns across different variable region datasets. We deem composite datasets useful for biogeographical studies.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Bacteria* / genetics
  • Genes, rRNA
  • Phylogeny
  • RNA, Ribosomal, 16S / genetics
  • Sequence Analysis, DNA

Substances

  • RNA, Ribosomal, 16S