Metagenomic Sequencing of Multiple Soil Horizons and Sites in Close Vicinity Revealed Novel Secondary Metabolite Diversity

mSystems. 2021 Oct 26;6(5):e0101821. doi: 10.1128/mSystems.01018-21. Epub 2021 Oct 12.

Abstract

Discovery of novel antibiotics is crucial for combating rapidly spreading antimicrobial resistance and new infectious diseases. Most of the clinically used antibiotics are natural products-secondary metabolites produced by soil microbes that can be cultured in the lab. Rediscovery of these secondary metabolites during discovery expeditions costs both time and resources. Metagenomics approaches can overcome this challenge by capturing both culturable and unculturable hidden microbial diversity. To be effective, such an approach should address questions like the following. Which sequencing method is better at capturing the microbial diversity and biosynthesis potential? What part of the soil should be sampled? Can patterns and correlations from such big-data explorations guide future novel natural product discovery surveys? Here, we address these questions by a paired amplicon and shotgun metagenomic sequencing survey of samples from soil horizons of multiple forest sites very close to each other. Metagenome mining identified numerous novel biosynthetic gene clusters (BGCs) and enzymatic domain sequences. Hybrid assembly of both long reads and short reads improved the metagenomic assembly and resulted in better BGC annotations. A higher percentage of novel domains was recovered from shotgun metagenome data sets than from amplicon data sets. Overall, in addition to revealing the biosynthetic potential of soil microbes, our results suggest the importance of sampling not only different soils but also their horizons to capture microbial and biosynthetic diversity and highlight the merits of metagenome sequencing methods. IMPORTANCE This study helped uncover the biosynthesis potential of forest soils via exploration of shotgun metagenome and amplicon sequencing methods and showed that both methods are needed to expose the full microbial diversity in soil. Based on our metagenome mining results, we suggest revising the historical strategy of sampling soils from far-flung places, as we found a significant number of novel and diverse BGCs and domains even in different soils that are very close to each other. Furthermore, sampling of different soil horizons can reveal the additional diversity that often remains hidden and is mainly caused by differences in environmental key parameters such as soil pH and nutrient content. This paired metagenomic survey identified diversity patterns and correlations, a step toward developing a rational approach for future natural product discovery surveys.

Keywords: Oxford Nanopore; amplicon sequencing; biosynthetic gene clusters; metagenome; natural products; secondary metabolites; soil horizons.