Global analysis of the biosynthetic chemical space of marine prokaryotes

Microbiome. 2023 Jun 28;11(1):144. doi: 10.1186/s40168-023-01573-3.

Abstract

Background: Marine prokaryotes are a rich source of novel bioactive secondary metabolites for drug discovery. Recent genome mining studies have revealed their great potential to biosynthesize novel secondary metabolites. However, the biosynthetic chemical space encoded by marine prokaryotes has yet to be systematically evaluated.

Results: We first investigated the secondary metabolic potential of marine prokaryotes by analyzing the diversity and novelty of the biosynthetic gene clusters (BGCs) in 7,541 prokaryotic genomes from cultivated and single cells, along with 26,363 newly assembled medium-to-high-quality genomes from marine environmental samples. To quantitatively evaluate the unexplored biosynthetic chemical space of marine prokaryotes, the clustering thresholds for constructing the BGC and molecular networks were optimized, using the MIBiG database, so that gene cluster family (GCF)-encoded metabolites and molecular family (MF) scaffolds reached a similar level of chemical similarity. The global genome mining analysis demonstrated that the predicted 70,011 BGCs were organized into 24,536 GCFs, most of them new (99.5%), while the reported marine prokaryotic natural products were classified into only 778 MFs at the optimized clustering thresholds. The number of MF scaffolds is only 3.2% of the number of GCF-encoded scaffolds, suggesting that at least 96.8% of the secondary metabolic potential in marine prokaryotes is untapped. The unexplored biosynthetic chemical space of marine prokaryotes was illustrated by 88 potential novel antimicrobial peptides encoded by ribosomally synthesized and post-translationally modified peptide BGCs. Furthermore, a seawater-derived Aquimarina strain was selected to illustrate the diverse biosynthetic chemical space through untargeted metabolomics and genomics approaches, which identified the potential biosynthetic pathways of a group of novel polyketides and two known compounds (didemnilactone B and macrolactin A 15-ketone).
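
The 3.2% and 96.8% figures follow directly from the MF and GCF counts reported above. The short sketch below reproduces that arithmetic. It assumes, as the abstract appears to, that each MF and each GCF stands for one scaffold. The function name `scaffold_coverage` is hypothetical and is not taken from the study.

```python
def scaffold_coverage(n_mf: int, n_gcf: int) -> tuple[float, float]:
    """Return (explored %, untapped %) of GCF-encoded scaffolds.

    Assumption for illustration: one scaffold per molecular family (MF)
    and one per gene cluster family (GCF), so the ratio of the two counts
    approximates the explored share of the chemical space.
    """
    if n_gcf <= 0:
        raise ValueError("n_gcf must be positive")
    explored = 100.0 * n_mf / n_gcf
    return explored, 100.0 - explored


# Counts reported in the abstract: 778 MFs and 24,536 GCFs.
explored_pct, untapped_pct = scaffold_coverage(778, 24_536)
print(f"{explored_pct:.1f}% explored, {untapped_pct:.1f}% untapped")
# prints "3.2% explored, 96.8% untapped"
```

Because the counts depend on the clustering thresholds, the percentages hold only at the optimized thresholds described above.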

Conclusions: The present bioinformatics and cheminformatics analyses highlight the promising potential to explore the biosynthetic chemical diversity of marine prokaryotes and provide valuable knowledge for the targeted discovery and biosynthesis of novel marine prokaryotic natural products. Video Abstract.

Keywords: Biosynthetic gene clusters; Cheminformatics; Genomics; Marine prokaryotes; Secondary metabolite.

Publication types

  • Video-Audio Media
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Biological Products*
  • Biosynthetic Pathways / genetics
  • Computational Biology
  • Genomics*
  • Phylogeny
  • Secondary Metabolism / genetics

Substances

  • Biological Products