Correlational networking guides the discovery of unclustered lanthipeptide protease-encoding genes

Nat Commun. 2022 Mar 28;13(1):1647. doi: 10.1038/s41467-022-29325-1.

Abstract

Bacterial natural product biosynthetic genes, canonically clustered, have been increasingly found to rely on hidden enzymes encoded elsewhere in the genome for completion of biosynthesis. The study and application of lanthipeptides are frequently hindered by unclustered protease genes required for final maturation. Here, we establish a global correlation network bridging the gap between lanthipeptide precursors and hidden proteases. Applying our analysis to 161,954 bacterial genomes, we establish 5,209 correlations between precursors and hidden proteases, with 91 prioritized. We use network predictions and co-expression analysis to reveal a previously missing protease for the maturation of the class I lanthipeptide paenilan. We further discover that widely distributed bacterial M16B metallopeptidases, whose biological function was previously unclear, form a new family of lanthipeptide proteases. We show that a pair of bifunctional M16B proteases with high substrate specificity is involved in the production of previously unreported class III lanthipeptides. Together, these results demonstrate the strength of our correlational networking approach for the discovery of hidden lanthipeptide proteases and, potentially, other missing enzymes for natural product biosynthesis.

Publication types

  • Research Support, U.S. Gov't, Non-P.H.S.
  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Bacteria
  • Endopeptidases
  • Genome, Bacterial* / genetics
  • Peptide Hydrolases* / genetics
  • Substrate Specificity

Substances

  • Endopeptidases
  • Peptide Hydrolases