Uncovering secondary metabolite evolution and biosynthesis using gene cluster networks and genetic dereplication

Sci Rep. 2018 Dec 18;8(1):17957. doi: 10.1038/s41598-018-36561-3.

Abstract

The increased interest in secondary metabolites (SMs) has driven a number of genome sequencing projects to elucidate their biosynthetic pathways. As a result, studies revealed that the number of secondary metabolite gene clusters (SMGCs) greatly outnumbers detected compounds, challenging current methods to dereplicate and categorize this amount of gene clusters on a larger scale. Here, we present an automated workflow for the genetic dereplication and analysis of secondary metabolism genes in fungi. Focusing on the secondary metabolite rich genus Aspergillus, we categorize SMGCs across genomes into SMGC families using network analysis. Our method elucidates the diversity and dynamics of secondary metabolism in section Nigri, showing that SMGC diversity within the section has the same magnitude as within the genus. Using our genome analysis we were able to predict the gene cluster responsible for biosynthesis of malformin, a potentiator of anti-cancer drugs, in 18 strains. To proof the general validity of our predictions, we developed genetic engineering tools in Aspergillus brasiliensis and subsequently verified the genes for biosynthesis of malformin.

Publication types

  • Research Support, Non-U.S. Gov't
  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Aspergillus / genetics
  • Aspergillus / metabolism
  • Biosynthetic Pathways*
  • Cluster Analysis
  • Computational Biology / methods
  • Data Mining
  • Gene Expression Profiling
  • Gene Expression Regulation*
  • Gene Regulatory Networks*
  • Genetic Engineering
  • Genomics / methods
  • Molecular Sequence Annotation
  • Multigene Family*
  • Secondary Metabolism / genetics*