A generic approach to infer community-level fitness of microbial genes

Proc Natl Acad Sci U S A. 2024 Apr 23;121(17):e2318380121. doi: 10.1073/pnas.2318380121. Epub 2024 Apr 18.

Abstract

The gene content in a metagenomic pool defines the function potential of a microbial community. Natural selection, operating on the level of genomes or genes, shapes the evolution of community functions by enriching some genes while depriving the others. Despite the importance of microbiomes in the environment and health, a general metric to evaluate the community-wide fitness of microbial genes remains lacking. In this work, we adapt the classic neutral model of species and use it to predict how the abundances of different genes will be shaped by selection, regardless of at which level the selection acts. We establish a simple metric that quantitatively infers the average survival capability of each gene in a microbiome. We then experimentally validate the predictions using synthetic communities of barcoded Escherichia coli strains undergoing neutral assembly and competition. We further show that this approach can be applied to publicly available metagenomic datasets to gain insights into the environment-function interplay of natural microbiomes.

Keywords: gene abundance; gene prevalence; microbial genes; natural selection; power law.

MeSH terms

  • Genes, Microbial
  • Metagenome / genetics
  • Microbiota* / genetics
  • Selection, Genetic