Genome size distributions in bacteria and archaea are strongly linked to evolutionary history at broad phylogenetic scales

PLoS Genet. 2022 May 23;18(5):e1010220. doi: 10.1371/journal.pgen.1010220. eCollection 2022 May.

Abstract

The evolutionary forces that determine genome size in bacteria and archaea have been the subject of intense debate over the last few decades. Although the preferential loss of genes observed in prokaryotes is explained through the deletional bias, factors promoting and preventing the fixation of such gene losses often remain unclear. Importantly, statistical analyses on this topic typically do not consider the potential bias introduced by the shared ancestry of many lineages, which is critical when using species as data points because of the potential dependence on residuals. In this study, we investigated the genome size distributions across a broad diversity of bacteria and archaea to evaluate if this trait is phylogenetically conserved at broad phylogenetic scales. After model fit, Pagel's lambda indicated a strong phylogenetic signal in genome size data, suggesting that the diversification of this trait is influenced by shared evolutionary histories. We used a phylogenetic generalized least-squares analysis (PGLS) to test whether phylogeny influences the predictability of genome size from dN/dS ratios and 16S copy number, two variables that have been previously linked to genome size. These results confirm that failure to account for evolutionary history can lead to biased interpretations of genome size predictors. Overall, our results indicate that although bacteria and archaea can rapidly gain and lose genetic material through gene transfers and deletions, respectively, phylogenetic signal for genome size distributions can still be recovered at broad phylogenetic scales that should be taken into account when inferring the drivers of genome size evolution.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Archaea* / genetics
  • Bacteria / genetics
  • Evolution, Molecular*
  • Genome Size
  • Phylogeny

Grants and funding

This investigation was supported by grants from the Institute for Critical Technology, and Applied Science, the National Science Foundation (grant IIBR-1918271), and a Simons Early Career Award in Marine Microbial Ecology and Evolution to F.O.A. The funders had no role in study design, data collection, analysis, decision to publish, or preparation of the manuscript.