Whole genome phylogenies reflect the distributions of recombination rates for many bacterial species

Elife. 2021 Jan 8:10:e65366. doi: 10.7554/eLife.65366.

Abstract

Although recombination is accepted to be common in bacteria, for many species robust phylogenies with well-resolved branches can be reconstructed from whole genome alignments of strains, and these are generally interpreted to reflect clonal relationships. Using new methods based on the statistics of single-nucleotide polymorphism (SNP) splits, we show that this interpretation is incorrect. For many species, each locus has recombined many times along its line of descent, and instead of many loci supporting a common phylogeny, the phylogeny changes many thousands of times along the genome alignment. Analysis of the patterns of allele sharing among strains shows that bacterial populations cannot be approximated as either clonal or freely recombining but are structured such that recombination rates between lineages vary over several orders of magnitude, with a unique pattern of rates for each lineage. Thus, rather than reflecting clonal ancestry, whole genome phylogenies reflect distributions of recombination rates.

Keywords: B. subtilis; E. coli; evolutionary biology; human; phylogeny; population structure; prokaryotic genome evolution; recombination; single nucleotide polymorphisms.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Bacillus subtilis / classification
  • Bacillus subtilis / genetics
  • Bacteria / classification
  • Bacteria / genetics*
  • Escherichia coli / classification
  • Escherichia coli / genetics
  • Evolution, Molecular
  • Genome, Bacterial*
  • Helicobacter pylori / classification
  • Helicobacter pylori / genetics
  • Mycobacterium tuberculosis / classification
  • Mycobacterium tuberculosis / genetics
  • Phylogeny*
  • Polymorphism, Single Nucleotide
  • Recombination, Genetic*
  • Salmonella enterica / classification
  • Salmonella enterica / genetics
  • Sequence Analysis, DNA
  • Staphylococcus aureus / classification
  • Staphylococcus aureus / genetics
  • Whole Genome Sequencing