A fine-scale map of genome-wide recombination in divergent Escherichia coli population

Brief Bioinform. 2021 Jul 20;22(4):bbaa335. doi: 10.1093/bib/bbaa335.

Abstract

Recombination is one of the most important molecular mechanisms of prokaryotic genome evolution, but its exact roles are still in debate. Here we try to infer genome-wide recombination within a species, utilizing a dataset of 149 complete genomes of Escherichia coli from diverse animal hosts and geographic origins, including 45 in-house sequenced with the single-molecular real-time platform. Two major clades identified based on physiological, clinical and ecological characteristics form distinct genetic lineages based on scarcity of interclade gene exchanges. By defining gene-based syntenies for genomic segments within and between the two clades, we build a fine-scale recombination map for this representative global E. coli population. The map suggests extensive within-clade recombination that often breaks physical linkages among individual genes but seldom interrupts the structure of genome organizational frameworks as well as primary metabolic portfolios supported by the framework integrity, possibly due to strong natural selection for both physiological compatibility and ecological fitness. In contrast, the between-clade recombination declines drastically when phylogenetic distance increases to the extent where a 10-fold reduction can be observed, establishing a firm genetic barrier between clades. Our empirical data suggest a critical role for such recombination events in the early stage of speciation where recombination rate is associated with phylogenetic distance in addition to sequence and gene variations. The extensive intraclade recombination binds sister strains into a quasisexual group and optimizes genes or alleles to streamline physiological activities, whereas the sharply declined interclade recombination split the population into clades adaptive to divergent ecological niches.

Keywords: Escherichia coli; population divergence; recombination.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Animals
  • Escherichia coli / genetics*
  • Evolution, Molecular*
  • Genetic Variation*
  • Genome, Bacterial*
  • Genome-Wide Association Study
  • Humans
  • Recombination, Genetic*
  • Selection, Genetic*