Distinct evolutionary trajectories in the Escherichia coli pangenome occur within sequence types

Microb Genom. 2022 Nov;8(11):mgen000903. doi: 10.1099/mgen.0.000903.

Abstract

The Escherichia coli species contains a diverse set of sequence types and there remain important questions regarding differences in genetic content within this population that need to be addressed. Pangenomes are useful vehicles for studying gene content within sequence types. Here, we analyse 21 E. coli sequence type pangenomes using comparative pangenomics to identify variance in both pangenome structure and content. We present functional breakdowns of sequence type core genomes and identify sequence types that are enriched in metabolism, transcription and cell membrane biogenesis genes. We also uncover metabolism genes that have variable core classification, depending on which allele is present. Our comparative pangenomics approach allows for detailed exploration of sequence type pangenomes within the context of the species. We show that ongoing gene gain and loss in the E. coli pangenome is sequence type-specific, which may be a consequence of distinct sequence type-specific evolutionary drivers.

Keywords: Escherichia coli; comparative pangenomics; pangenomes; sequence types.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Biological Evolution*
  • Escherichia coli* / genetics
  • Genomics

Associated data

  • figshare/10.6084/m9.figshare.21360108