Origin, evolution, and maintenance of gene-strand bias in bacteria

Nucleic Acids Res. 2024 Apr 24;52(7):3493-3509. doi: 10.1093/nar/gkae155.

Abstract

Gene-strand bias is a characteristic feature of bacterial genome organization wherein genes are preferentially encoded on the leading strand of replication, promoting co-orientation of replication and transcription. This co-orientation bias has evolved to protect gene essentiality, expression, and genomic stability from the harmful effects of head-on replication-transcription collisions. However, the origin, variation, and maintenance of gene-strand bias remain elusive. Here, we reveal that the frequency of inversions that alter gene orientation exhibits large variation across bacterial populations and negatively correlates with gene-strand bias. The density, distance, and distribution of inverted repeats show a similar negative relationship with gene-strand bias explaining the heterogeneity in inversions. Importantly, these observations are broadly evident across the entire bacterial kingdom uncovering inversions and inverted repeats as primary factors underlying the variation in gene-strand bias and its maintenance. The distinct catalytic subunits of replicative DNA polymerase have co-evolved with gene-strand bias, suggesting a close link between replication and the origin of gene-strand bias. Congruently, inversion frequencies and inverted repeats vary among bacteria with different DNA polymerases. In summary, we propose that the nature of replication determines the fitness cost of replication-transcription collisions, establishing a selection gradient on gene-strand bias by fine-tuning DNA sequence repeats and, thereby, gene inversions.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Bacteria* / genetics
  • Bacteria* / metabolism
  • DNA Replication* / genetics
  • DNA-Directed DNA Polymerase / genetics
  • DNA-Directed DNA Polymerase / metabolism
  • Evolution, Molecular*
  • Genome, Bacterial*
  • Genomic Instability
  • Inverted Repeat Sequences
  • Replication Origin / genetics
  • Transcription, Genetic

Substances

  • DNA-Directed DNA Polymerase