Evolutionary trends of GC/AT distribution patterns in promoters

Mol Phylogenet Evol. 2011 Aug;60(2):228-35. doi: 10.1016/j.ympev.2011.04.015. Epub 2011 May 7.

Abstract

Nucleotide distributions in genomes is known not to be random, showing the presence of specific motifs, long and short range correlations, periodicities, etc. Particularly, motifs are critical for the recognition by specific proteins affecting chromosome organization, transcription and DNA replication but little is known about the possible functional effects of nucleotide distributions on the conformational landscape of DNA, putatively leading to differential selective pressures throughout evolution. Promoter sequences have a fundamental role in the regulation of gene activity and a vast literature suggests that their conformational landscapes may be a critical factor in gene expression dynamics. On these grounds, with the aim of investigating the putative existence of phylogenetic patterns of promoter base distributions, we analyzed GC/AT ratios along the 1000 nucleotide sequences upstream of TSS in wide sets of promoters belonging to organisms ranging from bacteria to pluricellular eukaryotes. The data obtained showed very clear phylogenetic trends throughout evolution of promoter sequence base distributions. Particularly, in all cases either GC-rich or AT-rich monotone gradients were observed: the former being present in eukaryotes, the latter in bacteria along with strand biases. Moreover, within eukaryotes, GC-rich gradients increased in length from unicellular organisms to plants, to vertebrates and, within them, from ancestral to more recent species. Finally, results were thoroughly discussed with particular attention to the possible correlation between nucleotide distribution patterns, evolution, and the putative existence of differential selection pressures, deriving from structural and/or functional constraints, between and within prokaryotes and eukaryotes.

MeSH terms

  • Animals
  • Bacteria / genetics*
  • Base Composition
  • Biological Evolution*
  • DNA / chemistry*
  • DNA / genetics
  • Databases, Nucleic Acid
  • Eukaryota / genetics*
  • Genome
  • Nucleic Acid Conformation
  • Phylogeny
  • Promoter Regions, Genetic / genetics*
  • Sequence Analysis, DNA

Substances

  • DNA