Pentatricopeptide repeat proteins constrain genome evolution in chloroplasts

Mol Biol Evol. 2011 Jul;28(7):2029-39. doi: 10.1093/molbev/msr023. Epub 2011 Jan 24.

Abstract

Higher plants encode hundreds of pentatricopeptide repeat proteins (PPRs) that are involved in several types of RNA processing reactions. Most PPR genes are predicted to be targeted to chloroplasts or mitochondria, and many are known to affect organellar gene expression. In some cases, RNA binding has been directly demonstrated, and the sequences of the cis-elements are known. In this work, we demonstrate that RNA cis-elements recognized by PPRs are constrained in chloroplast genome evolution. Cis-elements for two PPR genes and several RNA editing sites were analyzed for sequence changes by pairwise nucleotide substitution frequency, pairwise indel frequency, and maximum likelihood (ML) phylogenetic distances. All three of these analyses demonstrated that sequences within the cis-element are highly conserved compared with surrounding sequences. In addition, we have compared sequences around chloroplast editing sites and homologous sequences in species that lack an editing site due to the presence of a genomic T. Cis-elements for RNA editing sites are highly conserved in angiosperms; by contrast, comparable sequences around a genomically encoded T exhibit higher rates of nucleotide substitution, higher frequencies of indels, and greater ML distances. The loss in requirement for editing to create the ndhD start codon has resulted in the conversion of the PPR gene responsible for editing that site to a pseudogene. We show that organellar dependence on nuclear-encoded PPR proteins for gene expression has constrained the evolution of cis-elements that are required at the level of RNA processing. Thus, the expansion of the PPR gene family in plants has had a dramatic effect on the evolution of plant organelle genomes.

Publication types

  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Base Sequence
  • Chloroplasts / genetics*
  • Evolution, Molecular*
  • Genome, Plant*
  • Magnoliopsida / genetics
  • Membrane Proteins
  • Mitochondrial Proteins
  • Molecular Sequence Data
  • Mutation
  • Phylogeny
  • Plant Proteins / genetics*
  • Repetitive Sequences, Nucleic Acid
  • Sequence Alignment
  • Transcription Factors / genetics*

Substances

  • Membrane Proteins
  • Mitochondrial Proteins
  • Plant Proteins
  • Transcription Factors