Evidence for transit peptide acquisition through duplication and subsequent frameshift mutation of a preexisting protein gene in rice

Mol Biol Evol. 2006 Dec;23(12):2405-12. doi: 10.1093/molbev/msl112. Epub 2006 Sep 13.

Abstract

Many proteins synthesized in the cytosol are delivered to their appropriate compartments in the cell by specific targeting signals. Here, we provide new insight into the generation of the chloroplast-targeting signal (called the transit peptide) in rice. First, we identified the mitochondrial ribosomal protein L13 (mt rpl13) gene on chromosome 5. Downstream of the gene, we identified a DNA fragment of 266 bp: a segment within a duplication of mt rpl13. The duplicated region was transcribed and found to encode an open reading frame (ORF) of 160 amino acids (aa) (orf160). The orf160 gene comprises C-terminal 60 aa derived from the mt rpl13 gene and N-terminal 100 aa derived from another duplicated fragment of a pentatricopeptide repeat (ppr)564 gene that encodes 564 aa with ppr motifs on chromosome 1. Examination of the localization of the ORF160 protein tagged with green fluorescent protein (GFP) showed that it is targeted to the chloroplasts. As such, ORF160 clearly contains a transit peptide. Interestingly, this was translated from the alternative reading frame of the duplicated fragment of ppr564. To confirm this, the reading frame of the ppr564 gene was shifted according to that of the orf160 gene, and the frameshifted ppr564 sequence was fused to the gene for GFP. The expressed GFP-fused protein was also located in the chloroplasts. These results provide clear evidence for the generation of the transit peptide through duplication and subsequent frameshifting of a reading frame of a preexisting protein gene. We also demonstrate the importance of sequence redundancy and frameshift mutation in this evolutionary process.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Amino Acid Sequence
  • Base Sequence
  • Cell Nucleus / genetics
  • Chloroplasts / genetics
  • Chloroplasts / physiology
  • DNA, Mitochondrial / genetics
  • Drosophila Proteins / genetics
  • Evolution, Molecular
  • Frameshift Mutation*
  • Gene Duplication*
  • Models, Biological
  • Molecular Sequence Data
  • Mutant Chimeric Proteins / genetics
  • Mutant Chimeric Proteins / physiology
  • Open Reading Frames / genetics
  • Oryza / genetics*
  • Plant Proteins / genetics
  • Protein Sorting Signals / genetics*
  • Ribosomal Proteins / genetics
  • Sequence Homology, Amino Acid

Substances

  • DNA, Mitochondrial
  • Drosophila Proteins
  • Mutant Chimeric Proteins
  • Plant Proteins
  • Protein Sorting Signals
  • Ribosomal Proteins
  • RpL13 protein, Drosophila

Associated data

  • GENBANK/AB244817