Delineation of pentatricopeptide repeat codes for target RNA prediction

Nucleic Acids Res. 2019 Apr 23;47(7):3728-3738. doi: 10.1093/nar/gkz075.

Abstract

Members of the pentatricopeptide repeat (PPR) protein family are sequence-specific RNA-binding proteins that play crucial roles in organelle RNA metabolism. Each PPR protein consists of a tandem array of PPR motifs, each of which aligns to one nucleotide of the RNA target. The di-residues in the PPR motif, which are referred to as the PPR codes, determine nucleotide specificity. Numerous PPR codes are distributed among the vast number of PPR motifs, but the correlation between PPR codes and RNA bases is poorly understood, which hinders target RNA prediction and functional investigation of PPR proteins. To address this issue, we developed a modular assembly method for high-throughput construction of designer PPRs, and by using this method, 62 designer PPR proteins containing various PPR codes were assembled. Then, the correlation between these PPR codes and RNA bases was systematically explored and delineated. Based on this correlation, the web server PPRCODE (http://yinlab.hzau.edu.cn/pprcode) was developed. Our study will not only serve as a platform for facilitating target RNA prediction and functional investigation of the large number of PPR family proteins but also provide an alternative strategy for the assembly of custom PPRs that can potentially be used for plant organelle RNA manipulation.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Amino Acid Sequence / genetics
  • Arabidopsis / genetics
  • Arabidopsis Proteins / genetics*
  • Models, Molecular
  • Nucleotide Motifs / genetics*
  • Organelles / genetics
  • RNA / genetics*
  • RNA-Binding Proteins / genetics*

Substances

  • Arabidopsis Proteins
  • RNA-Binding Proteins
  • pentatricopeptide repeat protein, Arabidopsis
  • RNA