Probing binding hot spots at protein-RNA recognition sites

Nucleic Acids Res. 2016 Jan 29;44(2):e9. doi: 10.1093/nar/gkv876. Epub 2015 Sep 13.

Abstract

We use evolutionary conservation derived from structure alignment of polypeptide sequences along with structural and physicochemical attributes of protein-RNA interfaces to probe the binding hot spots at protein-RNA recognition sites. We find that the degree of conservation varies across the RNA binding proteins; some evolve rapidly compared to others. Additionally, irrespective of the structural class of the complexes, residues at the RNA binding sites are evolutionary better conserved than those at the solvent exposed surfaces. For recognitions involving duplex RNA, residues interacting with the major groove are better conserved than those interacting with the minor groove. We identify multi-interface residues participating simultaneously in protein-protein and protein-RNA interfaces in complexes where more than one polypeptide is involved in RNA recognition, and show that they are better conserved compared to any other RNA binding residues. We find that the residues at water preservation site are better conserved than those at hydrated or at dehydrated sites. Finally, we develop a Random Forests model using structural and physicochemical attributes for predicting binding hot spots. The model accurately predicts 80% of the instances of experimental ΔΔG values in a particular class, and provides a stepping-stone towards the engineering of protein-RNA recognition sites with desired affinity.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Amino Acid Sequence
  • Binding Sites
  • Conserved Sequence
  • Databases, Protein
  • Evolution, Molecular
  • Humans
  • Models, Molecular
  • Models, Statistical*
  • Molecular Sequence Data
  • Nucleic Acid Conformation
  • Protein Binding
  • Protein Conformation
  • RNA / chemistry*
  • RNA-Binding Proteins / chemistry*
  • Thermodynamics
  • Water / chemistry

Substances

  • RNA-Binding Proteins
  • Water
  • RNA