Computational approaches for the analysis of RNA-protein interactions: A primer for biologists

J Biol Chem. 2019 Jan 4;294(1):1-9. doi: 10.1074/jbc.REV118.004842. Epub 2018 Nov 19.

Abstract

RNA-binding proteins (RBPs) play important roles in the control of gene expression and the coordination of different layers of post-transcriptional regulation. Interactions between certain RBPs and mRNA transcripts are notoriously difficult to predict, as any given protein-RNA interaction may rely not only on RNA sequence, but also on three-dimensional RNA structures, competitive inhibition from other RBPs, and input from cellular signaling pathways. Advanced and high-throughput technologies for the identification of RNA-protein interactions have come to the rescue, but the identification of binding sites and downstream functional effects of RBPs from the resulting data can be challenging. In this review, we discuss statistical inference and machine-learning approaches and tools relevant for the study of RBPs and the analysis of large-scale RNA-protein interaction datasets. This primer is intended for life scientists who are interested in incorporating these tools into their own research. We begin with the demystification of regression models, as used in the analysis of next-generation sequencing data, and progress to a discussion of Hidden Markov Models, which are of particular value in analyzing cross-linking followed by immunoprecipitation data. We then continue with examples of machine learning techniques, such as support vector machines and gradient tree boosting. We close with a brief discussion of current trends in the field, including deep learning architectures.

Keywords: RNA-binding protein; RNA-seq; RNA–protein interaction; computational biology; next generation sequencing; statistics; translation control.

Publication types

  • Review

MeSH terms

  • Computer Simulation*
  • Databases, Nucleic Acid
  • Databases, Protein
  • Models, Chemical*
  • RNA / genetics
  • RNA / metabolism
  • RNA-Binding Proteins / chemistry*
  • RNA-Binding Proteins / genetics
  • RNA-Binding Proteins / metabolism

Substances

  • RNA-Binding Proteins
  • RNA