Feature-Based and String-Based Models for Predicting RNA-Protein Interaction

Molecules. 2018 Mar 19;23(3):697. doi: 10.3390/molecules23030697.

Abstract

In this work, we study two approaches for the problem of RNA-Protein Interaction (RPI). In the first approach, we use a feature-based technique by combining extracted features from both sequences and secondary structures. The feature-based approach enhanced the prediction accuracy as it included much more available information about the RNA-protein pairs. In the second approach, we apply search algorithms and data structures to extract effective string patterns for prediction of RPI, using both sequence information (protein and RNA sequences), and structure information (protein and RNA secondary structures). This led to different string-based models for predicting interacting RNA-protein pairs. We show results that demonstrate the effectiveness of the proposed approaches, including comparative results against leading state-of-the-art methods.

Keywords: RNA Protein Interaction; RNA structure; RPI; k-mers; protein structure; richness; suffix trees.

MeSH terms

  • Algorithms*
  • Databases, Protein
  • Models, Molecular*
  • Nucleic Acid Conformation
  • RNA / chemistry
  • RNA / metabolism*
  • RNA, Long Noncoding / metabolism
  • RNA-Binding Proteins / chemistry
  • RNA-Binding Proteins / metabolism*

Substances

  • RNA, Long Noncoding
  • RNA-Binding Proteins
  • RNA