Predicting RNA SHAPE scores with deep learning

RNA Biol. 2020 Sep;17(9):1324-1330. doi: 10.1080/15476286.2020.1760534. Epub 2020 May 31.

Abstract

Secondary structure prediction approaches rely typically on models of equilibrium free energies that are themselves based on in vitro physical chemistry. Recent transcriptome-wide experiments of in vivo RNA structure based on SHAPE-MaP experiments provide important information that may make it possible to extend current in vitro-based RNA folding models in order to improve the accuracy of computational RNA folding simulations with respect to the experimentally measured in vivo RNA secondary structure. Here we present a machine learning approach that utilizes RNA secondary structure prediction results and nucleotide sequence in order to predict in vivo SHAPE scores. We show that this approach has a higher Pearson correlation coefficient with experimental SHAPE scores than thermodynamic folding. This could be an important step towards augmenting experimental results with computational predictions and help with RNA secondary structure predictions that inherently take in-vivo folding properties into account.

Keywords: RNA; SHAPE; SHAPE-MaP; deep learning; neural network; secondary structure.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, N.I.H., Intramural
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Codon, Initiator
  • Computational Biology* / methods
  • Deep Learning*
  • Models, Molecular*
  • Neural Networks, Computer
  • Nucleic Acid Conformation*
  • RNA / chemistry*
  • RNA / genetics
  • RNA Folding*

Substances

  • Codon, Initiator
  • RNA