Machine learning modeling of RNA structures: methods, challenges and future perspectives

Brief Bioinform. 2023 Jul 20;24(4):bbad210. doi: 10.1093/bib/bbad210.

Abstract

The three-dimensional structure of RNA molecules plays a critical role in a wide range of cellular processes encompassing functions from riboswitches to epigenetic regulation. These RNA structures are incredibly dynamic and can indeed be described aptly as an ensemble of structures that shifts in distribution depending on different cellular conditions. Thus, the computational prediction of RNA structure poses a unique challenge, even as computational protein folding has seen great advances. In this review, we focus on a variety of machine learning-based methods that have been developed to predict RNA molecules' secondary structure, as well as more complex tertiary structures. We survey commonly used modeling strategies, and how many are inspired by or incorporate thermodynamic principles. We discuss the shortcomings that various design decisions entail and propose future directions that could build off these methods to yield more robust, accurate RNA structure predictions.

Keywords: RNA; RNA structure prediction; deep learning; machine learning; review; secondary structure; tertiary structure.

Publication types

  • Review
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Computational Biology / methods
  • Epigenesis, Genetic*
  • Machine Learning
  • Protein Structure, Secondary
  • RNA* / metabolism

Substances

  • RNA