Evaluating DCA-based method performances for RNA contact prediction by a well-curated data set

RNA. 2020 Jul;26(7):794-802. doi: 10.1261/rna.073809.119. Epub 2020 Apr 10.

Abstract

RNA molecules play many pivotal roles in a cell that are still not fully understood. Any detailed understanding of RNA function requires knowledge of its three-dimensional structure, yet experimental RNA structure resolution remains demanding. Recent advances in sequencing provide unprecedented amounts of sequence data that can be statistically analyzed by methods such as direct coupling analysis (DCA) to determine spatial proximity or contacts of specific nucleic acid pairs, which improve the quality of structure prediction. To quantify this structure prediction improvement, we here present a well curated data set of about 70 RNA structures of high resolution and compare different nucleotide-nucleotide contact prediction methods available in the literature. We observe only minor differences between the performances of the different methods. Moreover, we discuss how robust these predictions are for different contact definitions and how strongly they depend on procedures used to curate and align the families of homologous RNA sequences.

Keywords: RNA contact prediction; RNA structure prediction; direct coupling analysis; multiple sequence alignment.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Data Analysis
  • Datasets as Topic
  • Nucleic Acid Conformation
  • RNA / genetics*
  • Sequence Alignment / methods

Substances

  • RNA