Reproducible Analysis of Sequencing-Based RNA Structure Probing Data with User-Friendly Tools

Methods Enzymol. 2015:558:153-180. doi: 10.1016/bs.mie.2015.01.014. Epub 2015 Mar 3.

Abstract

RNA structure-probing data can improve the prediction of RNA secondary and tertiary structure and allow structural changes to be identified and investigated. In recent years, massively parallel sequencing has dramatically increased the throughput of RNA structure-probing experiments, but it has also made analysis of the data challenging for scientists without formal training in computational biology. Here, we discuss different strategies for analyzing massively parallel sequencing-based structure-probing data. To facilitate reproducible and standardized analysis of this type of data, we have assembled a collection of tools that convert raw sequencing reads to normalized probing values using different published strategies. We also provide tools for visualizing the probing data in the UCSC Genome Browser and for converting between RNA and genomic coordinates. The collection is implemented as functions in the R statistical environment and as tools in the Galaxy platform, making it easily accessible to the scientific community. We demonstrate the usefulness of the collection by applying it to sequencing-based hydroxyl radical probing data and comparing different normalization strategies.
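
As an illustration of the kind of normalization named in the keywords, the sketch below shows a generic Winsorization of per-nucleotide probing values followed by rescaling to the 0-1 range. It is not the collection's own implementation, which is written in R. The function names `percentile` and `winsorize_and_scale`, the 5% and 95% cut-offs, and the example values are all assumptions chosen for illustration.

```python
def percentile(sorted_vals, q):
    """Linear-interpolation percentile (q in [0, 1]) of a pre-sorted list."""
    if not sorted_vals:
        raise ValueError("empty input")
    pos = q * (len(sorted_vals) - 1)
    lo = int(pos)
    hi = min(lo + 1, len(sorted_vals) - 1)
    frac = pos - lo
    return sorted_vals[lo] * (1 - frac) + sorted_vals[hi] * frac


def winsorize_and_scale(values, lower=0.05, upper=0.95):
    """Clip values to the [lower, upper] percentiles, then rescale to [0, 1].

    The cut-offs are illustrative defaults (an assumption), not values
    taken from the article.
    """
    s = sorted(values)
    lo_cut = percentile(s, lower)
    hi_cut = percentile(s, upper)
    if hi_cut == lo_cut:
        # No spread left after clipping: return a flat profile.
        return [0.0 for _ in values]
    # Winsorize: replace extreme values with the cut-off values.
    clipped = [min(max(v, lo_cut), hi_cut) for v in values]
    # Rescale so the lower cut-off maps to 0 and the upper cut-off to 1.
    return [(v - lo_cut) / (hi_cut - lo_cut) for v in clipped]


# Hypothetical raw reactivities with one extreme outlier at the end.
raw = [0, 1, 2, 3, 4, 5, 6, 7, 8, 9, 100]
normalized = winsorize_and_scale(raw)
```

Clipping before rescaling keeps a single extreme position, such as the last value in `raw`, from compressing every other position toward zero.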

Keywords: Bioconductor; Galaxy; Normalization; Probing; RNA; Sequencing; Structure; Winsorization.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms*
  • Base Sequence
  • Computational Biology
  • Computer Graphics
  • DNA Barcoding, Taxonomic / methods
  • DNA Probes / chemistry
  • Data Mining
  • Genome*
  • High-Throughput Nucleotide Sequencing / instrumentation
  • High-Throughput Nucleotide Sequencing / methods
  • High-Throughput Nucleotide Sequencing / statistics & numerical data*
  • Hydroxyl Radical / chemistry
  • Models, Statistical*
  • Molecular Probes / chemistry
  • Molecular Sequence Data
  • RNA / chemistry*
  • Sequence Analysis, RNA / instrumentation
  • Sequence Analysis, RNA / methods
  • Sequence Analysis, RNA / statistics & numerical data*
  • Software*

Substances

  • DNA Probes
  • Molecular Probes
  • Hydroxyl Radical
  • RNA