RNA Modification Detection Using Nanopore Direct RNA Sequencing and nanoDoc2

Methods Mol Biol. 2023:2632:299-319. doi: 10.1007/978-1-0716-2996-3_21.

Abstract

RNA modifications regulate multiple aspects of cellular function including RNA splicing, translation, export, decay, stability, and phase separation. One of the comprehensive ways to detect such modifications is by the recent advancement of direct RNA sequencing from Oxford Nanopore Technologies (ONT). However, this method obtains a large amount of data with high complexity in the form of raw current signal that poses a new informatics challenge to accurately detect those modifications. Here, we provide nanoDoc2, a software to detect multiple types of RNA modification from nanopore direct RNA sequencing data. The nanoDoc2 includes a novel signal segmentation algorithm based on the trace value-a base probability feature that is added by the Guppy basecalling program from ONT during processing of the raw signal. The core of nanoDoc2 includes a machine learning algorithm in which a 6-mer segmented raw current signal is analyzed by deep one-class classification using a WaveNet-based neural network. As an output, an RNA modification is detected by a statistical score in each candidate position. Herein, we describe the detailed instructions on how to use nanoDoc2 for signal segmentation, train/test the neural network, and finally predict RNA modifications present in nanopore direct RNA sequencing data.

Keywords: Deep learning; Epitranscriptome; Nanopore sequence; One-class classification; RNA modification; Ribosomal RNA; Single-molecule sequencing.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • High-Throughput Nucleotide Sequencing / methods
  • Nanopores*
  • RNA* / genetics
  • Sequence Analysis, RNA / methods
  • Software

Substances

  • RNA