DENA: training an authentic neural network model using Nanopore sequencing data of Arabidopsis transcripts for detection and quantification of N6-methyladenosine on RNA

Genome Biol. 2022 Jan 17;23(1):25. doi: 10.1186/s13059-021-02598-3.

Abstract

Models developed using Nanopore direct RNA sequencing data from in vitro synthetic RNA with all adenosine replaced by N6-methyladenosine (m6A) are likely distorted due to superimposed signals from saturated m6A residues. Here, we develop a neural network, DENA, for m6A quantification using the sequencing data of in vivo transcripts from Arabidopsis. DENA identifies 90% of miCLIP-detected m6A sites in Arabidopsis and obtains modification rates in human consistent to those found by SCARLET, demonstrating its robustness across species. We sequence the transcriptome of two additional m6A-deficient Arabidopsis, mtb and fip37-4, using Nanopore and evaluate their single-nucleotide m6A profiles using DENA.

Keywords: Arabidopsis thaliana; AtFIP37; AtMTB; DENA; N 6-Methyladenosine; Nanopore direct RNA sequencing.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Adenosine / analogs & derivatives
  • Arabidopsis* / genetics
  • Humans
  • Nanopore Sequencing*
  • Nanopores*
  • Neural Networks, Computer
  • RNA
  • RNA, Messenger / genetics
  • Transcriptome

Substances

  • RNA, Messenger
  • RNA
  • N-methyladenosine
  • Adenosine