LncReader: identification of dual functional long noncoding RNAs using a multi-head self-attention mechanism

Brief Bioinform. 2023 Jan 19;24(1):bbac579. doi: 10.1093/bib/bbac579.

Abstract

Long noncoding RNAs (lncRNAs) endowed with both protein-coding and noncoding functions are referred to as 'dual functional lncRNAs'. Dual functional lncRNAs have recently been studied intensively and implicated in various fundamental cellular processes. However, apart from time-consuming and cell-type-specific experiments, there is virtually no in silico method for identifying dual functional lncRNAs. Here, we developed LncReader, a deep-learning model with a multi-head self-attention mechanism, to identify dual functional lncRNAs. On benchmark datasets from our previously reported cncRNAdb project, LncReader showed multiple advantages over various classical machine learning methods. Moreover, to obtain independent in-house datasets for robust testing, we applied mass spectrometry proteomics combined with RNA-seq and Ribo-seq to four leukaemia cell lines; these data further confirmed that LncReader achieved the best performance among the tools compared. LncReader therefore provides an accurate and practical tool for fast identification of dual functional lncRNAs.
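
For readers unfamiliar with the mechanism named above, the following is a minimal sketch of generic multi-head self-attention applied to an RNA sequence. It is not the LncReader architecture, which the abstract does not specify: the one-hot encoding, the two heads, the random (untrained) weights, the example sequence and all function names are assumptions chosen purely for illustration.

```python
import math
import random

NUCLEOTIDES = "ACGU"  # assumed alphabet; other symbols map to an all-zero vector


def one_hot(seq):
    """Encode an RNA string as a list of 4-dimensional one-hot vectors."""
    return [[1.0 if base == n else 0.0 for n in NUCLEOTIDES] for base in seq]


def matmul(a, b):
    """Multiply two matrices given as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]


def softmax(row):
    """Numerically stable softmax over one list of scores."""
    peak = max(row)
    exps = [math.exp(v - peak) for v in row]
    total = sum(exps)
    return [e / total for e in exps]


def random_matrix(rows, cols, rng):
    """Untrained placeholder weights; a real model would learn these."""
    return [[rng.uniform(-1.0, 1.0) for _ in range(cols)] for _ in range(rows)]


def multi_head_self_attention(x, num_heads, rng):
    """Return (output, attention) for an input x of shape (length, d_model)."""
    d_model = len(x[0])
    assert d_model % num_heads == 0, "d_model must be divisible by num_heads"
    d_head = d_model // num_heads
    head_outputs, attention = [], []
    for _ in range(num_heads):
        # Each head has its own query, key and value projections.
        w_q = random_matrix(d_model, d_head, rng)
        w_k = random_matrix(d_model, d_head, rng)
        w_v = random_matrix(d_model, d_head, rng)
        q, k, v = matmul(x, w_q), matmul(x, w_k), matmul(x, w_v)
        # scores[i][j] is the dot product of query i with key j.
        k_t = [list(col) for col in zip(*k)]
        scores = matmul(q, k_t)
        # Scale by sqrt(d_head), then normalise each row into weights.
        weights = [softmax([s / math.sqrt(d_head) for s in row]) for row in scores]
        attention.append(weights)
        head_outputs.append(matmul(weights, v))
    # Concatenate the heads position by position, then mix them linearly.
    concat = [sum((head[i] for head in head_outputs), []) for i in range(len(x))]
    w_o = random_matrix(d_model, d_model, rng)
    return matmul(concat, w_o), attention


rng = random.Random(0)          # fixed seed so the sketch is reproducible
x = one_hot("AUGGCUAA")         # hypothetical 8-nucleotide example sequence
output, attention = multi_head_self_attention(x, num_heads=2, rng=rng)
print(len(output), len(output[0]), len(attention))
```

Each head produces its own position-by-position weight matrix, so different heads can attend to different sequence contexts; in a trained classifier the concatenated output would feed further layers rather than being used directly.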

Keywords: deep learning; noncoding RNA; systems biology.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • RNA, Long Noncoding* / chemistry
  • RNA, Long Noncoding* / genetics
  • RNA-Seq

Substances

  • RNA, Long Noncoding