Comprehensive comparison of two types of algorithm for circRNA detection from short-read RNA-Seq

Hongfei Liu; Zhanerke Akhatayeva; Chuanying Pan; Mingzhi Liao; Xianyong Lan

doi:10.1093/bioinformatics/btac302

Bioinformatics. 2022 May 26;38(11):3037-3043. doi: 10.1093/bioinformatics/btac302.

Authors

Hongfei Liu¹, Zhanerke Akhatayeva¹, Chuanying Pan¹, Mingzhi Liao², Xianyong Lan¹

Affiliations

¹ College of Animal Science and Technology, Northwest A&F University, Yangling, Shaanxi 712100, China.
² College of Life Sciences, Northwest A&F University, Yangling, Shaanxi 712100, China.

PMID: 35482518
DOI: 10.1093/bioinformatics/btac302

Abstract

Motivation: Circular RNA is generally formed by 'back-splicing' between an upstream splice acceptor and a downstream splice donor, with or without regulation by the corresponding RNA-binding proteins or cis-elements. Accordingly, a growing number of software packages have been developed for circRNA detection, most of them based on identifying back-spliced junction reads. However, recent studies have introduced two software tools that detect circRNA candidates by constructing a k-mer table and/or a de Bruijn graph rather than by read mapping.
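
To make the alignment-free idea concrete, the following is a minimal Python sketch, not the implementation of any tool evaluated in the study. It assumes a candidate back-splice junction is formed by joining the 3' end of a downstream exon to the 5' start of an upstream exon, builds a table of the k-mers that span that junction, and counts reads containing at least one of them. The function names, sequences and the value of k are hypothetical and chosen only for illustration.

```python
from typing import Iterable, Set


def junction_kmers(downstream_exon: str, upstream_exon: str, k: int) -> Set[str]:
    """Return the k-mers that span a hypothetical back-splice junction.

    The junction is modelled as the 3' end of the downstream exon
    (splice donor side) joined to the 5' start of the upstream exon
    (splice acceptor side). This is an illustrative assumption.
    """
    flank = k - 1
    left = downstream_exon[-flank:]   # last k-1 bases before the junction
    right = upstream_exon[:flank]     # first k-1 bases after the junction
    joined = left + right
    split = len(left)                 # position of the junction in `joined`
    kmers = set()
    for i in range(len(joined) - k + 1):
        # Keep only k-mers with at least one base on each side of the junction.
        if i < split < i + k:
            kmers.add(joined[i:i + k])
    return kmers


def count_supporting_reads(reads: Iterable[str], kmers: Set[str], k: int) -> int:
    """Count reads that contain at least one junction-spanning k-mer."""
    support = 0
    for read in reads:
        if any(read[i:i + k] in kmers for i in range(len(read) - k + 1)):
            support += 1
    return support


# Toy example with made-up sequences.
table = junction_kmers("AAAACCCC", "GGGGTTTT", k=4)
reads = ["TTCCGGTT", "AAAACCCC", "ACCCGGGA"]
n_support = count_supporting_reads(reads, table, k=4)
```

Real tools use much longer k-mers, handle both strands and sequencing errors, and derive candidate junctions from annotation or from graph traversal. None of that is modelled here.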

Results: Here, we compared the precision, sensitivity and detection efficiency of software tools based on the two types of algorithm. Eleven representative detection tools covering both algorithm types were selected for a full-pipeline analysis of RNA-seq datasets with and without RNase R treatment in two cell lines. Precision, sensitivity, AUC, F1 score and detection efficiency were assessed to compare the prediction tools. The sensitivity and distribution of highly expressed circRNAs before and after RNase R treatment were also characterized by the frequencies of enriched, unaffected and depleted candidates. Overall, we found that, compared with the k-mer-based tools, the read-mapping-based tools CIRI2 and KNIFE had comparatively better and more balanced detection performance regardless of cell line or RNase R treatment (-/+).
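
As an illustration of how precision, sensitivity and F1 score relate to one another, the sketch below computes them from a set of predicted circRNAs and a reference set using the standard definitions. The abstract does not state how true positives were defined in the study, so the reference set, the coordinate tuples and the function name are assumptions made for this example only.

```python
from typing import Dict, Set, Tuple

# A circRNA is represented here as (chromosome, start, end); this is an assumption.
CircRNA = Tuple[str, int, int]


def detection_metrics(predicted: Set[CircRNA], reference: Set[CircRNA]) -> Dict[str, float]:
    """Compute precision, sensitivity (recall) and F1 from two candidate sets."""
    tp = len(predicted & reference)   # predicted and in the reference set
    fp = len(predicted - reference)   # predicted but not in the reference set
    fn = len(reference - predicted)   # in the reference set but missed
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    sensitivity = tp / (tp + fn) if (tp + fn) else 0.0
    denom = precision + sensitivity
    f1 = 2 * precision * sensitivity / denom if denom else 0.0
    return {"precision": precision, "sensitivity": sensitivity, "f1": f1}


# Toy example with made-up coordinates.
predicted = {("chr1", 100, 500), ("chr1", 700, 900), ("chr2", 50, 300), ("chr3", 10, 90)}
reference = {("chr1", 100, 500), ("chr2", 50, 300), ("chr4", 1, 99)}
metrics = detection_metrics(predicted, reference)
```

Because F1 is the harmonic mean of precision and sensitivity, a tool scores well on it only when both are reasonably high, which is one way to read "balanced" detection performance. AUC and detection efficiency are not covered by this sketch.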

Availability and implementation: All predicted results and source code can be retrieved from https://github.com/luffy563/circRNA_tools_comparison.

Supplementary information: Supplementary data are available at Bioinformatics online.

Publication types

Research Support, Non-U.S. Gov't

MeSH terms

Algorithms*
RNA, Circular*
RNA-Seq
Sequence Analysis, RNA / methods
Software

Substances

RNA, Circular