CircSeqAlignTk: An R package for end-to-end analysis of RNA-seq data for circular genomes

F1000Res. 2024 Apr 30:11:1221. doi: 10.12688/f1000research.127348.1. eCollection 2022.

Abstract

RNA sequencing (RNA-seq) technology has become one of the standard tools for studying biological mechanisms at the transcriptome level. Advances in RNA-seq technology have led to the development of numerous publicly available tools for RNA-seq data analysis. Most of these tools target linear genome sequences, despite the need to study organisms with circular genome sequences. For example, studying the infection mechanisms of viroids, which are circular RNAs of 246-401 nucleotides that infect plants, may help prevent tremendous economic and agricultural damage. Unfortunately, using the available tools to construct workflows for the analysis of circular genome sequences is difficult, especially for non-bioinformaticians. To overcome this limitation, we present CircSeqAlignTk, an easy-to-use and richly documented R package. CircSeqAlignTk offers both command line and graphical user interfaces for end-to-end RNA-seq data analysis of circular genome sequences, from alignment to visualisation, via a series of functions. Moreover, it includes a feature to generate synthetic sequencing data that mirrors real RNA-seq data from biological experiments. CircSeqAlignTk not only provides an easy-to-use analysis interface for novice users but also allows developers to evaluate the performance of alignment tools and new workflows.
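To illustrate why tools built for linear genomes fall short here, the following minimal Python sketch shows a read that spans the start/end junction of a circular sequence. It finds no match against the linear reference, but it does match once the reference is extended past the junction. This is not CircSeqAlignTk's implementation or API. The function names `find_circular_matches` and `circular_coverage`, the exact-match strategy, and the toy sequence are all assumptions made for illustration only.

```python
from typing import List


def find_circular_matches(genome: str, read: str) -> List[int]:
    """Return 0-based start positions where `read` exactly matches the
    circular `genome` (hypothetical helper, exact matching only)."""
    n = len(genome)
    if not read or len(read) > n:
        return []
    # Append the first len(read) - 1 bases so that reads crossing the
    # junction between the last and first base can still be found.
    extended = genome + genome[: len(read) - 1]
    starts = []
    pos = extended.find(read)
    while pos != -1:
        starts.append(pos)
        pos = extended.find(read, pos + 1)
    return starts


def circular_coverage(genome: str, reads: List[str]) -> List[int]:
    """Count, per genome position, how many matched reads cover it,
    wrapping positions around the end of the circular sequence."""
    n = len(genome)
    cov = [0] * n
    for read in reads:
        for start in find_circular_matches(genome, read):
            for offset in range(len(read)):
                cov[(start + offset) % n] += 1
    return cov


if __name__ == "__main__":
    genome = "ACGTTGCA"   # toy circular sequence, not a real viroid
    junction_read = "CAAC"  # last two bases followed by the first two
    print(junction_read in genome)                       # linear search
    print(find_circular_matches(genome, junction_read))  # circular search
    print(circular_coverage(genome, [junction_read, "GTTG"]))
```

Real aligners tolerate mismatches and handle both strands, so this sketch only conveys the junction problem, not a usable alignment workflow.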

Keywords: R package; alignment; circular genome sequence; small RNA-seq; viroid; visualisation.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Computational Biology / methods
  • Genome
  • RNA, Circular / genetics
  • RNA-Seq* / methods
  • Sequence Analysis, RNA / methods
  • Software*

Substances

  • RNA, Circular

Grants and funding

This work was supported by JSPS KAKENHI [21K05608 and 22H05179].