Poly(a) selection introduces bias and undue noise in direct RNA-sequencing

BMC Genomics. 2022 Jul 22;23(1):530. doi: 10.1186/s12864-022-08762-8.

Abstract

Background: Genome-wide RNA-sequencing technologies are increasingly critical to a wide variety of diagnostic and research applications. RNA-seq users often first enrich for mRNA, with the most popular enrichment method being poly(A) selection. In many applications it is well-known that poly(A) selection biases the view of the transcriptome by selecting for longer tailed mRNA species.

Results: Here, we show that poly(A) selection biases Oxford Nanopore direct RNA sequencing. As expected, poly(A) selection skews sequenced mRNAs toward longer poly(A) tail lengths. Interestingly, we identify a population of mRNAs (> 10% of genes' mRNAs) that are inconsistently captured by poly(A) selection due to highly variable poly(A) tails, and demonstrate this phenomenon in our hands and in published data. Importantly, we show poly(A) selection is dispensable for Oxford Nanopore's direct RNA-seq technique, and demonstrate successful library construction without poly(A) selection, with decreased input, and without loss of quality.

Conclusions: Our work expands the utility of direct RNA-seq by validating the use of total RNA as input, and demonstrates important technical artifacts from poly(A) selection that inconsistently skew mRNA expression and poly(A) tail length measurements.

Keywords: Direct RNA-sequencing; Oxford Nanopore; Poly(a) selection; RNA-sequencing; Transcriptomics.

MeSH terms

  • High-Throughput Nucleotide Sequencing / methods
  • Poly A* / genetics
  • Poly A* / metabolism
  • Polyadenylation
  • RNA* / metabolism
  • RNA, Messenger / genetics
  • RNA, Messenger / metabolism
  • Sequence Analysis, RNA / methods
  • Transcriptome

Substances

  • RNA, Messenger
  • Poly A
  • RNA