A 3'-end capture sequencing method for high-throughput targeted gene expression profiling

Biotechnol J. 2022 Sep;17(9):e2100660. doi: 10.1002/biot.202100660. Epub 2022 May 24.

Abstract

Molecular phenotyping through shallow 3'-end RNA-sequencing workflows is increasingly applied in the context of large-scale chemical or genetic perturbation screens to study disease biology or support drug discovery. While these workflows enable accurate quantification of the most abundant genes, they are less effective for applications that require expression profiling of low abundant transcripts, like long noncoding RNAs (lncRNAs), or selected gene panels. To tackle these issues, we describe a workflow combining 3'-end library preparation with 3'-end hybrid capture probes and shallow RNA-sequencing for cost-effective, targeted quantification of subsets of (low abundant) genes across hundreds to thousands of samples. To assess the performance of the method, we designed a capture probe set for more than 100 mRNA and lncRNA target genes and applied the workflow to a cohort of 360 samples. When compared to standard 3'-end RNA-sequencing, 3'-end capture sequencing resulted in a more than 200-fold enrichment of target gene abundance while conserving relative intergene and intersample abundances. 3'-end RNA capture sequencing enables accurate targeted gene expression profiling at extremely shallow sequencing depth.

Keywords: RNA-sequencing; abundance; capture-sequencing; coverage; probes; transcriptomics.

MeSH terms

  • Gene Expression Profiling* / methods
  • Gene Library
  • High-Throughput Nucleotide Sequencing / methods
  • Humans
  • RNA, Long Noncoding* / genetics
  • RNA, Messenger / genetics
  • Sequence Analysis, RNA / methods

Substances

  • RNA, Long Noncoding
  • RNA, Messenger