Targeted RNA-sequencing with competitive multiplex-PCR amplicon libraries

PLoS One. 2013 Nov 13;8(11):e79120. doi: 10.1371/journal.pone.0079120. eCollection 2013.

Abstract

Whole transcriptome RNA-sequencing is a powerful tool, but is costly and yields complex data sets that limit its utility in molecular diagnostic testing. A targeted quantitative RNA-sequencing method that is reproducible and reduces the number of sequencing reads required to measure transcripts over the full range of expression would be better suited to diagnostic testing. Toward this goal, we developed a competitive multiplex PCR-based amplicon sequencing library preparation method that a) targets only the sequences of interest and b) controls for inter-target variation in PCR amplification during library preparation by measuring each transcript native template relative to a known number of synthetic competitive template internal standard copies. To determine the utility of this method, we intentionally selected PCR conditions that would cause transcript amplification products (amplicons) to converge toward equimolar concentrations (normalization) during library preparation. We then tested whether this approach would enable accurate and reproducible quantification of each transcript across multiple library preparations, and at the same time reduce (through normalization) total sequencing reads required for quantification of transcript targets across a large range of expression. We demonstrate excellent reproducibility (R² = 0.997) with 97% accuracy to detect 2-fold change using External RNA Controls Consortium (ERCC) reference materials; high inter-day, inter-site and inter-library concordance (R² = 0.97-0.99) using FDA Sequencing Quality Control (SEQC) reference materials; and cross-platform concordance with both TaqMan qPCR (R² = 0.96) and whole transcriptome RNA-sequencing following "traditional" library preparation using Illumina NGS kits (R² = 0.94). Using this method, sequencing reads required to accurately quantify more than 100 targeted transcripts expressed over a 10⁷-fold range was reduced more than 10,000-fold, from 2.3×10⁹ to 1.4×10⁵ sequencing reads. These studies demonstrate that the competitive multiplex-PCR amplicon library preparation method presented here provides the quality control, reproducibility, and reduced sequencing reads necessary for development and implementation of targeted quantitative RNA-sequencing biomarkers in molecular diagnostic testing.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Gene Expression Profiling
  • Gene Library
  • High-Throughput Nucleotide Sequencing
  • Humans
  • Multiplex Polymerase Chain Reaction*
  • ROC Curve
  • Reproducibility of Results
  • Reverse Transcriptase Polymerase Chain Reaction
  • Sequence Analysis, RNA
  • Transcriptome