Error baseline rates of five sample preparation methods used to characterize RNA virus populations

PLoS One. 2017 Feb 9;12(2):e0171333. doi: 10.1371/journal.pone.0171333. eCollection 2017.

Abstract

Individual RNA viruses typically occur as populations of genomes that differ slightly from each other due to mutations introduced by the error-prone viral polymerase. Understanding the variability of RNA virus genome populations is critical for understanding virus evolution because individual mutant genomes may gain evolutionary selective advantages and give rise to dominant subpopulations, possibly even leading to the emergence of viruses resistant to medical countermeasures. Reverse transcription of virus genome populations followed by next-generation sequencing is the only available method to characterize variation for RNA viruses. However, both steps may lead to the introduction of artificial mutations, thereby skewing the data. To better understand how such errors are introduced during sample preparation, we determined and compared error baseline rates of five different sample preparation methods by analyzing in vitro transcribed Ebola virus RNA from an artificial plasmid-based system. These methods included: shotgun sequencing from plasmid DNA or in vitro transcribed RNA as a basic "no amplification" method, amplicon sequencing from the plasmid DNA or in vitro transcribed RNA as a "targeted" amplification method, sequence-independent single-primer amplification (SISPA) as a "random" amplification method, rolling circle reverse transcription sequencing (CirSeq) as an advanced "no amplification" method, and Illumina TruSeq RNA Access as a "targeted" enrichment method. The measured error frequencies indicate that RNA Access offers the best tradeoff between sensitivity and sample preparation error (1.4-5) of all compared methods.

MeSH terms

  • DNA Mutational Analysis / methods
  • DNA Mutational Analysis / standards
  • Diagnostic Errors
  • Genome, Viral
  • High-Throughput Nucleotide Sequencing* / methods
  • High-Throughput Nucleotide Sequencing* / standards
  • Humans
  • Polymorphism, Single Nucleotide
  • RNA Viruses / genetics*
  • RNA, Viral / analysis*
  • RNA, Viral / genetics
  • Research Design
  • Sequence Analysis, RNA* / methods
  • Sequence Analysis, RNA* / standards
  • Specimen Handling / methods*
  • Specimen Handling / standards

Substances

  • RNA, Viral

Grants and funding

This work was supported by Defense Threat Reduction Agency and Battelle Memorial Institute’s prime contract with the US National Institute of Allergy and Infectious Diseases (NIAID) under Contract No. HHSN272200700016I (J.H.K.). A subcontractor to Battelle Memorial Institute who performed this work is: J.H.K., an employee of Tunnell Government Services, Inc.