ConcatSeq: A method for increasing throughput of single molecule sequencing by concatenating short DNA fragments

Sci Rep. 2017 Jul 12;7(1):5252. doi: 10.1038/s41598-017-05503-w.

Abstract

Single molecule sequencing (SMS) platforms enable base sequences to be read directly from individual strands of DNA in real-time. Though capable of long read lengths, SMS platforms currently suffer from low throughput compared to competing short-read sequencing technologies. Here, we present a novel strategy for sequencing library preparation, dubbed ConcatSeq, which increases the throughput of SMS platforms by generating long concatenated templates from pools of short DNA molecules. We demonstrate adaptation of this technique to two target enrichment workflows, commonly used for oncology applications, and feasibility using PacBio single molecule real-time (SMRT) technology. Our approach is capable of increasing the sequencing throughput of the PacBio RSII platform by more than five-fold, while maintaining the ability to correctly call allele frequencies of known single nucleotide variants. ConcatSeq provides a versatile new sample preparation tool for long-read sequencing technologies.

MeSH terms

  • DNA, Concatenated / analysis*
  • DNA, Concatenated / genetics*
  • Genome, Human*
  • High-Throughput Nucleotide Sequencing / methods*
  • Humans
  • Male
  • Molecular Sequence Annotation
  • Sequence Analysis, DNA / methods*

Substances

  • DNA, Concatenated