In silico error correction improves cfDNA mutation calling

Bioinformatics. 2019 Jul 15;35(14):2380-2385. doi: 10.1093/bioinformatics/bty1004.

Abstract

Motivation: Circulating-free DNA (cfDNA) profiling by sequencing is an important minimally invasive protocol for monitoring the mutation profile of solid tumours in cancer patients. Since the concentration of available cfDNA is limited, sample library generation relies on multiple rounds of PCR amplification, during which the accumulation of errors results in reduced sensitivity and lower accuracy.

Results: We present PCR Error Correction (PEC), an algorithm to identify and correct errors in short read sequencing data. It exploits the redundancy that arises from multiple rounds of PCR amplification. PEC is particularly well suited to applications such as single-cell sequencing and circulating tumour DNA (ctDNA) analysis, in which many cycles of PCR are used to generate sufficient DNA for sequencing from small amounts of starting material. When applied to ctDNA analysis, PEC significantly improves mutation calling accuracy, achieving similar levels of performance to more complex strategies that require additional protocol steps and access to calibration DNA datasets.

Availability and implementation: PEC is available under the GPL-v3 Open Source licence, and is freely available from: https://github.com/CRUKMI-ComputationalBiology/PCR_Error_Correction.git.

Supplementary information: Supplementary data are available at Bioinformatics online.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Cell-Free Nucleic Acids
  • Circulating Tumor DNA
  • Computer Simulation
  • High-Throughput Nucleotide Sequencing
  • Humans
  • Mutation*
  • Sequence Analysis, DNA

Substances

  • Cell-Free Nucleic Acids
  • Circulating Tumor DNA