Correcting palindromes in long reads after whole-genome amplification

BMC Genomics. 2018 Nov 6;19(1):798. doi: 10.1186/s12864-018-5164-1.

Abstract

Background: Next-generation sequencing requires sufficient DNA to be available. If limited, whole-genome amplification is applied to generate additional amounts of DNA. Such amplification often results in many chimeric DNA fragments, in particular artificial palindromic sequences, which limit the usefulness of long sequencing reads.

Results: Here, we present Pacasus, a tool for correcting such errors. Two datasets show that it markedly improves read mapping and de novo assembly, yielding results similar to these that would be obtained with non-amplified DNA.

Conclusions: With Pacasus long-read technologies become available for sequencing targets with very small amounts of DNA, such as single cells or even single chromosomes.

Keywords: Chimeric reads; High molecular weight DNA; Long read sequencing; Palindromes; Read mapping; Whole-genome amplification; de novo assembly.

MeSH terms

  • Algorithms
  • Animals
  • Arabidopsis / genetics*
  • DNA / analysis*
  • DNA / genetics
  • Gorilla gorilla / genetics*
  • Nucleotides / genetics*
  • Research Design
  • Sequence Analysis, DNA / methods*
  • Whole Genome Sequencing / methods*
  • Y Chromosome / genetics*

Substances

  • Nucleotides
  • DNA