A survey of error-correction methods for next-generation sequencing

Brief Bioinform. 2013 Jan;14(1):56-66. doi: 10.1093/bib/bbs015. Epub 2012 Apr 6.

Abstract

Error Correction is important for most next-generation sequencing applications because highly accurate sequenced reads will likely lead to higher quality results. Many techniques for error correction of sequencing data from next-gen platforms have been developed in the recent years. However, compared with the fast development of sequencing technologies, there is a lack of standardized evaluation procedure for different error-correction methods, making it difficult to assess their relative merits and demerits. In this article, we provide a comprehensive review of many error-correction methods, and establish a common set of benchmark data and evaluation criteria to provide a comparative assessment. We present experimental results on quality, run-time, memory usage and scalability of several error-correction methods. Apart from providing explicit recommendations useful to practitioners, the review serves to identify the current state of the art and promising directions for future research.

Availability: All error-correction programs used in this article are downloaded from hosting websites. The evaluation tool kit is publicly available at: http://aluru-sun.ece.iastate.edu/doku.php?id=ecr.

Publication types

  • Research Support, Non-U.S. Gov't
  • Research Support, U.S. Gov't, Non-P.H.S.
  • Review

MeSH terms

  • Algorithms
  • Animals
  • Chromosome Mapping / statistics & numerical data
  • Chromosome Mapping / trends
  • Computational Biology
  • Databases, Genetic / statistics & numerical data
  • Databases, Genetic / trends
  • Forecasting
  • Humans
  • Sequence Alignment / statistics & numerical data
  • Sequence Alignment / trends
  • Sequence Analysis, DNA / statistics & numerical data
  • Sequence Analysis, DNA / trends*
  • Software*