On the synthesis of DNA error correcting codes

Biosystems. 2012 Oct;110(1):1-8. doi: 10.1016/j.biosystems.2012.06.005. Epub 2012 Jul 6.

Abstract

DNA error correcting codes over the edit metric consist of embeddable markers for sequencing projects that are tolerant of sequencing errors. When a genetic library has multiple sources for its sequences, use of embedded markers permits tracking of sequence origin. This study compares different methods for synthesizing DNA error correcting codes. A new code-finding technique called the salmon algorithm is introduced and used to improve the size of best known codes in five difficult cases of the problem, including the most studied case: length six, distance three codes. An updated table of the best known code sizes with 36 improved values, resulting from three different algorithms, is presented. Mathematical background results for the problem from multiple sources are summarized. A discussion of practical details that arise in application, including biological design and decoding, is also given in this study.

MeSH terms

  • Algorithms*
  • Computational Biology
  • DNA Repair
  • DNA Replication
  • DNA*
  • Gene Library

Substances

  • DNA