Sequence assembly validation by multiple restriction digest fragment coverage analysis

Proc Int Conf Intell Syst Mol Biol. 1998:6:140-7.

Abstract

DNA sequence analysis depends on the accurate assembly of fragment reads for the determination of a consensus sequence. This report examines the possibility of analyzing multiple, independent restriction digests as a method for testing the fidelity of sequence assembly. A dynamic programming algorithm to determine the maximum likelihood alignment of error prone electrophoretic mobility data to the expected fragment mobilities given the consensus sequence and restriction enzymes is derived and used to assess the likelihood of detecting rearrangements in genomic sequencing projects. The method is shown to reliably detect errors in sequence fragment assembly without the necessity of making reference to an overlying physical map. An html form-based interface is available at http:/(/)www.ibc.wustl.edu/services/validate. html.

Publication types

  • Comparative Study
  • Research Support, U.S. Gov't, P.H.S.

MeSH terms

  • Algorithms
  • Artificial Intelligence
  • Base Sequence
  • DNA / genetics
  • DNA Fingerprinting
  • Reproducibility of Results
  • Restriction Mapping / methods*
  • Restriction Mapping / statistics & numerical data
  • Sequence Alignment / methods
  • Sequence Alignment / statistics & numerical data
  • Sequence Analysis, DNA / methods*
  • Sequence Analysis, DNA / statistics & numerical data

Substances

  • DNA