An improved heuristic algorithm for finding motif signals in DNA sequences

IEEE/ACM Trans Comput Biol Bioinform. 2011 Jul-Aug;8(4):959-75. doi: 10.1109/TCBB.2010.92.

Abstract

The planted (l, d)-motif search problem is a mathematical abstraction of the DNA functional site discovery task. In this paper, we propose a heuristic algorithm that can find planted (l, d)-signals in a given set of DNA sequences. Evaluations on simulated data sets demonstrate that the proposed algorithm outperforms current widely used motif finding algorithms. We also report the results of experiments on real biological data sets.

MeSH terms

  • Algorithms*
  • Animals
  • Base Sequence
  • Binding Sites
  • Cattle
  • Chickens
  • Computational Biology / methods*
  • Computer Simulation
  • Consensus Sequence*
  • DNA / chemistry*
  • DNA / genetics
  • Humans
  • Mice
  • Molecular Sequence Data
  • Nucleic Acid Conformation
  • Pattern Recognition, Automated / methods*
  • Sequence Analysis, DNA / methods*
  • Swine
  • Tetraodontiformes

Substances

  • DNA