Performance evaluation of DNA motif discovery programs

Bioinformation. 2008;3(5):205-12. doi: 10.6026/97320630003205. Epub 2008 Dec 31.

Abstract

Methods for identifying transcription factor binding sites have proved useful for deciphering genetic regulatory networks. However, the strengths and weaknesses of many of the available web tools are not fully understood. Here, we designed a comprehensive set of performance measures and benchmarked sequence-based motif discovery tools using large-scale datasets (derived from the Escherichia coli genome and the RegulonDB database). The benchmark study showed that nucleotide-based and binding-site-based prediction accuracy is often low, whereas activator-binding-site-based prediction accuracy is high.
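
The abstract does not define its performance measures, so the following is only a minimal sketch of how nucleotide-based and binding-site-based accuracy are commonly computed in motif discovery benchmarks. It is not the authors' implementation. The function names, the half-open `(start, end)` interval convention, and the `min_overlap` threshold are assumptions made for illustration.

```python
# Illustrative sketch only: names and conventions here are hypothetical,
# not taken from the paper.

def positions(sites):
    """Expand half-open (start, end) intervals into a set of positions."""
    covered = set()
    for start, end in sites:
        covered.update(range(start, end))
    return covered


def nucleotide_level(known, predicted):
    """Return (sensitivity, positive predictive value) per nucleotide."""
    k, p = positions(known), positions(predicted)
    tp = len(k & p)   # nucleotides in both known and predicted sites
    fn = len(k - p)   # known-site nucleotides that were missed
    fp = len(p - k)   # predicted nucleotides outside any known site
    sensitivity = tp / (tp + fn) if (tp + fn) else 0.0
    ppv = tp / (tp + fp) if (tp + fp) else 0.0
    return sensitivity, ppv


def site_level_sensitivity(known, predicted, min_overlap=1):
    """Fraction of known sites overlapped by at least one prediction.

    min_overlap is an assumed threshold (in nucleotides); benchmarks
    differ in how much overlap counts as a hit.
    """
    def overlap(a, b):
        return max(0, min(a[1], b[1]) - max(a[0], b[0]))

    hits = sum(
        1 for site in known
        if any(overlap(site, pred) >= min_overlap for pred in predicted)
    )
    return hits / len(known) if known else 0.0


if __name__ == "__main__":
    known = [(10, 20), (40, 50)]       # toy annotated sites
    predicted = [(15, 25), (60, 70)]   # toy tool output
    print(nucleotide_level(known, predicted))
    print(site_level_sensitivity(known, predicted))
```

With these toy intervals, one of the two known sites is partially recovered, so the site-level score is higher than the nucleotide-level score. This shows how the two kinds of measure can diverge for the same prediction.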

Keywords: DNA binding site; accuracy; evaluation; motif discovery; regulatory proteins.

Publication types

  • Retracted Publication