Motif discovery and transcription factor binding sites before and after the next-generation sequencing era

Brief Bioinform. 2013 Mar;14(2):225-37. doi: 10.1093/bib/bbs016. Epub 2012 Apr 19.

Abstract

Motif discovery has been one of the most widely studied problems in bioinformatics ever since genomic and protein sequences have been available. In particular, its application to the de novo prediction of putative over-represented transcription factor binding sites in nucleotide sequences has been, and still is, one of the most challenging flavors of the problem. Recently, novel experimental techniques like chromatin immunoprecipitation (ChIP) have been introduced, permitting the genome-wide identification of protein-DNA interactions. ChIP, applied to transcription factors and coupled with genome tiling arrays (ChIP on Chip) or next-generation sequencing technologies (ChIP-Seq) has opened new avenues in research, as well as posed new challenges to bioinformaticians developing algorithms and methods for motif discovery.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • Animals
  • Binding Sites / genetics
  • Chromatin Immunoprecipitation / statistics & numerical data
  • Computational Biology
  • Consensus Sequence
  • DNA / genetics
  • DNA / metabolism
  • Gene Expression Profiling / statistics & numerical data
  • High-Throughput Nucleotide Sequencing / statistics & numerical data*
  • Humans
  • Regulatory Elements, Transcriptional*
  • Transcription Factors / metabolism*

Substances

  • Transcription Factors
  • DNA