Cross-species de novo identification of cis-regulatory modules with GibbsModule: application to gene regulation in embryonic stem cells

Genome Res. 2008 Aug;18(8):1325-35. doi: 10.1101/gr.072769.107. Epub 2008 May 15.

Abstract

We introduce the GibbsModule algorithm for de novo detection of cis-regulatory motifs and modules in eukaryote genomes. GibbsModule models the coexpressed genes within one species as sharing a core cis-regulatory motif and each homologous gene group as sharing a homologous cis-regulatory module (CRM), characterized by a similar composition of motifs. Without using a predetermined alignment result, GibbsModule iteratively updates the core motif shared by coexpressed genes and traces the homologous CRMs that contain the core motif. GibbsModule achieved substantial improvements in both precision and recall as compared with peer algorithms on a number of synthetic and real data sets. Applying GibbsModule to analyze the binding regions of the Krüppel-like factor (KLF) transcription factor in embryonic stem cells (ESCs), we discovered a motif that differs from a previously published KLF motif identified by a SELEX experiment, but the new motif is consistent with mutagenesis analysis. The SOX2 motif was found to be a collaborating motif to the KLF motif in ESCs. We used quantitative chromatin immunoprecipitation (ChIP) analysis to test whether GibbsModule could distinguish functional and nonfunctional binding sites. All seven tested binding sites in GibbsModule-predicted CRMs had higher ChIP signals as compared with the other seven tested binding sites located outside of predicted CRMs. GibbsModule is available at (http://biocomp.bioen.uiuc.edu/GibbsModule).

Publication types

  • Evaluation Study
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms*
  • Animals
  • Binding Sites
  • Cells, Cultured
  • DNA-Binding Proteins / metabolism
  • Embryonic Stem Cells / metabolism*
  • Enhancer Elements, Genetic
  • Gene Expression Regulation
  • Genomics
  • HMGB Proteins / metabolism
  • Humans
  • Kruppel-Like Factor 4
  • Kruppel-Like Transcription Factors / metabolism
  • Mice
  • Muscle Cells / metabolism
  • Regulatory Elements, Transcriptional*
  • SOXB1 Transcription Factors
  • Transcription Factors / metabolism*

Substances

  • DNA-Binding Proteins
  • HMGB Proteins
  • KLF4 protein, human
  • Klf4 protein, mouse
  • Kruppel-Like Factor 4
  • Kruppel-Like Transcription Factors
  • SOX2 protein, human
  • SOXB1 Transcription Factors
  • Sox2 protein, mouse
  • Transcription Factors