Discovering regulatory binding-site modules using rule-based learning

Genome Res. 2005 Jun;15(6):856-66. doi: 10.1101/gr.3760605.

Abstract

Transcription factors regulate expression by binding selectively to sequence sites in cis-regulatory regions of genes. It is therefore reasonable to assume that genes regulated by the same transcription factors should all contain the corresponding binding sites in their regulatory regions and exhibit similar expression profiles as measured by, for example, microarray technology. We have used this assumption to analyze genome-wide yeast binding-site and microarray expression data to reveal the combinatorial nature of gene regulation. We obtained IF-THEN rules linking binding-site combinations (binding-site modules) to genes with particular expression profiles, and thereby provided testable hypotheses on the combinatorial coregulation of gene expression. We showed that genes associated with such rules have a significantly higher probability of being bound by the same transcription factors, as indicated by a genome-wide location analysis, than genes associated with only common binding sites or similar expression. Furthermore, we also found that such genes were significantly more often biologically related in terms of Gene Ontology annotations than genes only associated with common binding sites or similar expression. We analyzed expression data collected under different sets of stress conditions and found many binding-site modules that are conserved over several of these condition sets, as well as modules that are specific to particular biological responses. Our results on the reoccurrence of binding sites in different modules provide specific data on how binding sites may be combined to allow a large number of expression outcomes using relatively few transcription factors.

Publication types

  • Research Support, Non-U.S. Gov't
  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Algorithms*
  • Animals
  • Gene Expression Profiling / methods
  • Gene Expression Regulation, Fungal
  • Genome, Fungal*
  • Humans
  • Promoter Regions, Genetic
  • Saccharomyces cerevisiae / genetics*
  • Saccharomyces cerevisiae Proteins / genetics*
  • Sequence Analysis, DNA / methods*
  • Transcription Factors / genetics*
  • Transcription, Genetic

Substances

  • Saccharomyces cerevisiae Proteins
  • Transcription Factors