Expression-guided in silico evaluation of candidate cis regulatory codes for Drosophila muscle founder cells

PLoS Comput Biol. 2006 May;2(5):e53. doi: 10.1371/journal.pcbi.0020053. Epub 2006 May 26.

Abstract

While combinatorial models of transcriptional regulation can be inferred for metazoan systems from a priori biological knowledge, validation requires extensive and time-consuming experimental work. Thus, there is a need for computational methods that can evaluate hypothesized cis regulatory codes before the difficult task of experimental verification is undertaken. We have developed a novel computational framework (termed "CodeFinder") that integrates transcription factor binding site and gene expression information to evaluate whether a hypothesized transcriptional regulatory model (TRM; i.e., a set of co-regulating transcription factors) is likely to target a given set of co-expressed genes. Our basic approach is to simultaneously predict cis regulatory modules (CRMs) associated with a given gene set and quantify the enrichment for combinatorial subsets of transcription factor binding site motifs comprising the hypothesized TRM within these predicted CRMs. As a model system, we have examined a TRM experimentally demonstrated to drive the expression of two genes in a sub-population of cells in the developing Drosophila mesoderm, the somatic muscle founder cells. This TRM was previously hypothesized to be a general mode of regulation for genes expressed in this cell population. In contrast, the present analyses suggest that a modified form of this cis regulatory code applies to only a subset of founder cell genes, those whose gene expression responds to specific genetic perturbations in a similar manner to the gene on which the original model was based. We have confirmed this hypothesis by experimentally discovering six (out of 12 tested) new CRMs driving expression in the embryonic mesoderm, four of which drive expression in founder cells.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Amino Acid Motifs
  • Animals
  • Cluster Analysis
  • Computational Biology / methods*
  • Drosophila melanogaster / embryology*
  • Drosophila melanogaster / genetics*
  • Gene Expression Regulation, Developmental*
  • Mesoderm / metabolism
  • Models, Biological
  • Muscles / embryology*
  • Oligonucleotide Array Sequence Analysis
  • Promoter Regions, Genetic
  • Regulatory Sequences, Nucleic Acid*
  • Wings, Animal / embryology