Experimental design schemes for learning Boolean network models

Bioinformatics. 2014 Sep 1;30(17):i445-52. doi: 10.1093/bioinformatics/btu451.

Abstract

Motivation: A holy grail of biological research is a working model of the cell. Current modeling frameworks, especially in the protein-protein interaction domain, are mostly topological in nature, calling for stronger and more expressive network models. One promising alternative is logic-based or Boolean network modeling, which was successfully applied to model signaling regulatory circuits in human. Learning such models requires observing the system under a sufficient number of different conditions. To date, the amount of measured data is the main bottleneck in learning informative Boolean models, underscoring the need for efficient experimental design strategies.

Results: We developed novel design approaches that greedily select an experiment to be performed so as to maximize the difference or the entropy in the results it induces with respect to current best-fit models. Unique to our maximum difference approach is the ability to account for all (possibly exponential number of) Boolean models displaying high fit to the available data. We applied both approaches to simulated and real data from the EFGR and IL1 signaling systems in human. We demonstrate the utility of the developed strategies in substantially improving on a random selection approach. Our design schemes highlight the redundancy in these datasets, leading up to 11-fold savings in the number of experiments to be performed.

Availability and implementation: Source code will be made available upon acceptance of the manuscript.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • ErbB Receptors / metabolism
  • Humans
  • Interleukin-1 / metabolism
  • Models, Biological*
  • Research Design
  • Signal Transduction*

Substances

  • Interleukin-1
  • ErbB Receptors