Probabilistic inference of transcription factor concentrations and gene-specific regulatory activities

Bioinformatics. 2006 Nov 15;22(22):2775-81. doi: 10.1093/bioinformatics/btl473. Epub 2006 Sep 11.

Abstract

Motivation: Quantitative estimation of the regulatory relationship between transcription factors and genes is a fundamental stepping stone when trying to develop models of cellular processes. Recent experimental high-throughput techniques, such as Chromatin Immunoprecipitation (ChIP) provide important information about the architecture of the regulatory networks in the cell. However, it is very difficult to measure the concentration levels of transcription factor proteins and determine their regulatory effect on gene transcription. It is therefore an important computational challenge to infer these quantities using gene expression data and network architecture data.

Results: We develop a probabilistic state space model that allows genome-wide inference of both transcription factor protein concentrations and their effect on the transcription rates of each target gene from microarray data. We use variational inference techniques to learn the model parameters and perform posterior inference of protein concentrations and regulatory strengths. The probabilistic nature of the model also means that we can associate credibility intervals to our estimates, as well as providing a tool to detect which binding events lead to significant regulation. We demonstrate our model on artificial data and on two yeast datasets in which the network structure has previously been obtained using ChIP data. Predictions from our model are consistent with the underlying biology and offer novel quantitative insights into the regulatory structure of the yeast cell.

Availability: MATLAB code is available from http://umber.sbs.man.ac.uk/resources/puma

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • Chromatin Immunoprecipitation
  • Computational Biology / methods*
  • Gene Expression Profiling
  • Gene Expression Regulation*
  • Gene Expression Regulation, Fungal*
  • Genes, Fungal*
  • Internet
  • Models, Statistical
  • Oligonucleotide Array Sequence Analysis
  • Probability
  • Regulatory Elements, Transcriptional
  • Software
  • Transcription Factors / chemistry*
  • Transcription, Genetic

Substances

  • Transcription Factors