A Bayesian calibration model for combining different pre-processing methods in Affymetrix chips

BMC Bioinformatics. 2008 Dec 1:9:512. doi: 10.1186/1471-2105-9-512.

Abstract

Background: In gene expression studies a key role is played by the so called "pre-processing", a series of steps designed to extract the signal and account for the sources of variability due to the technology used rather than to biological differences between the RNA samples. At the moment there is no commonly agreed gold standard pre-processing method and each researcher has the responsibility to choose one method, incurring the risk of false positive and false negative features arising from the particular method chosen.

Results: We propose a Bayesian calibration model that makes use of the information provided by several pre-processing methods and we show that this model gives a better assessment of the 'true' unknown differential expression between two conditions. We demonstrate how to estimate the posterior distribution of the differential expression values of interest from the combined information.

Conclusion: On simulated data and on the spike-in Latin Square dataset from Affymetrix the Bayesian calibration model proves to have more power than each pre-processing method. Its biological interest is demonstrated through an experimental example on publicly available data.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • Bayes Theorem
  • Calibration
  • Gene Expression Profiling / methods*
  • Models, Statistical
  • Oligonucleotide Array Sequence Analysis / methods*
  • RNA / chemistry

Substances

  • RNA