Empirical Bayes analysis of quantitative proteomics experiments

PLoS One. 2009 Oct 14;4(10):e7454. doi: 10.1371/journal.pone.0007454.

Abstract

Background: Advances in mass spectrometry-based proteomics have enabled the incorporation of proteomic data into systems approaches to biology. However, development of analytical methods has lagged behind. Here we describe an empirical Bayes framework for quantitative proteomics data analysis. The method provides a statistical description of each experiment, including the number of proteins that differ in abundance between 2 samples, the experiment's statistical power to detect them, and the false-positive probability of each protein.

Methodology/principal findings: We analyzed 2 types of mass spectrometric experiments. First, we showed that the method identified the protein targets of small-molecules in affinity purification experiments with high precision. Second, we re-analyzed a mass spectrometric data set designed to identify proteins regulated by microRNAs. Our results were supported by sequence analysis of the 3' UTR regions of predicted target genes, and we found that the previously reported conclusion that a large fraction of the proteome is regulated by microRNAs was not supported by our statistical analysis of the data.

Conclusions/significance: Our results highlight the importance of rigorous statistical analysis of proteomic data, and the method described here provides a statistical framework to robustly and reliably interpret such data.

Publication types

  • Research Support, N.I.H., Extramural

MeSH terms

  • 3' Untranslated Regions
  • Bayes Theorem
  • Computational Biology / methods
  • False Positive Reactions
  • HeLa Cells
  • Humans
  • Mass Spectrometry / methods*
  • Models, Statistical
  • Normal Distribution
  • Proteome*
  • Proteomics / methods*
  • Reproducibility of Results
  • Sequence Analysis, Protein / methods*

Substances

  • 3' Untranslated Regions
  • Proteome