An automated pipeline for high-throughput label-free quantitative proteomics

J Proteome Res. 2013 Apr 5;12(4):1628-44. doi: 10.1021/pr300992u. Epub 2013 Feb 22.

Abstract

We present a computational pipeline for the quantification of peptides and proteins in label-free LC-MS/MS data sets. The pipeline is composed of tools from the OpenMS software framework and is applicable to the processing of large experiments (50+ samples). We describe several enhancements that we have introduced to OpenMS to realize the implementation of this pipeline. They include new algorithms for centroiding of raw data, for feature detection, for the alignment of multiple related measurements, and a new tool for the calculation of peptide and protein abundances. Where possible, we compare the performance of the new algorithms to that of their established counterparts in OpenMS. We validate the pipeline on the basis of two small data sets that provide ground truths for the quantification. There, we also compare our results to those of MaxQuant and Progenesis LC-MS, two popular alternatives for the analysis of label-free data. We then show how our software can be applied to a large heterogeneous data set of 58 LC-MS/MS runs.

MeSH terms

  • Algorithms*
  • Automation
  • Chromatography, Liquid / methods
  • High-Throughput Screening Assays / methods
  • Humans
  • Leptospira interrogans
  • Proteins / analysis*
  • Proteomics / methods*
  • Reproducibility of Results
  • Software
  • Streptococcus pyogenes
  • Tandem Mass Spectrometry / methods*

Substances

  • Proteins