msmsEDA & msmsTests: Label-Free Differential Expression by Spectral Counts

Methods Mol Biol. 2023:2426:197-242. doi: 10.1007/978-1-0716-1967-4_10.

Abstract

msmsTests is an R/Bioconductor package providing functions for statistical tests on label-free LC-MS/MS data based on spectral counts. These functions aim to discover proteins that are differentially expressed between two biological conditions. Three tests are available: Poisson GLM regression, quasi-likelihood GLM regression, and the negative binomial model of the edgeR package. All three models admit blocking factors to control for nuisance variables. To ensure a good level of reproducibility, a post-test filter is available in which (1) a minimum effect size considered biologically relevant and (2) a minimum expression level in the most abundant condition may be set. A companion package, msmsEDA, provides functions to explore datasets based on MS/MS spectral counts. Its graphics help to identify outliers, detect possible batch factors, and check the effects of different normalization strategies. This protocol illustrates the use of both packages on two examples: a purely spike-in experiment of 48 human proteins in a standard yeast cell lysate, and a cancer cell-line secretome dataset requiring a biological normalization.
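
The logic of the post-test filter can be sketched in a few lines. The following Python snippet is only an illustration of the idea and is not the msmsTests API: the class, function names, thresholds, and the pseudocount used to avoid division by zero are all assumptions made for this example, and the four input rows are made-up values rather than results from the protocol.

```python
import math
from dataclasses import dataclass


@dataclass
class ProteinResult:
    """Hypothetical per-protein test summary (not an msmsTests structure)."""
    name: str
    p_adj: float    # multiplicity-adjusted p-value from any of the three tests
    mean_a: float   # mean spectral counts in condition A
    mean_b: float   # mean spectral counts in condition B


def log2_fold_change(r: ProteinResult, pseudo: float = 0.5) -> float:
    # The pseudocount is an assumption of this sketch; it keeps the ratio
    # finite when a protein has zero counts in one condition.
    return math.log2((r.mean_b + pseudo) / (r.mean_a + pseudo))


def passes_post_test_filter(
    r: ProteinResult,
    alpha: float = 0.05,        # significance threshold (illustrative)
    min_abs_lfc: float = 1.0,   # (1) minimum effect size, here a 2-fold change
    min_counts: float = 2.0,    # (2) minimum expression in the most abundant condition
) -> bool:
    significant = r.p_adj <= alpha
    large_effect = abs(log2_fold_change(r)) >= min_abs_lfc
    expressed = max(r.mean_a, r.mean_b) >= min_counts
    return significant and large_effect and expressed


# Made-up values, chosen so that each row fails a different criterion.
results = [
    ProteinResult("P1", 0.001, 10.0, 40.0),  # significant, large effect, well expressed
    ProteinResult("P2", 0.001, 0.2, 1.5),    # too few counts in both conditions
    ProteinResult("P3", 0.001, 20.0, 24.0),  # effect size too small
    ProteinResult("P4", 0.200, 5.0, 50.0),   # not significant
]

selected = [r.name for r in results if passes_post_test_filter(r)]
print(selected)
```

Requiring all three conditions at once means that a small p-value alone is not enough: proteins with negligible fold changes or very low counts are discarded, which is the reproducibility rationale given above.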

Keywords: Batch effects; Bioconductor; Biomarker discovery; Label free; Normalization; Reproducibility; Secretomes; Spectral counts; msmsEDA; msmsTests.

MeSH terms

  • Chromatography, Liquid
  • Humans
  • Proteomics* / methods
  • Reproducibility of Results
  • Saccharomyces cerevisiae
  • Software*
  • Tandem Mass Spectrometry / methods