Simple Tool for Rapidly Assessing the Quality of Multiplexed Single Cell Proteomics Data

J Am Soc Mass Spectrom. 2023 Dec 6;34(12):2615-2619. doi: 10.1021/jasms.3c00238. Epub 2023 Nov 22.

Abstract

Recent advances in the sensitivity and speed of mass spectrometers coupled with improved sample preparation methods have enabled the field of single cell proteomics to proliferate. While heavy development is occurring in the label free space, dramatic improvements in throughput are provided by multiplexing with tandem mass tags. Hundreds or thousands of single cells can be analyzed with this method, yielding large data sets which may contain poor data arising from loss of material during cell sorting or poor digestion, labeling, and lysis. To date, no tools have been described that can assess data quality prior to data processing. We present herein a lightweight python script and accompanying graphic user interface that can rapidly quantify reporter ion peaks within each MS/MS spectrum in a file. With simple summary reports, we can identify single cell samples that fail to pass a set quality threshold, thus reducing analysis time waste. In addition, this tool, Diagnostic Ion Data Analysis Reduction (DIDAR), will create reduced MGF files containing only spectra possessing a user-specified number of single cell reporter ions. By reducing the number of spectra that have excessive zero values, we can speed up sample processing with little loss in data completeness as these spectra are removed in later stages in data processing workflows. DIDAR and the DIDAR GUI are compatible with all modern operating systems and are available at: https://github.com/orsburn/DIDARSCPQC. All files described in this study are available at www.massive.ucsd.edu as accession MSV000088887.

MeSH terms

  • Ions
  • Proteomics* / methods
  • Software
  • Tandem Mass Spectrometry* / methods

Substances

  • Ions