A learned embedding for efficient joint analysis of millions of mass spectra

Wout Bittremieux; Damon H May; Jeffrey Bilmes; William Stafford Noble

doi:10.1038/s41592-022-01496-1

A learned embedding for efficient joint analysis of millions of mass spectra

Nat Methods. 2022 Jun;19(6):675-678. doi: 10.1038/s41592-022-01496-1. Epub 2022 May 30.

Authors

Wout Bittremieux¹, Damon H May², Jeffrey Bilmes^{3

4}, William Stafford Noble^{5

6}

Affiliations

¹ Skaggs School of Pharmacy and Pharmaceutical Science, University of California San Diego, La Jolla, CA, USA.
² Department of Genome Sciences, University of Washington, Seattle, WA, USA.
³ Department of Electrical and Computer Engineering, University of Washington, Seattle, WA, USA.
⁴ Paul G. Allen School of Computer Science and Engineering, University of Washington, Seattle, WA, USA.
⁵ Department of Genome Sciences, University of Washington, Seattle, WA, USA. william-noble@uw.edu.
⁶ Paul G. Allen School of Computer Science and Engineering, University of Washington, Seattle, WA, USA. william-noble@uw.edu.

Abstract

Computational methods that aim to exploit publicly available mass spectrometry repositories rely primarily on unsupervised clustering of spectra. Here we trained a deep neural network in a supervised fashion on the basis of previous assignments of peptides to spectra. The network, called 'GLEAMS', learns to embed spectra in a low-dimensional space in which spectra generated by the same peptide are close to one another. We applied GLEAMS for large-scale spectrum clustering, detecting groups of unidentified, proximal spectra representing the same peptide. We used these clusters to explore the dark proteome of repeatedly observed yet consistently unidentified mass spectra.

Publication types

Research Support, N.I.H., Extramural

MeSH terms

Algorithms
Cluster Analysis
Neural Networks, Computer
Peptides* / chemistry
Proteome / analysis
Tandem Mass Spectrometry* / methods

Substances

Peptides
Proteome

Grants and funding

R01 GM121818/GM/NIGMS NIH HHS/United States