A transformer architecture for retention time prediction in liquid chromatography mass spectrometry-based proteomics

Thang V Pham; Vinh V Nguyen; Duong Vu; Alex A Henneman; Robin A Richardson; Sander R Piersma; Connie R Jimenez

doi:10.1002/pmic.202200041

A transformer architecture for retention time prediction in liquid chromatography mass spectrometry-based proteomics

Proteomics. 2023 Apr;23(7-8):e2200041. doi: 10.1002/pmic.202200041. Epub 2023 Mar 17.

Authors

Thang V Pham^{1

2}, Vinh V Nguyen³, Duong Vu⁴, Alex A Henneman^{1

2}, Robin A Richardson⁵, Sander R Piersma^{1

2}, Connie R Jimenez^{1

2}

Affiliations

¹ Amsterdam UMC, location Vrije Universiteit Amsterdam, OncoProteomics Laboratory, Medical Oncology, Amsterdam, The Netherlands.
² Cancer Center Amsterdam, Cancer Biology and Immunology, Amsterdam, The Netherlands.
³ University of Engineering and Technology, Vietnam National University, Hanoi, Vietnam.
⁴ Westerdijk Fungal Biodiversity Institute, Uppsalalaan 8, Utrecht, The Netherlands.
⁵ Netherlands eScience Center, The Netherlands.

PMID: 36906835
DOI: 10.1002/pmic.202200041

Abstract

Accurate retention time (RT) prediction is important for spectral library-based analysis in data-independent acquisition mass spectrometry-based proteomics. The deep learning approach has demonstrated superior performance over traditional machine learning methods for this purpose. The transformer architecture is a recent development in deep learning that delivers state-of-the-art performance in many fields such as natural language processing, computer vision, and biology. We assess the performance of the transformer architecture for RT prediction using datasets from five deep learning models Prosit, DeepDIA, AutoRT, DeepPhospho, and AlphaPeptDeep. The experimental results on holdout datasets and independent datasets exhibit state-of-the-art performance of the transformer architecture. The software and evaluation datasets are publicly available for future development in the field.

Keywords: DIA-MS; deep learning; retention time prediction; spectral library; transformer architecture.

Publication types

Research Support, Non-U.S. Gov't

MeSH terms

Chromatography, Liquid / methods
Mass Spectrometry / methods
Peptide Library*
Proteomics* / methods
Software

Substances

Peptide Library