A transformer architecture for retention time prediction in liquid chromatography mass spectrometry-based proteomics

Proteomics. 2023 Apr;23(7-8):e2200041. doi: 10.1002/pmic.202200041. Epub 2023 Mar 17.

Abstract

Accurate retention time (RT) prediction is important for spectral library-based analysis in data-independent acquisition mass spectrometry-based proteomics. The deep learning approach has demonstrated superior performance over traditional machine learning methods for this purpose. The transformer architecture is a recent development in deep learning that delivers state-of-the-art performance in many fields such as natural language processing, computer vision, and biology. We assess the performance of the transformer architecture for RT prediction using datasets from five deep learning models Prosit, DeepDIA, AutoRT, DeepPhospho, and AlphaPeptDeep. The experimental results on holdout datasets and independent datasets exhibit state-of-the-art performance of the transformer architecture. The software and evaluation datasets are publicly available for future development in the field.

Keywords: DIA-MS; deep learning; retention time prediction; spectral library; transformer architecture.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Chromatography, Liquid / methods
  • Mass Spectrometry / methods
  • Peptide Library*
  • Proteomics* / methods
  • Software

Substances

  • Peptide Library