Benefit of In Silico Predicted Spectral Libraries in Data-Independent Acquisition Data Analysis Workflows

An Staes; Teresa Mendes Maia; Sara Dufour; Robbin Bouwmeester; Ralf Gabriels; Lennart Martens; Kris Gevaert; Francis Impens; Simon Devos

doi:10.1021/acs.jproteome.4c00048

J Proteome Res. 2024 Apr 26. doi: 10.1021/acs.jproteome.4c00048. Online ahead of print.

Authors

An Staes^{1,2,3}, Teresa Mendes Maia^{1,2,3}, Sara Dufour^{1,2,3}, Robbin Bouwmeester^{1,2}, Ralf Gabriels^{1,2}, Lennart Martens^{1,2}, Kris Gevaert^{1,2}, Francis Impens^{1,2,3}, Simon Devos^{1,2,3}

Affiliations

¹ VIB Center for Medical Biotechnology, Technologiepark-Zwijnaarde 75, B9052 Ghent, Belgium.
² Department of Biomolecular Medicine, Ghent University, Technologiepark-Zwijnaarde 75, B9052 Ghent, Belgium.
³ VIB Proteomics Core, B9052 Ghent, Belgium.

PMID: 38666436
DOI: 10.1021/acs.jproteome.4c00048

Abstract

Data-independent acquisition (DIA) has become a well-established method for MS-based proteomics. However, the list of options for analyzing this type of data is extensive, and the use of spectral libraries has become an important factor in DIA data analysis. More specifically, the use of in silico predicted libraries is gaining interest. By working with a differential spike-in of human standard proteins (UPS2) in a constant yeast tryptic digest background, we evaluated the sensitivity, precision, and accuracy of in silico predicted libraries in DIA data analysis workflows compared to more established workflows. Three commonly used DIA software tools, DIA-NN, EncyclopeDIA, and Spectronaut, were each tested in spectral library mode and spectral library-free mode. In spectral library mode, we used the independent spectral library prediction tools PROSIT and MS2PIP together with DeepLC, alongside classical data-dependent acquisition (DDA)-based spectral libraries. In total, we benchmarked 12 computational workflows for DIA. Our comparison showed that DIA-NN reached the highest sensitivity while maintaining a good compromise between reproducibility and accuracy, either in library-free mode or when using in silico predicted libraries, pointing to a general benefit of using in silico predicted libraries.
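
The count of 12 workflows can be reproduced with a short sketch. It assumes, as a reading of the abstract rather than a confirmed description of the study design, that each of the three tools was run with four library sources: library-free, PROSIT, MS2PIP combined with DeepLC, and DDA-based. The variable names are hypothetical and used only for illustration.

```python
from itertools import product

# The three DIA software tools named in the abstract.
tools = ["DIA-NN", "EncyclopeDIA", "Spectronaut"]

# Assumed grouping of library sources (an interpretation of the abstract):
# one library-free mode plus three spectral library types.
library_sources = ["library-free", "PROSIT", "MS2PIP + DeepLC", "DDA-based"]

# Every tool paired with every library source.
workflows = list(product(tools, library_sources))

for tool, source in workflows:
    print(f"{tool:<13} {source}")

print(f"Total workflows: {len(workflows)}")
```

Under this assumed grouping, 3 tools times 4 library sources gives the 12 benchmarked workflows.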

Keywords: DIA data analysis; benchmarking; data-independent acquisition (DIA); in silico spectral libraries.