IDIA: An Integrative Signal Extractor for Data-Independent Acquisition Proteomics

Proceedings (IEEE Int Conf Bioinformatics Biomed). 2022 Dec:2022:266-269. doi: 10.1109/bibm55620.2022.9994873. Epub 2023 Jan 2.

Abstract

In proteomics, data-independent acquisition (DIA) has been shown to provide less biased and more reproducible results than data-dependent acquisition. Recently, many researchers have developed a series of methods to identify peptides and proteins by using spectrum libraries for DIA data. However, spectrum libraries are not always available for novel organisms or microbial communities. To detect peptides and proteins without a spectrum library, we developed IDIA, a library-free method using DIA data to generate pseudo-spectra that can be searched using conventional sequence database searching software. IDIA integrates two isotopic trace detection strategies and employs B-spline and Gaussian filters to help extract high-quality pseudo-spectra from the complex DIA data. The experimental results on human and yeast data demonstrated that our approach remarkably produced more peptide and protein identifications than the two state-of-the-art library-free methods, i.e., DIA-Umpire and Group-DIA. IDIA is freely available under the GNU GPL license at https://github.com/Biocomputing-Research-Group/IDIA.

Keywords: DIA; Feature Detection; Proteomics; Pseudo-spectra.