Integrative open workflow for confident annotation and molecular networking of metabolomics MSE/DIA data

Brief Bioinform. 2024 Jan 22;25(2):bbae013. doi: 10.1093/bib/bbae013.

Abstract

Liquid chromatography coupled with high-resolution mass spectrometry data-independent acquisition (LC-HRMS/DIA), including MSE, enable comprehensive metabolomics analyses though they pose challenges for data processing with automatic annotation and molecular networking (MN) implementation. This motivated the present proposal, in which we introduce DIA-IntOpenStream, a new integrated workflow combining open-source software to streamline MSE data handling. It provides 'in-house' custom database construction, allows the conversion of raw MSE data to a universal format (.mzML) and leverages open software (MZmine 3 and MS-DIAL) all advantages for confident annotation and effective MN data interpretation. This pipeline significantly enhances the accessibility, reliability and reproducibility of complex MSE/DIA studies, overcoming previous limitations of proprietary software and non-universal MS data formats that restricted integrative analysis. We demonstrate the utility of DIA-IntOpenStream with two independent datasets: dataset 1 consists of new data from 60 plant extracts from the Ocotea genus; dataset 2 is a publicly available actinobacterial extract spiked with authentic standard for detailed comparative analysis with existing methods. This user-friendly pipeline enables broader adoption of cutting-edge MS tools and provides value to the scientific community. Overall, it holds promise for speeding up metabolite discoveries toward a more collaborative and open environment for research.

Keywords: Ocotea; chemical annotation; data-independent acquisition; mass spectrometry; open software.

MeSH terms

  • Chromatography, Liquid / methods
  • Mass Spectrometry / methods
  • Metabolomics* / methods
  • Reproducibility of Results
  • Software*
  • Workflow