Graph Theoretic Approach for the Analysis of Comprehensive Mass-Spectrometry (MS/MS) Data of Dissolved Organic Matter

Proceedings (IEEE Int Conf Bioinformatics Biomed). 2021 Dec:2021:3742-3746. doi: 10.1109/bibm52615.2021.9669289.

Abstract

Dissolved organic matter (DOM) is a highly complex mixture of organic substances found in aquatic ecosystems. This mixture results from the degradation of primary producers within the ecosystem, groundwater, and the surrounding terrestrial sources. Understanding the chemical structure of DOM is crucial to assessing its impact on aquatic ecosystems. Although multiple studies have addressed the complexity of DOM, the molecular structure of this set of compounds remains unclear. In this work, we present a novel computational framework "Graph-DOM," to assess the comprehensive fragmentation data obtained from the analysis of DOM using the Data Independent Fragmentation strategy with ESI-FT-ICR MS/MS enabling better understanding of the structural complexity of DOM. Graph-DOM uses graph algorithms to dissect a compiled output file obtained from processing hundreds of ultra-high-resolution fragment spectra. Over half a million ordered fragmentation pathways were computed for 764 isolated precursor ions assuming up to seven vector segments categorized as neutral losses (CH2, CH3, O, CH4, H2O, CO, and CO2). Families of structurally related molecules were identified using pathway overlaps, and output files compatible with network visualization software (e.g., Cytoscape) were also generated. Graph-DOM is able to efficiently process all the pathways to discover families within only a few minutes with adjustable parameters for overlap length of fragmentation pathways as well as configuring low abundance CHOS, CHON, and CHONS compounds. Graph-DOM is available at https://github.com/Usman095/Graph-DOM.

Keywords: HPC; dissolved organic matter; dynamic programming; regularity chains.