Accessible and reproducible mass spectrometry imaging data analysis in Galaxy

Gigascience. 2019 Dec 1;8(12):giz143. doi: 10.1093/gigascience/giz143.

Abstract

Background: Mass spectrometry imaging is increasingly used in biological and translational research because it has the ability to determine the spatial distribution of hundreds of analytes in a sample. Being at the interface of proteomics/metabolomics and imaging, the acquired datasets are large and complex and often analyzed with proprietary software or in-house scripts, which hinders reproducibility. Open source software solutions that enable reproducible data analysis often require programming skills and are therefore not accessible to many mass spectrometry imaging (MSI) researchers.

Findings: We have integrated 18 dedicated mass spectrometry imaging tools into the Galaxy framework to allow accessible, reproducible, and transparent data analysis. Our tools are based on Cardinal, MALDIquant, and scikit-image and enable all major MSI analysis steps such as quality control, visualization, preprocessing, statistical analysis, and image co-registration. Furthermore, we created hands-on training material for use cases in proteomics and metabolomics. To demonstrate the utility of our tools, we re-analyzed a publicly available N-linked glycan imaging dataset. By providing the entire analysis history online, we highlight how the Galaxy framework fosters transparent and reproducible research.

Conclusion: The Galaxy framework has emerged as a powerful analysis platform for the analysis of MSI data with ease of use and access, together with high levels of reproducibility and transparency.

Keywords: Galaxy; MALDI imaging; computational workflows; mass spectrometry imaging; metabolomics; proteomics; reproducibility; spatially resolved mass spectrometry.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Computational Biology / education*
  • Computational Biology / methods
  • Data Analysis
  • Humans
  • Mass Spectrometry
  • Metabolomics / methods*
  • Proteomics / methods*
  • Reproducibility of Results
  • Software
  • Translational Research, Biomedical