Stronger findings from mass spectral data through multi-peak modeling

Tommi Suvitaival; Simon Rogers; Samuel Kaski

doi:10.1186/1471-2105-15-208

Stronger findings from mass spectral data through multi-peak modeling

BMC Bioinformatics. 2014 Jun 19:15:208. doi: 10.1186/1471-2105-15-208.

Authors

Tommi Suvitaival, Simon Rogers, Samuel Kaski¹

Affiliation

¹ Helsinki Institute for Information Technology HIIT, Department of Information and Computer Science, Aalto University, 00076 Espoo, Finland. samuel.kaski@aalto.fi.

Abstract

Background: Mass spectrometry-based metabolomic analysis depends upon the identification of spectral peaks by their mass and retention time. Statistical analysis that follows the identification currently relies on one main peak of each compound. However, a compound present in the sample typically produces several spectral peaks due to its isotopic properties and the ionization process of the mass spectrometer device. In this work, we investigate the extent to which these additional peaks can be used to increase the statistical strength of differential analysis.

Results: We present a Bayesian approach for integrating data of multiple detected peaks that come from one compound. We demonstrate the approach through a simulated experiment and validate it on ultra performance liquid chromatography-mass spectrometry (UPLC-MS) experiments for metabolomics and lipidomics. Peaks that are likely to be associated with one compound can be clustered by the similarity of their chromatographic shape. Changes of concentration between sample groups can be inferred more accurately when multiple peaks are available.

Conclusions: When the sample-size is limited, the proposed multi-peak approach improves the accuracy at inferring covariate effects. An R implementation and data are available at http://research.ics.aalto.fi/mi/software/peakANOVA/.

Publication types

Research Support, Non-U.S. Gov't

MeSH terms

Bayes Theorem
Cluster Analysis
Lipids / analysis
Mass Spectrometry / methods*
Metabolomics

Substances

Lipids