Missing value imputation for LC-MS metabolomics data by incorporating metabolic network and adduct ion relations

Bioinformatics. 2018 May 1;34(9):1555-1561. doi: 10.1093/bioinformatics/btx816.

Abstract

Motivation: Metabolomics data generated from liquid chromatography-mass spectrometry platforms often contain missing values. Existing imputation methods do not consider underlying feature relations and the metabolic network information. As a result, the imputation results may not be optimal.

Results: We proposed an imputation algorithm that incorporates the existing metabolic network, adduct ion relations even for unknown compounds, as well as linear and nonlinear associations between feature intensities to build a feature-level network. The algorithm uses support vector regression for missing value imputation based on features in the neighborhood on the network. We compared our proposed method with methods being widely used. As judged by the normalized root mean squared error in real data-based simulations, our proposed methods can achieve better accuracy.

Availability and implementation: The R package is available at http://web1.sph.emory.edu/users/tyu8/MINMA.

Contact: jiankang@umich.edu or tianwei.yu@emory.edu.

Supplementary information: Supplementary data are available at Bioinformatics online.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • Chromatography, Liquid / methods
  • Coronary Disease / metabolism
  • Humans
  • Mass Spectrometry / methods*
  • Metabolic Networks and Pathways*
  • Metabolomics / methods*
  • Software*