The General Explanation Method with NMR Spectroscopy Enables the Identification of Metabolite Profiles Specific for Normal and Tumor Cell Lines

Chembiochem. 2018 Oct 4;19(19):2066-2071. doi: 10.1002/cbic.201800392. Epub 2018 Sep 14.

Abstract

Machine learning models in metabolomics, despite their great prediction accuracy, are still not widely adopted owing to the lack of an efficient explanation for their predictions. In this study, we propose the use of the general explanation method to explain the predictions of a machine learning model to gain detailed insight into metabolic differences between biological systems. The method was tested on a dataset of 1 H NMR spectra acquired on normal lung and mesothelial cell lines and their tumor counterparts. Initially, the random forests and artificial neural network models were applied to the dataset, and excellent prediction accuracy was achieved. The predictions of the models were explained with the general explanation method, which enabled identification of discriminating metabolic concentration differences between individual cell lines and enabled the construction of their specific metabolic concentration profiles. This intuitive and robust method holds great promise for in-depth understanding of the mechanisms that underline phenotypes as well as for biomarker discovery in complex diseases.

Keywords: NMR spectroscopy; cancer; general explanation method; machine learning; metabolomics.

Publication types

  • Comparative Study
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Cell Line
  • Datasets as Topic
  • Humans
  • Lung / cytology*
  • Lung Neoplasms / pathology*
  • Machine Learning
  • Magnetic Resonance Spectroscopy / methods
  • Metabolome*
  • Metabolomics / methods*