Machine-learning based lipid mediator serum concentration patterns allow identification of multiple sclerosis patients with high accuracy

Sci Rep. 2018 Oct 5;8(1):14884. doi: 10.1038/s41598-018-33077-8.

Abstract

Based on increasing evidence suggesting that MS pathology involves alterations in bioactive lipid metabolism, the present analysis was aimed at generating a complex serum lipid-biomarker. Using unsupervised machine-learning, implemented as emergent self-organizing maps of neuronal networks, swarm intelligence and Minimum Curvilinear Embedding, a cluster structure was found in the input data space comprising serum concentrations of d = 43 different lipid-markers of various classes. The structure coincided largely with the clinical diagnosis, indicating that the data provide a basis for the creation of a biomarker (classifier). This was subsequently assessed using supervised machine-learning, implemented as random forests and computed ABC analysis-based feature selection. Bayesian statistics-based biomarker creation was used to map the diagnostic classes of either MS patients (n = 102) or healthy subjects (n = 301). Eight lipid-markers passed the feature selection and comprised GluCerC16, LPA20:4, HETE15S, LacCerC24:1, C16Sphinganine, biopterin and the endocannabinoids PEA and OEA. A complex classifier or biomarker was developed that predicted MS at a sensitivity, specificity and accuracy of approximately 95% in training and test data sets, respectively. The present successful application of serum lipid marker concentrations to MS data is encouraging for further efforts to establish an MS biomarker based on serum lipidomics.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Adult
  • Bayes Theorem
  • Biomarkers / blood
  • Female
  • Humans
  • Lipids / blood*
  • Machine Learning*
  • Male
  • Middle Aged
  • Multiple Sclerosis / blood*
  • Multiple Sclerosis / diagnosis
  • Young Adult

Substances

  • Biomarkers
  • Lipids