Proof of concept for identifying cystic fibrosis from perspiration samples

Proc Natl Acad Sci U S A. 2019 Dec 3;116(49):24408-24412. doi: 10.1073/pnas.1909630116. Epub 2019 Nov 18.

Abstract

The gold standard for cystic fibrosis (CF) diagnosis is the determination of chloride concentration in sweat. Current testing methodology takes up to 3 h to complete and has recognized shortcomings on its diagnostic accuracy. We present an alternative method for the identification of CF by combining desorption electrospray ionization mass spectrometry and a machine-learning algorithm based on gradient boosted decision trees to analyze perspiration samples. This process takes as little as 2 min, and we determined its accuracy to be 98 ± 2% by cross-validation on analyzing 277 perspiration samples. With the introduction of statistical bootstrap, our method can provide a confidence estimate of our prediction, which helps diagnosis decision-making. We also identified important peaks by the feature selection algorithm and assigned the chemical structure of the metabolites by high-resolution and/or tandem mass spectrometry. We inspected the correlation between mild and severe CFTR gene mutation types and lipid profiles, suggesting a possible way to realize personalized medicine with this noninvasive, fast, and accurate method.

Keywords: cystic fibrosis; desorption electrospray ionization; machine learning; mass spectrometry.

Publication types

  • Research Support, Non-U.S. Gov't
  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Algorithms*
  • Case-Control Studies
  • Chlorides / analysis*
  • Cystic Fibrosis / diagnosis*
  • Cystic Fibrosis Transmembrane Conductance Regulator / genetics
  • Humans
  • Lipids / analysis
  • Lipids / chemistry
  • Lipids / genetics
  • Machine Learning
  • Mutation
  • Proof of Concept Study
  • Reproducibility of Results
  • Spectrometry, Mass, Electrospray Ionization / methods
  • Spectrometry, Mass, Electrospray Ionization / statistics & numerical data*
  • Sweat / chemistry*
  • Tandem Mass Spectrometry / methods
  • Tandem Mass Spectrometry / statistics & numerical data

Substances

  • CFTR protein, human
  • Chlorides
  • Lipids
  • Cystic Fibrosis Transmembrane Conductance Regulator