Trace, Machine Learning of Signal Images for Trace-Sensitive Mass Spectrometry: A Case Study from Single-Cell Metabolomics

Anal Chem. 2019 May 7;91(9):5768-5776. doi: 10.1021/acs.analchem.8b05985. Epub 2019 Apr 15.

Abstract

Recent developments in high-resolution mass spectrometry (HRMS) technology enabled ultrasensitive detection of proteins, peptides, and metabolites in limited amounts of samples, even single cells. However, extraction of trace-abundance signals from complex data sets ( m/ z value, separation time, signal abundance) that result from ultrasensitive studies requires improved data processing algorithms. To bridge this gap, we here developed "Trace", a software framework that incorporates machine learning (ML) to automate feature selection and optimization for the extraction of trace-level signals from HRMS data. The method was validated using primary (raw) and manually curated data sets from single-cell metabolomic studies of the South African clawed frog ( Xenopus laevis) embryo using capillary electrophoresis electrospray ionization HRMS. We demonstrated that Trace combines sensitivity, accuracy, and robustness with high data processing throughput to recognize signals, including those previously identified as metabolites in single-cell capillary electrophoresis HRMS measurements that we conducted over several months. These performance metrics combined with a compatibility with MS data in open-source (mzML) format make Trace an attractive software resource to facilitate data analysis for studies employing ultrasensitive high-resolution MS.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Animals
  • Electrophoresis, Capillary
  • Embryo, Nonmammalian / metabolism*
  • Machine Learning*
  • Metabolome*
  • Single-Cell Analysis / methods*
  • Software*
  • Spectrometry, Mass, Electrospray Ionization / methods*
  • Xenopus laevis / metabolism*