Using MetaboAnalyst 4.0 for Comprehensive and Integrative Metabolomics Data Analysis

Curr Protoc Bioinformatics. 2019 Dec;68(1):e86. doi: 10.1002/cpbi.86.

Abstract

MetaboAnalyst (https://www.metaboanalyst.ca) is an easy-to-use web-based tool suite for comprehensive metabolomic data analysis, interpretation, and integration with other omics data. Since its first release in 2009, MetaboAnalyst has evolved significantly to meet the ever-expanding bioinformatics demands from the rapidly growing metabolomics community. In addition to providing a variety of data processing and normalization procedures, MetaboAnalyst supports a wide array of functions for statistical, functional, as well as data visualization tasks. Some of the most widely used approaches include PCA (principal component analysis), PLS-DA (partial least squares discriminant analysis), clustering analysis and visualization, MSEA (metabolite set enrichment analysis), MetPA (metabolic pathway analysis), biomarker selection via ROC (receiver operating characteristic) curve analysis, as well as time series and power analysis. The current version of MetaboAnalyst (4.0) features a complete overhaul of the user interface and significantly expanded underlying knowledge bases (compound database, pathway libraries, and metabolite sets). Three new modules have been added to support pathway activity prediction directly from mass peaks, biomarker meta-analysis, and network-based multi-omics data integration. To enable more transparent and reproducible analysis of metabolomic data, we have released a companion R package (MetaboAnalystR) to complement the web-based application. This article provides an overview of the main functional modules and the general workflow of MetaboAnalyst 4.0, followed by 12 detailed protocols: © 2019 by John Wiley & Sons, Inc. Basic Protocol 1: Data uploading, processing, and normalization Basic Protocol 2: Identification of significant variables Basic Protocol 3: Multivariate exploratory data analysis Basic Protocol 4: Functional interpretation of metabolomic data Basic Protocol 5: Biomarker analysis based on receiver operating characteristic (ROC) curves Basic Protocol 6: Time-series and two-factor data analysis Basic Protocol 7: Sample size estimation and power analysis Basic Protocol 8: Joint pathway analysis Basic Protocol 9: MS peaks to pathway activities Basic Protocol 10: Biomarker meta-analysis Basic Protocol 11: Knowledge-based network exploration of multi-omics data Basic Protocol 12: MetaboAnalystR introduction.

Keywords: MS peaks to pathways; ROC curve; biomarker analysis; chemometrics; joint pathway analysis; meta-analysis; metabolic pathway analysis; metabolite set enrichment analysis; metabolomics; multi-omics integration; network analysis; power analysis; reproducible data analysis; web application.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Biomarkers
  • Computational Biology / methods*
  • Databases, Factual
  • Humans
  • Internet
  • Metabolome
  • Metabolomics / methods*
  • Principal Component Analysis
  • Software*
  • User-Computer Interface

Substances

  • Biomarkers

Grants and funding