metaboprep: an R package for preanalysis data description and processing

Bioinformatics. 2022 Mar 28;38(7):1980-1987. doi: 10.1093/bioinformatics/btac059.

Abstract

Motivation: Metabolomics is an increasingly common part of health research and there is need for preanalytical data processing. Researchers typically need to characterize the data and to exclude errors within the context of the intended analysis. Whilst some preprocessing steps are common, there is currently a lack of standardization and reporting transparency for these procedures.

Results: Here, we introduce metaboprep, a standardized data processing workflow to extract and characterize high quality metabolomics datasets. The package extracts data from preformed worksheets, provides summary statistics and enables the user to select samples and metabolites for their analysis based on a set of quality metrics. A report summarizing quality metrics and the influence of available batch variables on the data are generated for the purpose of open disclosure. Where possible, we provide users flexibility in defining their own selection thresholds.

Availability and implementation: metaboprep is an open-source R package available at https://github.com/MRCIEU/metaboprep.

Supplementary information: Supplementary data are available at Bioinformatics online.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Humans
  • Metabolomics*
  • Research Personnel
  • Software*
  • Workflow