Analysis of high-dimensional metabolomics data with complex temporal dynamics using RM-ASCA

PLoS Comput Biol. 2023 Jun 23;19(6):e1011221. doi: 10.1371/journal.pcbi.1011221. eCollection 2023 Jun.

Abstract

The intricate dependency structure of biological "omics" data, particularly those originating from longitudinal intervention studies with frequently sampled repeated measurements renders the analysis of such data challenging. The high-dimensionality, inter-relatedness of multiple outcomes, and heterogeneity in the studied systems all add to the difficulty in deriving meaningful information. In addition, the subtle differences in dynamics often deemed meaningful in nutritional intervention studies can be particularly challenging to quantify. In this work we demonstrate the use of quantitative longitudinal models within the repeated-measures ANOVA simultaneous component analysis+ (RM-ASCA+) framework to capture the dynamics in frequently sampled longitudinal data with multivariate outcomes. We illustrate the use of linear mixed models with polynomial and spline basis expansion of the time variable within RM-ASCA+ in order to quantify non-linear dynamics in a simulation study as well as in a metabolomics data set. We show that the proposed approach presents a convenient and interpretable way to systematically quantify and summarize multivariate outcomes in longitudinal studies while accounting for proper within subject dependency structures.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms*
  • Computer Simulation
  • Linear Models
  • Metabolomics*

Grants and funding

This project has been made possible by the Enabling Technologies Hotels programme of ZonMW (nr. 435005017, grant recipient I.C.W.A. and A.K.S.) with contributions from FrieslandCampina (https://www.frieslandcampina.com/). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.