Bayesian compositional regression with microbiome features via variational inference

BMC Bioinformatics. 2023 May 22;24(1):210. doi: 10.1186/s12859-023-05219-x.

Abstract

The microbiome plays a key role in the health of the human body. Interest often lies in finding features of the microbiome, alongside other covariates, which are associated with a phenotype of interest. One important property of microbiome data, which is often overlooked, is its compositionality as it can only provide information about the relative abundance of its constituting components. Typically, these proportions vary by several orders of magnitude in datasets of high dimensions. To address these challenges we develop a Bayesian hierarchical linear log-contrast model which is estimated by mean field Monte-Carlo co-ordinate ascent variational inference (CAVI-MC) and easily scales to high dimensional data. We use novel priors which account for the large differences in scale and constrained parameter space associated with the compositional covariates. A reversible jump Monte Carlo Markov chain guided by the data through univariate approximations of the variational posterior probability of inclusion, with proposal parameters informed by approximating variational densities via auxiliary parameters, is used to estimate intractable marginal expectations. We demonstrate that our proposed Bayesian method performs favourably against existing frequentist state of the art compositional data analysis methods. We then apply the CAVI-MC to the analysis of real data exploring the relationship of the gut microbiome to body mass index.

Keywords: Compositional; Markov chain Monte Carlo; Microbiome; Singular multivariate normal; Variational inference.

MeSH terms

  • Bayes Theorem
  • Gastrointestinal Microbiome*
  • Humans
  • Linear Models
  • Markov Chains
  • Microbiota*
  • Monte Carlo Method