Model-averaged confounder adjustment for estimating multivariate exposure effects with linear regression

Biometrics. 2018 Sep;74(3):1034-1044. doi: 10.1111/biom.12860. Epub 2018 Mar 22.

Abstract

In environmental and nutritional epidemiology and in many other fields, there is increasing interest in estimating the effect of simultaneous exposure to several agents (e.g., multiple nutrients, pesticides, or air pollutants) on a health outcome. We consider estimating the effect of a multivariate exposure that includes several continuous agents and their interactions-on an outcome, when the true confounding variables are an unknown subset of a potentially large (relative to sample size) set of measured covariates. Our approach is rooted in the ideas of Bayesian model averaging: the exposure effect is estimated as a weighted average of the estimated exposure effects obtained under several linear regression models that include different sets of the potential confounders. We introduce a data-driven prior that assigns to the likely confounders a higher probability of being included into the regression model. We show that our approach can also be formulated as a penalized likelihood formulation with an interpretable tuning parameter. Through a simulation study, we demonstrate that the proposed approach identifies parsimonious models that are fully adjusted for observed confounding and estimates the multivariate exposure effect with smaller mean squared error compared to several alternatives. We apply the method to an Environmental Wide Association Study using National Heath and Nutrition Examination Survey to estimate the effect of mixtures of nutrients and pesticides on lipid levels.

Keywords: Bayesian model averaging; Confounding; Exposome; Model uncertainty; Multiple exposures; Multivariate exposure effects.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't
  • Research Support, U.S. Gov't, Non-P.H.S.
  • Research Support, U.S. Gov't, P.H.S.

MeSH terms

  • Bayes Theorem
  • Computer Simulation
  • Confounding Factors, Epidemiologic*
  • Environmental Exposure*
  • Humans
  • Linear Models*
  • Lipid Metabolism / drug effects
  • Multivariate Analysis*
  • Nutrients / pharmacology
  • Pesticides / pharmacology
  • Risk Factors

Substances

  • Pesticides