Harnessing Causal Forests for Epidemiologic Research: Key Consideration

Am J Epidemiol. 2024 Feb 6:kwae003. doi: 10.1093/aje/kwae003. Online ahead of print.

Abstract

Assessing heterogeneous treatment effects (HTEs) is an essential task in epidemiology. The recent integration of machine learning into causal inference has provided a new, flexible tool for evaluating complex HTEs: causal forest. Jawadekar et al. (Am J Epidemiol. 2023) introduce this innovative approach and offer practical guidelines for applied users. Building on their work, this commentary provides additional insights and guidance to promote the understanding and application of causal forest in epidemiologic research. We start with conceptual clarifications, differentiating between honesty and cross-fitting, and exploring the interpretation of estimated conditional average treatment effects. We then delve into the following practical considerations not addressed by Jawadekar et al., including motivations for estimating HTEs, calibration approaches, and ways to leverage causal forest output with examples from simulated data. We conclude by outlining challenges to consider for future advancements and applications of causal forest in epidemiological research.

Keywords: Causal forest; Conditional average treatment effect; Heterogeneous treatment effect.