Assessing the properties of patient-specific treatment effect estimates from causal forest algorithms under essential heterogeneity

BMC Med Res Methodol. 2024 Mar 13;24(1):66. doi: 10.1186/s12874-024-02187-5.

Abstract

Background: Treatment variation from observational data has been used to estimate patient-specific treatment effects. Causal Forest Algorithms (CFAs) developed for this task have unknown properties when treatment effect heterogeneity from unmeasured patient factors influences treatment choice - essential heterogeneity.

Methods: We simulated eleven populations with identical treatment effect distributions based on patient factors. The populations varied in the extent that treatment effect heterogeneity influenced treatment choice. We used the generalized random forest application (CFA-GRF) to estimate patient-specific treatment effects for each population. Average differences between true and estimated effects for patient subsets were evaluated.

Results: CFA-GRF performed well across the population when treatment effect heterogeneity did not influence treatment choice. Under essential heterogeneity, however, CFA-GRF yielded treatment effect estimates that reflected true treatment effects only for treated patients and were on average greater than true treatment effects for untreated patients.

Conclusions: Patient-specific estimates produced by CFAs are sensitive to why patients in real-world practice make different treatment choices. Researchers using CFAs should develop conceptual frameworks of treatment choice prior to estimation to guide estimate interpretation ex post.

Keywords: Causal Forest Algorithm (CFA); Linear probability estimators; Machine learning; Simulation modeling; Treatment effect estimation.

MeSH terms

  • Algorithms*
  • Causality
  • Computer Simulation
  • Humans
  • Patient Selection
  • Patients*
  • Treatment Effect Heterogeneity