Distributional anchor regression

Lucas Kook; Beate Sick; Peter Bühlmann

doi:10.1007/s11222-022-10097-z

Distributional anchor regression

Stat Comput. 2022;32(3):39. doi: 10.1007/s11222-022-10097-z. Epub 2022 May 13.

Authors

Lucas Kook^{1

2}, Beate Sick^{1

2}, Peter Bühlmann³

Affiliations

¹ Epidemiology, Biostatistics and Prevention Institute, University of Zurich, 8001 Zurich, Switzerland.
² Institute of Data Analysis and Process Design, Zurich University of Applied Sciences, 8400 Winterthur, Switzerland.
³ Seminar for Statistics ETH Zurich, 8092 Zurich, Switzerland.

Abstract

Prediction models often fail if train and test data do not stem from the same distribution. Out-of-distribution (OOD) generalization to unseen, perturbed test data is a desirable but difficult-to-achieve property for prediction models and in general requires strong assumptions on the data generating process (DGP). In a causally inspired perspective on OOD generalization, the test data arise from a specific class of interventions on exogenous random variables of the DGP, called anchors. Anchor regression models, introduced by Rothenhäusler et al. (J R Stat Soc Ser B 83(2):215-246, 2021. 10.1111/rssb.12398), protect against distributional shifts in the test data by employing causal regularization. However, so far anchor regression has only been used with a squared-error loss which is inapplicable to common responses such as censored continuous or ordinal data. Here, we propose a distributional version of anchor regression which generalizes the method to potentially censored responses with at least an ordered sample space. To this end, we combine a flexible class of parametric transformation models for distributional regression with an appropriate causal regularizer under a more general notion of residuals. In an exemplary application and several simulation scenarios we demonstrate the extent to which OOD generalization is possible.

Keywords: Anchor regression; Covariate shift; Diluted causality; Distributional regression; Out-of-distribution generalization; Transformation models.