Gradient boosting for linear mixed models

Colin Griesbach; Benjamin Säfken; Elisabeth Waldmann

doi:10.1515/ijb-2020-0136

Int J Biostat. 2021 Jan 13;17(2):317-329. doi: 10.1515/ijb-2020-0136.

Authors

Colin Griesbach¹, Benjamin Säfken², Elisabeth Waldmann¹

Affiliations

¹ Department of Medical Informatics, Biometry and Epidemiology, Friedrich-Alexander-Universität Erlangen-Nürnberg, Erlangen, Germany.
² Chair of Statistics, Georg-August-Universität Göttingen, Göttingen, Germany.

PMID: 34826371

Abstract

Gradient boosting, which originates in statistical learning, is a well-established framework for estimating and selecting predictor effects in a wide range of regression models by adapting concepts from classification theory. Current boosting approaches also offer methods that account for random effects and thus enable prediction in mixed models for longitudinal and clustered data. However, these approaches have several flaws: they produce unbalanced effect selection with falsely induced shrinkage and a low convergence rate on the one hand, and biased estimates of the random effects on the other. We therefore propose a new boosting algorithm that explicitly accounts for the random structure by excluding it from the selection procedure, properly correcting the random effects estimates, and additionally providing likelihood-based estimation of the random effects variance structure. The new algorithm offers an organic and unbiased fitting approach, as shown via simulations and data examples.
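
The idea of keeping the random structure out of the selection procedure can be illustrated with a minimal sketch. This is not the authors' algorithm: it assumes a random intercept only, squared-error loss, and a fixed ridge penalty `lam` in place of the likelihood-based variance estimation described above. The function name `boost_lmm_sketch` and all of its parameters are hypothetical and chosen for illustration.

```python
import numpy as np

def boost_lmm_sketch(X, y, cluster, n_iter=300, nu=0.1, lam=1.0):
    """Component-wise L2 boosting with a random intercept (illustrative only).

    Fixed effects compete in a selection step; the random intercepts are
    updated in every iteration and never take part in that competition.
    `lam` is an assumed, fixed ridge penalty (no variance estimation here).
    """
    n, p = X.shape
    groups, idx = np.unique(cluster, return_inverse=True)
    n_j = np.bincount(idx)                    # cluster sizes
    beta0 = y.mean()                          # offset
    beta = np.zeros(p)                        # fixed effects
    gamma = np.zeros(len(groups))             # random intercepts
    col_ss = (X ** 2).sum(axis=0)             # assumes no all-zero column

    for _ in range(n_iter):
        # Selection step: fit every covariate to the current residuals
        # by simple least squares and update only the best-fitting one.
        resid = y - beta0 - X @ beta - gamma[idx]
        coefs = X.T @ resid / col_ss
        sse = ((resid[:, None] - X * coefs) ** 2).sum(axis=0)
        best = np.argmin(sse)
        beta[best] += nu * coefs[best]

        # Random effects step: always executed, with a ridge-penalised
        # fit of the per-cluster residual means.
        resid = y - beta0 - X @ beta - gamma[idx]
        gamma += nu * np.bincount(idx, weights=resid) / (n_j + lam)

        # Re-centre the random intercepts; their mean moves to the offset.
        shift = gamma.mean()
        beta0 += shift
        gamma -= shift

    return beta0, beta, gamma
```

Called as `boost_lmm_sketch(X, y, cluster)` with a covariate matrix, a response vector and a vector of cluster labels, the function returns the offset, the fixed-effect coefficients and one random intercept per cluster. Because the random intercepts are updated unconditionally, they cannot crowd out fixed effects in the selection step, which is the point the sketch is meant to show.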

Keywords: gradient boosting; mixed models; regularised regression; statistical learning.

Publication types

Research Support, Non-U.S. Gov't

MeSH terms

Algorithms*
Likelihood Functions
Linear Models
Models, Statistical*