Gradient Tree Boosting for Hierarchical Data

Multivariate Behav Res. 2023 Sep-Oct;58(5):911-937. doi: 10.1080/00273171.2022.2146638. Epub 2023 Jan 5.

Abstract

Gradient tree boosting is a powerful machine learning technique that has shown good performance in predicting a variety of outcomes. However, when applied to hierarchical (e.g., longitudinal or clustered) data, its predictive performance may suffer if the hierarchical structure is ignored and may improve if it is accounted for. Tree-based methods such as regression trees and random forests have already been extended to hierarchical data settings by combining them with the linear mixed effects model (MEM). In the present article, we add to this literature by proposing two algorithms that estimate a combination of the MEM and gradient tree boosting. We report on two simulation studies that (i) investigate the predictive performance of the two MEM boosting algorithms and (ii) compare them to standard gradient tree boosting, standard random forests, and other existing methods for hierarchical data (MEM, MEM random forests, model-based boosting, and Bayesian additive regression trees [BART]). We found substantial improvements in the predictive performance of our MEM boosting algorithms over standard boosting when the random effects were non-negligible. MEM boosting and BART showed predictive performance similar to that of the correctly specified MEM (the benchmark model) and, overall, outperformed the model-based boosting and random forest approaches.
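The abstract only outlines the general idea of combining a MEM with gradient tree boosting. The sketch below is a minimal, hypothetical illustration of that general idea in the spirit of MERF-style estimation for a random-intercept-only model, alternating between boosting on the outcome adjusted for the current random effects and re-estimating shrunken cluster intercepts from the residuals. It is not the authors' algorithm; the function name `mem_boost`, its parameters, and the use of scikit-learn's `GradientBoostingRegressor` are assumptions made for this example.

```python
# Illustrative MERF-style sketch: mixed effects model + gradient tree boosting.
# Random-intercept case only; NOT the algorithm proposed in the article.
import numpy as np
from sklearn.ensemble import GradientBoostingRegressor

def mem_boost(X, y, groups, n_iter=20):
    """Alternate between (1) boosting on y minus the current random-intercept
    estimates and (2) re-estimating shrunken cluster intercepts from residuals."""
    groups = np.asarray(groups)
    b = {g: 0.0 for g in np.unique(groups)}                # random intercepts
    booster = GradientBoostingRegressor()
    for _ in range(n_iter):
        # 1) Fit the fixed (nonparametric) part on the adjusted outcome.
        y_adj = y - np.array([b[g] for g in groups])
        booster.fit(X, y_adj)
        # 2) Re-estimate random intercepts from the residuals.
        resid = y - booster.predict(X)
        sigma2_e = np.var(resid)                           # residual variance
        sigma2_b = max(np.var([resid[groups == g].mean()   # between-cluster
                               for g in b]), 1e-8)         # variance (floored)
        for g in b:
            r_g = resid[groups == g]
            # Shrunken (BLUP-like) estimate of the cluster intercept.
            b[g] = sigma2_b / (sigma2_b + sigma2_e / len(r_g)) * r_g.mean()
    return booster, b
```

The shrinkage factor in step 2 mimics the best linear unbiased predictor of a random intercept, so clusters with few observations are pulled more strongly toward zero; this EM-like alternation is the common device used when tree ensembles are combined with mixed effects models.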

Keywords: Mixed effects models; atypical observations; gradient boosting; longitudinal data; regression trees.

MeSH terms

  • Algorithms*
  • Bayes Theorem
  • Computer Simulation
  • Linear Models
  • Machine Learning*