Gradient tree boosting for hierarchical data

Salditt, M.; Humberg, S.; & Nestler, S.

Research article (journal) | Peer reviewed

Abstract

Gradient tree boosting is a powerful machine learning technique that has shown good performance in predicting a variety of outcomes. However, when applied to hierarchical (e.g., longitudinal or clustered) data, the predictive performance of gradient tree boosting may be harmed by ignoring the hierarchical structure, and may be improved by accounting for it. Tree-based methods such as regression trees and random forests have already been extended to hierarchical data settings by combining them with the linear mixed effects model (MEM). In the present article, we add to this literature by proposing two algorithms to estimate a combination of the MEM and gradient tree boosting. We report on two simulation studies that (i) investigate the predictive performance of the two MEM boosting algorithms and (ii) compare them to standard gradient tree boosting, standard random forest, and other existing methods for hierarchical data (MEM, MEM random forests, model-based boosting, Bayesian additive regression trees [BART]). We found substantial improvements in the predictive performance of our MEM boosting algorithms over standard boosting when the random effects were non-negligible. MEM boosting as well as BART showed a predictive performance similar to the correctly specified MEM (i.e., the benchmark model), and overall outperformed the model-based boosting and random forest approaches

Details about the publication

JournalMultivariate Behavioral Research
Volumexxx
StatusPublished
Release year2022
Keywordsmixed effects models, longitudinal data, regression trees, gradient boosting, atypical observations

Authors from the University of Münster

Humberg, Sarah
Professorship for Psychologiscal Diagnostics and Personality Psychology (Prof. Back)
Nestler, Steffen
Professorship for statistics and research methods in psychology
Salditt, Marie Babette
Professorship for statistics and research methods in psychology