Linear or smooth? Enhanced model choice in boosting via deselection of base-learners

Mayr, Andreas; Wistuba, Tobias; Speller, Jan; Gude, Francisco; Hofner, Benjamin;

Research article (journal) | Peer reviewed

Abstract

The specification of a particular type of effect (e.g., linear or non-linear) of a covariate in a regression model can be either based on graphical assessment, subject matter knowledge or also on data-driven model choice procedures. For the latter variant, we present a boosting approach that is available for a huge number of different model classes. Boosting is an indirect regularization technique that leads to variable selection and can easily incorporate also non-linear or smooth effects. Furthermore, the algorithm can be adapted in a way to automatically select whether to model a continuous variable with a smooth or a linear effect. We enhance this model choice procedure by trying to compensate the inherent bias towards the more complex effect by incorporating a pragmatic and simple deselection technique that was originally implemented for enhanced variable selection. We illustrate our approach in the analysis of T3 thyroid hormone levels from a larger Galician cohort and investigate its performance in a simulation study.

Details about the publication

JournalStatistical Modelling
Volume23
Issue5-6
Page range441-455
StatusPublished
Release year2023 (08/08/2023)
Language in which the publication is writtenEnglish
DOI10.1177/1471082X231170045
Link to the full texthttps://doi.org/10.1177/1471082X231170045
Keywordsboosting; model choice; prediction modelling; sparsity; splines;

Authors from the University of Münster

Speller, Jan
Junior professorship for practical computer science - modern aspects of data processing / data science (Prof. Braun)