Linear or smooth? Enhanced model choice in boosting via deselection of base-learners

Mayr, Andreas; Wistuba, Tobias; Speller, Jan; Gude, Francisco; Hofner, Benjamin;

Forschungsartikel (Zeitschrift) | Peer reviewed

Zusammenfassung

The specification of a particular type of effect (e.g., linear or non-linear) of a covariate in a regression model can be either based on graphical assessment, subject matter knowledge or also on data-driven model choice procedures. For the latter variant, we present a boosting approach that is available for a huge number of different model classes. Boosting is an indirect regularization technique that leads to variable selection and can easily incorporate also non-linear or smooth effects. Furthermore, the algorithm can be adapted in a way to automatically select whether to model a continuous variable with a smooth or a linear effect. We enhance this model choice procedure by trying to compensate the inherent bias towards the more complex effect by incorporating a pragmatic and simple deselection technique that was originally implemented for enhanced variable selection. We illustrate our approach in the analysis of T3 thyroid hormone levels from a larger Galician cohort and investigate its performance in a simulation study.

Details zur Publikation

FachzeitschriftStatistical Modelling
Jahrgang / Bandnr. / Volume23
Ausgabe / Heftnr. / Issue5-6
Seitenbereich441-455
StatusVeröffentlicht
Veröffentlichungsjahr2023 (08.08.2023)
Sprache, in der die Publikation verfasst istEnglisch
DOI10.1177/1471082X231170045
Link zum Volltexthttps://doi.org/10.1177/1471082X231170045
Stichwörterboosting; model choice; prediction modelling; sparsity; splines;

Autor*innen der Universität Münster

Speller, Jan
Juniorprofessur für Praktische Informatik - Moderne Aspekte der Verarbeitung von Daten / Data Science (Prof. Braun)