Moving beyond spatial and random cross-validation in environmental modelling: a call for prediction-domain adaptive evaluationOpen Access

Linnenbrink, J; Nowosad, J; Meyer, H

Forschungsartikel in Online-Sammlung | Preprint

Zusammenfassung

With the growing application of spatial predictive modeling in ecology, the question of how to appropriately evaluate the resulting maps has gained increasing attention. While there is consensus that map accuracy is ideally estimated using an independent probability sample of the prediction area, there is still no agreement on the most appropriate way to conduct an evaluation for the common case when such a sample is not available. Cross-validation, which involves multiple train-test splits, is commonly applied not only to estimate final model accuracy but also to guide model tuning and selection. Many different spatial and non-spatial approaches to cross-validation have been proposed, and approaches in both groups have faced substantial criticism. It has been shown that random cross-validation methods are suitable when the training points are randomly distributed in the prediction area, while spatial cross-validation is better suited towards extrapolation situations. In practice, however, there is a continuum and most cases are between those two extremes. To address this gap, we advocate for a new category of cross-validation methods to account for this: prediction-domain adaptive evaluation. Methods in this category flexibly adapt to the prediction situation, yielding most reliable estimates of map accuracy across different scenarios. To ground this perspective empirically, we reproduce a simulation study that was used in earlier research and systematically compare different evaluation methods and discuss their purpose.

Details zur Publikation

Name des RepositoriumsarXiv
StatusVeröffentlicht
Veröffentlichungsjahr2026
Sprache, in der die Publikation verfasst istEnglisch
Stichwörter area of applicability; data leakage; machine learning; map accuracy; model validation; spatial prediction

Autor*innen der Universität Münster

Linnenbrink, Jan
Meyer, Hanna

Projekte, aus denen die Publikation entstanden ist

Laufzeit: 01.10.2024 - 30.06.2028 | 1. Förderperiode
Gefördert durch: DFG - Sonderforschungsbereich
Art des Projekts: Teilprojekt in DFG-Verbund koordiniert außerhalb der Universität Münster
Laufzeit: 14.08.2024 - 13.08.2026
Gefördert durch: EU Horizon Europe - Marie Skłodowska-Curie Actions - Postdoctoral Fellowship
Art des Projekts: EU-Projekt koordiniert an der Universität Münster
Laufzeit: 01.08.2023 - 31.07.2027 | 1. Förderperiode
Gefördert durch: DFG - Schwerpunktprogramm
Art des Projekts: Teilprojekt in DFG-Verbund koordiniert außerhalb der Universität Münster