Classification of Parkinson's Disease from Voice - Analysis of Data Selection BiasOpen Access

Brenner A.; Van Alen C.M.; Plagwitz L.; Varghese J.

Research article (journal) | Peer reviewed

Abstract

A growing number of studies have been researching biomarkers of Parkinson's disease (PD) using mobile technology. Many have shown high accuracy in PD classification using machine learning (ML) and voice records from the mPower study, a large database of PD patients and healthy controls. Since the dataset has unbalanced class, gender and age distribution, it is important to consider appropriate sampling when assessing classification scores. We analyse biases, such as identity confounding and implicit learning of non-disease-specific characteristics and present a sampling strategy to highlight and prevent these problems.

Details about the publication

JournalStudies in Health Technology and Informatics (Stud Health Technol Inform)
Volume302
Page range127-128
StatusPublished
Release year2023
Language in which the publication is writtenEnglish
KeywordsMachine Learning; Parkinson's Disease; Selection Bias

Authors from the University of Münster

Brenner, Alexander
Plagwitz, Lucas
van Alen, Catharina Marie
Varghese, Julian