The Necessity of Multiple Data Sources for ECG-Based Machine Learning Models.

Plagwitz L; Vogelsang T; Doldi F; Bickmann L; Fujarski M; Eckardt L; Varghese J

Research article (journal) | Peer reviewed

Abstract

Even though the interest in machine learning studies is growing significantly, especially in medicine, the imbalance between study results and clinical relevance is more pronounced than ever. The reasons for this include data quality and interoperability issues. Hence, we aimed at examining site- and study-specific differences in publicly available standard electrocardiogram (ECG) datasets, which in theory should be interoperable by consistent 12-lead definition, sampling rate, and measurement duration. The focus lies upon the question of whether even slight study peculiarities can affect the stability of trained machine learning models. To this end, the performances of modern network architectures as well as unsupervised pattern detection algorithms are investigated across different datasets. Overall, this is intended to examine the generalization of machine learning results of single-site ECG studies.

Details about the publication

JournalStudies in Health Technology and Informatics (Stud Health Technol Inform)
Volume302
Page range33-37
StatusPublished
Release year2023 (18/05/2023)
Language in which the publication is writtenEnglish
DOI10.3233/SHTI230059
Link to the full texthttps://www.semanticscholar.org/paper/The-Necessity-of-Multiple-Data-Sources-for-Machine-Plagwitz-Vogelsang/0dc875dbd003fe62b7d1443ea3d7ec99d0fa5010
KeywordsInformation Sources; Machine Learning; Algorithms; Electrocardiography; Data Accuracy

Authors from the University of Münster

Doldi, Florian Günther
Klinik für Kardiologie II
Eckardt, Lars
Department for Cardiovascular Medicine
Fujarski, Michael
Institute of Medical Informatics
Plagwitz, Lucas
Institute of Medical Informatics
Varghese, Julian
Institute of Medical Informatics