Machine Learning for Chemical Reactivity: The Importance of Failed Experiments

Strieth-Kalthoff, Felix; Sandfort, Frederik; Kühnemund, Marius; Schäfer, Felix R.; Kuchen, Herbert; Glorius, Frank

Research article (journal) | Peer reviewed

Abstract

Assessing the outcomes of chemical reactions in a quantitative fashion has been a cornerstone across all synthetic disciplines. Classically approached through empirical optimization, data-driven modelling bears an enormous potential to streamline this process. However, such predictive models require significant quantities of high-quality data, the availability of which is limited: Main reasons for this include experimental errors and, importantly, human biases regarding experiment selection and result reporting. In a series of case studies, we investigate the impact of these biases for drawing general conclusions from chemical reaction data, revealing the utmost importance of “negative” examples. Eventually, case studies into data expansion approaches showcase directions to circumvent these limitations—and demonstrate perspectives towards a long-term data quality enhancement in chemistry.

Details about the publication

JournalAngewandte Chemie International Edition (Angew. Chem. Int. Ed.)
Volume61
Issue29
Article numbere202204647
StatusPublished
Release year2022
Language in which the publication is writtenEnglish
DOI10.1002/anie.202204647
KeywordsCross-Coupling; Data Bias; Machine Learning; Reaction Data; Yield Prediction

Authors from the University of Münster

Glorius, Frank
Professur für Organische Chemie (Prof. Glorius)
Kuchen, Herbert
Practical Computer Science Group (PI)
Kühnemund, Marius
Practical Computer Science Group (PI)
Sandfort, Frederik
Professur für Organische Chemie (Prof. Glorius)
Schäfer, Felix Richard
Professur für Organische Chemie (Prof. Glorius)
Strieth-Kalthoff, Felix
Professur für Organische Chemie (Prof. Glorius)