Low Data Machine Learning for Sustainable Chemical Sciences (LowDataML)

Basic data for this project

Type of projectEU-project hosted outside University of Münster
Duration at the University of Münster01/10/2025 - 30/09/2029

Description

Innovation in the chemical sciences is bound to iInnovation in the chemical sciences is bound to impact on Healthcare and Society. Supported by improved analytical methods and automation, brute force and large-scale experimentation have been playing an important role in generating volumes of chemical and biological data. These data now enable the support to decision making through machine learning/artificial intelligence (ML/AI) algorithms. In doing so, such algorithms help in the design and prioritization of experiments. As a result, we are witnessing a renaissance of ML/AI for accelerating chemistry, as in planning retrosyntheses, predicting reaction products, designing drug leads and materials de novo, and deconvoluting drug targets among others. Despite the chemistry advances leveraged by ML/AI, one can argue that not all research questions and findings benefit from the availability of big data (e.g. some discoveries are serendipitous). Here we argue that the current ML/AI toolkit excels in scenarios (big data) that do not entirely map onto daily practice (low and sparse data, often out of training set distribution). Thus, the scientific potential of ML/AI is not fully realized when state-of-the-art tools are implemented and primed for big data and highly charted search spaces. The disconnect between what is feasible and generally needed is apparent and is impacting our ability to advance the chemical sciences at a faster pace. In LowDataML we propose a suite of intersectional and transdisciplinary research projects that will create a new breed of scientists. Through an extensive training program, we will deliver 10 PhDs to the European R&D ecosystem, who are experts in low data ML with domain knowledge in synthetic organic chemistry and drug discovery. Further, the projects seamlessly integrate together and focus on: i) real world needs; ii) sustainability by minimising cost, time and materials consumption, with special focus on ecofriendly research practices.

KeywordsMaschinelles Lernen; KI; Wirkstoffe; Drug Design
Website of the projecthttps://cordis.europa.eu/project/id/101226058
Funding identifier101226058
Funder / funding scheme
  • EC Horizon Europe - Marie Skłodowska-Curie Actions - Doctoral Network (MSCA DN)

Project management at the University of Münster

Glorius, Frank
Professur für Organische Chemie (Prof. Glorius)

Applicants from the University of Münster

Glorius, Frank
Professur für Organische Chemie (Prof. Glorius)

Project partners outside the University of Münster

  • Ben-Gurion University of the Negev (BGU)Israel
  • Astrazeneca AbSweden
  • Spanish National Research Council (CSIC)Spain
  • Complutense University of Madrid (UCM)Spain
  • Swiss Federal Institute of Technology in Lausanne (EPFL)Switzerland
  • Eindhoven University of Technology (TUe)Netherlands (Kingdom of the)
  • University of CambridgeUnited Kingdom
  • Freie Universität Berlin (FU Berlin)Germany
  • University of TorontoCanada
  • University of Lisbon (ULisboa)Portugal
  • IBM Research GmbHSwitzerland
  • Gulbenkian Institute for Molecular Medicine (GIMM)Portugal
  • ARQUIMEA Research CenterSpain
  • Biomedical Research Foundation of the Academy of Athens (BRFAA)Greece
  • EU-OPENSCREENGermany

Coordinating organisations outside the University of Münster

  • Faculty of Pharmacy Research and Development Association (FARM-ID)Portugal