Abstract
Machine learning (ML) and artificial intelligence (AI) techniques are transforming the way chemical reactions are studied today. Datasets from high-throughput experimentation (HTE) are generated to better understand the reaction conditions crucial for outcomes such as yields and selectivities. However, it is often overlooked that datasets from such designed experiments possess a specific structure, which can be captured by a statistical model. Ignoring these data structures when applying ML/AI algorithms can result in misleading conclusions. In contrast, leveraging knowledge about the data-generating process yields reliable, interpretable, and comprehensive insights into reaction mechanisms. A particularly complex dataset is available for the Buchwald-Hartwig amination. Using this dataset, a statistical model for such HTE-generated chemical data is introduced, and a parameter estimation algorithm is developed. Based on the estimated model, new insights into the Buchwald-Hartwig amination are discussed. Our approach is applicable to a wide range of HTE-generated data for chemical reactions and beyond.
| Original language | English |
|---|---|
| Article number | 61 |
| Number of pages | 18 |
| Journal | Communications Chemistry |
| Volume | 9 |
| Issue number | 1 |
| Early online date | 3 Jan 2026 |
| DOIs | |
| Publication status | Published - 2 Feb 2026 |
Austrian Fields of Science 2012
- 101018 Statistics
- 104027 Computational chemistry
Keywords
- ISOR
- Cat.2
Fingerprint
Dive into the research topics of 'Modelling and estimation of chemical reaction yields from high-throughput experiments'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver