Skip to main navigation Skip to search Skip to main content

Metabolite Identification Data in Drug Discovery, Part 2: Site-of-Metabolism Annotation, Analysis, and Exploration for Machine Learning

Publications: Contribution to journalArticlePeer Reviewed

Abstract

The ability to pinpoint and predict sites of metabolism (SoMs) is essential for designing and optimizing effective and safe bioactive small molecules. However, the number of molecules with annotated SoMs is limited, hindering the advancement of data-driven methods such as machine learning for metabolism prediction. Here, we provide a comprehensive characterization of SoM data obtained from the readouts of a human hepatocyte assay conducted at AstraZeneca Gothenburg. We explore a new strategy for SoM annotation that accounts for uncertainty in the experimental data, and we relate our findings to the most comprehensive SoM data collection available to date. Our study includes entropy analysis of SoM annotations, accompanied by representative examples that highlight the complexities of interpreting and working with metabolism data. Furthermore, we demonstrate the impact and value of the new metabolism data on SoM prediction. Importantly, a substantial portion of the data generated and analyzed as part of this work is made publicly available.

Original languageEnglish
Pages (from-to)6772-6787
Number of pages16
JournalMolecular Pharmaceutics
Volume22
Issue number11
DOIs
Publication statusPublished - 3 Nov 2025

Austrian Fields of Science 2012

  • 301207 Pharmaceutical chemistry
  • 104027 Computational chemistry
  • 102019 Machine learning

Keywords

  • Machine Learning
  • Drug Discovery/methods
  • Humans
  • Hepatocytes/metabolism
  • data analysis
  • data annotation
  • drug metabolism
  • data sets
  • xenobiotic metabolism
  • sites of metabolism (SoMs)

Fingerprint

Dive into the research topics of 'Metabolite Identification Data in Drug Discovery, Part 2: Site-of-Metabolism Annotation, Analysis, and Exploration for Machine Learning'. Together they form a unique fingerprint.

Cite this