TY - JOUR
T1 - Chemical space exploration with Molpher
T2 - Generating and assessing a glucocorticoid receptor ligand library
AU - Agea, M Isabel
AU - Čmelo, Ivan
AU - Dehaen, Wim
AU - Chen, Ya
AU - Kirchmair, Johannes
AU - Sedlák, David
AU - Bartůněk, Petr
AU - Šícho, Martin
AU - Svozil, Daniel
N1 - © 2024 The Authors. Molecular Informatics published by Wiley-VCH GmbH.
PY - 2024/8/12
Y1 - 2024/8/12
N2 - Computational exploration of chemical space is crucial in modern cheminformatics research for accelerating the discovery of new biologically active compounds. In this study, we present a detailed analysis of the chemical library of potential glucocorticoid receptor (GR) ligands generated by the molecular generator, Molpher. To generate the targeted GR library and construct the classification models, structures from the ChEMBL database as well as from the internal IMG library, which was experimentally screened for biological activity in the primary luciferase reporter cell assay, were utilized. The composition of the targeted GR ligand library was compared with a reference library that randomly samples chemical space. A random forest model was used to determine the biological activity of ligands, incorporating its applicability domain using conformal prediction. It was demonstrated that the GR library is significantly enriched with GR ligands compared to the random library. Furthermore, a prospective analysis demonstrated that Molpher successfully designed compounds, which were subsequently experimentally confirmed to be active on the GR. A collection of 34 potential new GR ligands was also identified. Moreover, an important contribution of this study is the establishment of a comprehensive workflow for evaluating computationally generated ligands, particularly those with potential activity against targets that are challenging to dock.
AB - Computational exploration of chemical space is crucial in modern cheminformatics research for accelerating the discovery of new biologically active compounds. In this study, we present a detailed analysis of the chemical library of potential glucocorticoid receptor (GR) ligands generated by the molecular generator, Molpher. To generate the targeted GR library and construct the classification models, structures from the ChEMBL database as well as from the internal IMG library, which was experimentally screened for biological activity in the primary luciferase reporter cell assay, were utilized. The composition of the targeted GR ligand library was compared with a reference library that randomly samples chemical space. A random forest model was used to determine the biological activity of ligands, incorporating its applicability domain using conformal prediction. It was demonstrated that the GR library is significantly enriched with GR ligands compared to the random library. Furthermore, a prospective analysis demonstrated that Molpher successfully designed compounds, which were subsequently experimentally confirmed to be active on the GR. A collection of 34 potential new GR ligands was also identified. Moreover, an important contribution of this study is the establishment of a comprehensive workflow for evaluating computationally generated ligands, particularly those with potential activity against targets that are challenging to dock.
KW - chemical space
KW - de novo design
KW - glucocorticoid receptor
KW - molecular generation
UR - http://www.scopus.com/inward/record.url?scp=85197786625&partnerID=8YFLogxK
U2 - 10.1002/minf.202300316
DO - 10.1002/minf.202300316
M3 - Article
C2 - 38979783
VL - 43
JO - Molecular Informatics
JF - Molecular Informatics
SN - 1868-1743
IS - 8
M1 - e202300316
ER -