» Articles » PMID: 39283165

Combined Physics- and Machine-Learning-Based Method to Identify Druggable Binding Sites Using SILCS-Hotspots

Overview
Date 2024 Sep 16
PMID 39283165
Authors
Affiliations
Soon will be listed here.
Abstract

Identifying druggable binding sites on proteins is an important and challenging problem, particularly for cryptic, allosteric binding sites that may not be obvious from X-ray, cryo-EM, or predicted structures. The Site-Identification by Ligand Competitive Saturation (SILCS) method accounts for the flexibility of the target protein using all-atom molecular simulations that include various small molecule solutes in aqueous solution. During the simulations, the combination of protein flexibility and comprehensive sampling of the water and solute spatial distributions can identify buried binding pockets absent in experimentally determined structures. Previously, we reported a method for leveraging the information in the SILCS sampling to identify binding sites (termed Hotspots) of small mono- or bicyclic compounds, a subset of which coincide with known binding sites of drug-like molecules. Here, we build on that physics-based approach and present a ML model for ranking the Hotspots according to the likelihood they can accommodate drug-like molecules (e.g., molecular weight >200 Da). In the independent validation set, which includes various enzymes and receptors, our model recalls 67% and 89% of experimentally validated ligand binding sites in the top 10 and 20 ranked Hotspots, respectively. Furthermore, we show that the model's output Decision Function is a useful metric to predict binding sites and their potential druggability in new targets. Given the utility the SILCS method for ligand discovery and optimization, the tools presented represent an important advancement in the identification of orthosteric and allosteric binding sites and the discovery of drug-like molecules targeting those sites.

Citing Articles

Exploring Druggable Binding Sites on the Class A GPCRs Using the Residue Interaction Network and Site Identification by Ligand Competitive Saturation.

Inan T, Yuce M, MacKerell Jr A, Kurkcuoglu O ACS Omega. 2024; 9(38):40154-40171.

PMID: 39346853 PMC: 11425613. DOI: 10.1021/acsomega.4c06172.

References
1.
Stepniewska-Dziubinska M, Zielenkiewicz P, Siedlecki P . Improving detection of protein-ligand binding sites with 3D segmentation. Sci Rep. 2020; 10(1):5035. PMC: 7081267. DOI: 10.1038/s41598-020-61860-z. View

2.
Han Y, Belley M, Bayly C, Colucci J, Dufresne C, Giroux A . Discovery of [(3-bromo-7-cyano-2-naphthyl)(difluoro)methyl]phosphonic acid, a potent and orally active small molecule PTP1B inhibitor. Bioorg Med Chem Lett. 2008; 18(11):3200-5. DOI: 10.1016/j.bmcl.2008.04.064. View

3.
Prakash P, Sayyed-Ahmad A, Gorfe A . pMD-Membrane: A Method for Ligand Binding Site Identification in Membrane-Bound Proteins. PLoS Comput Biol. 2015; 11(10):e1004469. PMC: 4623977. DOI: 10.1371/journal.pcbi.1004469. View

4.
Zhao M, Kognole A, Jo S, Tao A, Hazel A, MacKerell Jr A . GPU-specific algorithms for improved solute sampling in grand canonical Monte Carlo simulations. J Comput Chem. 2023; 44(20):1719-1732. PMC: 10330275. DOI: 10.1002/jcc.27121. View

5.
Raman E, Yu W, Guvench O, MacKerell A . Reproducing crystal binding modes of ligand functional groups using Site-Identification by Ligand Competitive Saturation (SILCS) simulations. J Chem Inf Model. 2011; 51(4):877-96. PMC: 3090225. DOI: 10.1021/ci100462t. View