Benchmarking AlphaFold-enabled Molecular Docking Predictions for Antibiotic Discovery
Overview
Authors
Affiliations
Efficient identification of drug mechanisms of action remains a challenge. Computational docking approaches have been widely used to predict drug binding targets; yet, such approaches depend on existing protein structures, and accurate structural predictions have only recently become available from AlphaFold2. Here, we combine AlphaFold2 with molecular docking simulations to predict protein-ligand interactions between 296 proteins spanning Escherichia coli's essential proteome, and 218 active antibacterial compounds and 100 inactive compounds, respectively, pointing to widespread compound and protein promiscuity. We benchmark model performance by measuring enzymatic activity for 12 essential proteins treated with each antibacterial compound. We confirm extensive promiscuity, but find that the average area under the receiver operating characteristic curve (auROC) is 0.48, indicating weak model performance. We demonstrate that rescoring of docking poses using machine learning-based approaches improves model performance, resulting in average auROCs as large as 0.63, and that ensembles of rescoring functions improve prediction accuracy and the ratio of true-positive rate to false-positive rate. This work indicates that advances in modeling protein-ligand interactions, particularly using machine learning-based approaches, are needed to better harness AlphaFold2 for drug discovery.
Sun T, Hao Z, Meng F, Li X, Wang Y, Zhu H Molecules. 2025; 30(5).
PMID: 40076396 PMC: 11901460. DOI: 10.3390/molecules30051173.
Predicting Antimicrobial Class Specificity of Small Molecules Using Machine Learning.
Gadiya Y, Genilloud O, Bilitewski U, Bronstrup M, von Berlin L, Attwood M J Chem Inf Model. 2025; 65(5):2416-2431.
PMID: 39987507 PMC: 11898080. DOI: 10.1021/acs.jcim.4c02347.
Alshammari M, He J, Wriggers W Bioinform Adv. 2025; 5(1):vbae181.
PMID: 39897947 PMC: 11783307. DOI: 10.1093/bioadv/vbae181.
Robustly interrogating machine learning-based scoring functions: what are they learning?.
Durant G, Boyles F, Birchall K, Marsden B, Deane C Bioinformatics. 2025; 41(2).
PMID: 39874452 PMC: 11821266. DOI: 10.1093/bioinformatics/btaf040.
Wang J, Zhang R, Zhao X, Zhang J, Tong Y, Abbas Z Int J Mol Sci. 2025; 26(2.
PMID: 39859222 PMC: 11764585. DOI: 10.3390/ijms26020505.