HIt Discovery Using Docking ENriched by GEnerative Modeling (HIDDEN GEM): A Novel Computational Workflow for Accelerated Virtual Screening of Ultra-large Chemical Libraries
The recent rapid expansion of make-on-demand, purchasable chemical libraries comprising tens of billions or even trillions of molecules has challenged the efficient application of traditional structure-based virtual screening methods that rely on molecular docking. We present a novel computational methodology termed HIDDEN GEM (HIt Discovery using Docking ENriched by GEnerative Modeling) that greatly accelerates virtual screening. This workflow uniquely integrates machine learning, generative chemistry, massive chemical similarity searching, and molecular docking of small, selected libraries at the beginning and the end of the workflow. For each target, HIDDEN GEM nominates a small number of top-scoring virtual hits prioritized from ultra-large chemical libraries. We have benchmarked HIDDEN GEM by conducting virtual screening campaigns for 16 diverse protein targets using the Enamine REAL Space library comprising 37 billion molecules. We show that HIDDEN GEM yields the highest enrichment factors compared with state-of-the-art accelerated virtual screening methods while requiring the least computational resources. HIDDEN GEM can be executed with any docking software and employed by users with limited computational resources.
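The abstract compares methods by enrichment factor without defining it. As a rough illustration only, the sketch below computes the conventional enrichment factor: the hit rate among the top-ranked fraction of a scored library divided by the hit rate in the whole library. The function name `enrichment_factor`, the `top_fraction` parameter, the toy data, and the convention that lower docking scores are better are all assumptions made for this example; the exact definition used in the HIDDEN GEM benchmark is not stated here and may differ.

```python
def enrichment_factor(scores, is_hit, top_fraction=0.01):
    """Conventional enrichment factor (illustrative, not the paper's exact metric).

    scores       : docking scores, one per molecule (assumed: lower is better)
    is_hit       : 0/1 flags marking which molecules count as hits
    top_fraction : fraction of the ranked library treated as the selection
    """
    if len(scores) != len(is_hit) or len(scores) == 0:
        raise ValueError("scores and is_hit must be non-empty and equal in length")

    n_total = len(scores)
    n_top = max(1, int(round(n_total * top_fraction)))

    hits_total = sum(is_hit)
    if hits_total == 0:
        raise ValueError("enrichment factor is undefined when there are no hits")

    # Rank molecule indices from best (lowest) to worst (highest) score.
    ranked = sorted(range(n_total), key=lambda i: scores[i])
    hits_top = sum(is_hit[i] for i in ranked[:n_top])

    # Hit rate in the selection divided by hit rate in the full library.
    return (hits_top / n_top) / (hits_total / n_total)


# Toy data: 10 molecules, 2 hits, both ranked in the top 20% by score.
toy_scores = [-10.0, -9.0, -5.0, -4.0, -3.0, -2.0, -1.0, 0.0, 1.0, 2.0]
toy_hits = [1, 1, 0, 0, 0, 0, 0, 0, 0, 0]
toy_ef = enrichment_factor(toy_scores, toy_hits, top_fraction=0.2)
```

On this toy data the selection hit rate is 2/2 and the library hit rate is 2/10, so the enrichment factor is 5. A value of 1 would mean the ranking does no better than random selection.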