Predicting Locations of Cryptic Pockets from Single Protein Structures Using the PocketMiner Graph Neural Network
Overview
Authors
Affiliations
Cryptic pockets expand the scope of drug discovery by enabling targeting of proteins currently considered undruggable because they lack pockets in their ground state structures. However, identifying cryptic pockets is labor-intensive and slow. The ability to accurately and rapidly predict if and where cryptic pockets are likely to form from a structure would greatly accelerate the search for druggable pockets. Here, we present PocketMiner, a graph neural network trained to predict where pockets are likely to open in molecular dynamics simulations. Applying PocketMiner to single structures from a newly curated dataset of 39 experimentally confirmed cryptic pockets demonstrates that it accurately identifies cryptic pockets (ROC-AUC: 0.87) >1,000-fold faster than existing methods. We apply PocketMiner across the human proteome and show that predicted pockets open in simulations, suggesting that over half of proteins thought to lack pockets based on available structures likely contain cryptic pockets, vastly expanding the potentially druggable proteome.
Revisiting the druggable genome using predicted structures and data mining.
Godinez-Macias K, Chen D, Wallis J, Siegel M, Adam A, Bopp S NPJ Drug Discov. 2025; 2(1):3.
PMID: 40066064 PMC: 11892419. DOI: 10.1038/s44386-025-00006-5.
Zhu R, Wu C, Zha J, Lu S, Zhang J RSC Chem Biol. 2025; .
PMID: 39981029 PMC: 11836628. DOI: 10.1039/d4cb00282b.
Scaling Structure Aware Virtual Screening to Billions of Molecules with SPRINT.
McNutt A, Adduri A, Ellington C, Dayao M, Xing E, Mohimani H ArXiv. 2025; .
PMID: 39975427 PMC: 11838698.
Dunnett L, Das S, Venditti V, Prischi F bioRxiv. 2025; .
PMID: 39763864 PMC: 11702612. DOI: 10.1101/2024.12.17.628909.
CryptoBench: cryptic protein-ligand binding sites dataset and benchmark.
Skrhak V, Novotny M, Feidakis C, Krivak R, Hoksza D Bioinformatics. 2024; 41(1).
PMID: 39693053 PMC: 11725321. DOI: 10.1093/bioinformatics/btae745.