» Articles » PMID: 19434821

Maximum Unbiased Validation (MUV) Data Sets for Virtual Screening Based on PubChem Bioactivity Data

Overview
Date 2009 May 13
PMID 19434821
Citations 113
Authors
Affiliations
Soon will be listed here.
Abstract

Refined nearest neighbor analysis was recently introduced for the analysis of virtual screening benchmark data sets. It constitutes a technique from the field of spatial statistics and provides a mathematical framework for the nonparametric analysis of mapped point patterns. Here, refined nearest neighbor analysis is used to design benchmark data sets for virtual screening based on PubChem bioactivity data. A workflow is devised that purges data sets of compounds active against pharmaceutically relevant targets from unselective hits. Topological optimization using experimental design strategies monitored by refined nearest neighbor analysis functions is applied to generate corresponding data sets of actives and decoys that are unbiased with regard to analogue bias and artificial enrichment. These data sets provide a tool for Maximum Unbiased Validation (MUV) of virtual screening methods. The data sets and a software package implementing the MUV design workflow are freely available at http://www.pharmchem.tu-bs.de/lehre/baumann/MUV.html.

Citing Articles

Integrating convolutional layers and biformer network with forward-forward and backpropagation training.

Kianfar A, Razzaghi P, Asgari Z Sci Rep. 2025; 15(1):7230.

PMID: 40021838 PMC: 11871031. DOI: 10.1038/s41598-025-92218-y.


A Practical Guide to Computational Tools for Engineering Biocatalytic Properties.

Vega A, Planas A, Biarnes X Int J Mol Sci. 2025; 26(3).

PMID: 39940748 PMC: 11817184. DOI: 10.3390/ijms26030980.


From High Dimensions to Human Insight: Exploring Dimensionality Reduction for Chemical Space Visualization.

Orlov A, Akhmetshin T, Horvath D, Marcou G, Varnek A Mol Inform. 2024; 44(1):e202400265.

PMID: 39633514 PMC: 11733715. DOI: 10.1002/minf.202400265.


Do Molecular Fingerprints Identify Diverse Active Drugs in Large-Scale Virtual Screening? (No).

Venkatraman V, Gaiser J, Demekas D, Roy A, Xiong R, Wheeler T Pharmaceuticals (Basel). 2024; 17(8).

PMID: 39204097 PMC: 11356940. DOI: 10.3390/ph17080992.


A review of deep learning methods for ligand based drug virtual screening.

Wu H, Liu J, Zhang R, Lu Y, Cui G, Cui Z Fundam Res. 2024; 4(4):715-737.

PMID: 39156568 PMC: 11330120. DOI: 10.1016/j.fmre.2024.02.011.