» Articles » PMID: 12870906

Design and Evaluation of a Molecular Fingerprint Involving the Transformation of Property Descriptor Values into a Binary Classification Scheme

Overview
Date 2003 Jul 23
PMID 12870906
Citations 18
Authors
Affiliations
Soon will be listed here.
Abstract

A new fingerprint design concept is introduced that transforms molecular property descriptors into two-state descriptors and thus permits binary encoding. This transformation is based on the calculation of statistical medians of descriptor distributions in large compound collections and alleviates the need for value range encoding of these descriptors. For binary encoded property descriptors, bit positions that are set off capture as much information as bit positions that are set on, different from conventional fingerprint representations. Accordingly, a variant of the Tanimoto coefficient has been defined for comparison of these fingerprints. Following our design idea, a prototypic fingerprint termed MP-MFP was implemented by combining 61 binary encoded property descriptors with 110 structural fragment-type descriptors. The performance of this fingerprint was evaluated in systematic similarity search calculations in a database containing 549 molecules belonging to 38 different activity classes and 5000 background molecules. In these calculations, MP-MFP correctly recognized approximately 34% of all similarity relationships, with only 0.04% false positives, and performed better than previous designs and MACCS keys. The results suggest that combinations of simplified two-state property descriptors have predictive value in the analysis of molecular similarity.

Citing Articles

Non-animal models for blood-brain barrier permeability evaluation of drug-like compounds.

Dehnbostel F, Dixit V, Preissner R, Banerjee P Sci Rep. 2024; 14(1):8908.

PMID: 38632344 PMC: 11024088. DOI: 10.1038/s41598-024-59734-9.


Optimal high-throughput virtual screening pipeline for efficient selection of redox-active organic materials.

Woo H, Allam O, Chen J, Jang S, Yoon B iScience. 2022; 26(1):105735.

PMID: 36582827 PMC: 9793274. DOI: 10.1016/j.isci.2022.105735.


Introducing a Chemically Intuitive Core-Substituent Fingerprint Designed to Explore Structural Requirements for Effective Similarity Searching and Machine Learning.

Janela T, Takeuchi K, Bajorath J Molecules. 2022; 27(7).

PMID: 35408730 PMC: 9000322. DOI: 10.3390/molecules27072331.


S2DV: converting SMILES to a drug vector for predicting the activity of anti-HBV small molecules.

Shao J, Gong Q, Yin Z, Pan W, Pandiyan S, Wang L Brief Bioinform. 2022; 23(2).

PMID: 35062019 PMC: 8921627. DOI: 10.1093/bib/bbab593.


Profiling and analysis of chemical compounds using pointwise mutual information.

cmelo I, Vorsilak M, Svozil D J Cheminform. 2021; 13(1):3.

PMID: 33423694 PMC: 7798221. DOI: 10.1186/s13321-020-00483-y.