Searching for Target-selective Compounds Using Different Combinations of Multiclass Support Vector Machine Ranking Methods, Kernel Functions, and Fingerprint Descriptors
Overview
Medical Informatics
Affiliations
The identification of small chemical compounds that are selective for a target protein over one or more closely related members of the same family is of high relevance for applications in chemical biology. Conventional 2D similarity searching using known selective molecules as templates has recently been found to preferentially detect selective over non-selective and inactive database compounds. To improve the initially observed search performance, we have attempted to use 2D fingerprints as descriptors for support vector machine (SVM)-based selectivity searching. Different from typically applied binary SVM compound classification, SVM analysis has been adapted here for multiclass predictions and compound ranking to distinguish between selective, active but non-selective, and inactive compounds. In systematic database search calculations, we tested combinations of four alternative SVM ranking schemes, four different kernel functions, and four fingerprints and were able to further improve selectivity search performance by effectively removing non-selective molecules from high ranking positions while retaining high recall of selective compounds.
Pirzada R, Javaid N, Choi S Genes (Basel). 2020; 11(2).
PMID: 32012695 PMC: 7074480. DOI: 10.3390/genes11020131.
Zou B, Lee V, Yan H BMC Bioinformatics. 2018; 19(1):88.
PMID: 29514601 PMC: 5842518. DOI: 10.1186/s12859-018-2093-6.
Quantum probability ranking principle for ligand-based virtual screening.
Al-Dabbagh M, Salim N, Himmat M, Ahmed A, Saeed F J Comput Aided Mol Des. 2017; 31(4):365-378.
PMID: 28220440 DOI: 10.1007/s10822-016-0003-4.
Bioactive Molecule Prediction Using Extreme Gradient Boosting.
Babajide Mustapha I, Saeed F Molecules. 2016; 21(8).
PMID: 27483216 PMC: 6273295. DOI: 10.3390/molecules21080983.
Kurczab R, Canale V, Zajdel P, Bojarski A PLoS One. 2016; 11(6):e0156986.
PMID: 27271158 PMC: 4896471. DOI: 10.1371/journal.pone.0156986.