» Articles » PMID: 16452796

Discovering Compact and Highly Discriminative Features or Feature Combinations of Drug Activities Using Support Vector Machines

Overview
Specialty Biology
Date 2006 Feb 3
PMID 16452796
Citations 5
Authors
Affiliations
Soon will be listed here.
Abstract

Nowadays, high throughput experimental techniques make it feasible to examine and collect massive data at the molecular level. These data, typically mapped to a very high dimensional feature space, carry rich information about functionalities of certain chemical or biological entities and can be used to infer valuable knowledge for the purposes of classification and prediction. Typically, a small number of features or feature combinations may play determinant roles in functional discrimination. The identification of such features or feature combinations is of great importance. In this paper, we study the problem of discovering compact and highly discriminative features or feature combinations from a rich feature collection. We employ the support vector machine as the classification means and aim at finding compact feature combinations. Comparing to previous methods on feature selection, which identify features solely based on their individual roles in the classification, our method is able to identify minimal feature combinations that ultimately have determinant roles in a systematic fashion. Experimental study on drug activity data shows that our method can discover descriptors that are not necessarily significant individually but are most significant collectively.

Citing Articles

NRPreTo: A Machine Learning-Based Nuclear Receptor and Subfamily Prediction Tool.

Madugula S, Pandey S, Amalapurapu S, Bozdag S ACS Omega. 2023; 8(23):20379-20388.

PMID: 37323377 PMC: 10268018. DOI: 10.1021/acsomega.3c00286.


DemQSAR: predicting human volume of distribution and clearance of drugs.

Demir-Kavuk O, Bentzien J, Muegge I, Knapp E J Comput Aided Mol Des. 2011; 25(12):1121-33.

PMID: 22101402 DOI: 10.1007/s10822-011-9496-z.


Prediction of functional class of proteins and peptides irrespective of sequence homology by support vector machines.

Tang Z, Lin H, Zhang H, Han L, Chen X, Chen Y Bioinform Biol Insights. 2010; 1:19-47.

PMID: 20066123 PMC: 2789692. DOI: 10.4137/bbi.s315.


Efficacy of different protein descriptors in predicting protein functional families.

Ong S, Lin H, Chen Y, Rong Li Z, Cao Z BMC Bioinformatics. 2007; 8:300.

PMID: 17705863 PMC: 1997217. DOI: 10.1186/1471-2105-8-300.


An in silico approach for screening flavonoids as P-glycoprotein inhibitors based on a Bayesian-regularized neural network.

Wang Y, Li Y, Yang S, Yang L J Comput Aided Mol Des. 2005; 19(3):137-47.

PMID: 16059668 DOI: 10.1007/s10822-005-3321-5.