» Articles » PMID: 20704390

Visualization and Recovery of the (bio)chemical Interesting Variables in Data Analysis with Support Vector Machine Classification

Overview
Journal Anal Chem
Specialty Chemistry
Date 2010 Aug 14
PMID 20704390
Citations 11
Authors
Affiliations
Soon will be listed here.
Abstract

Support vector machines (SVMs) have become a popular technique in the chemometrics and bioinformatics field, and other fields, for the classification of complex data sets. Especially because SVMs are able to model nonlinear relationships, the usage of this technique has increased substantially. This modeling is obtained by mapping the data in a higher-dimensional feature space. The disadvantage of such a transformation is, however, that information about the contribution of the original variables in the classification is lost. In this paper we introduce an innovative method which can retrieve the information about the variables of complex data sets. We apply the proposed method to several benchmark data sets and a metabolomics data set to illustrate that we can determine the contribution of the original variables in SVM classifications. The corresponding visualization of the contribution of the variables can assist in a better understanding of the underlying chemical or biological process.

Citing Articles

Identification of Drug-Induced Liver Injury Biomarkers from Multiple Microarrays Based on Machine Learning and Bioinformatics Analysis.

Wang K, Zhang L, Li L, Wang Y, Zhong X, Hou C Int J Mol Sci. 2022; 23(19).

PMID: 36233241 PMC: 9570393. DOI: 10.3390/ijms231911945.


The exposome paradigm to predict environmental health in terms of systemic homeostasis and resource balance based on NMR data science.

Kikuchi J, Yamada S RSC Adv. 2022; 11(48):30426-30447.

PMID: 35480260 PMC: 9041152. DOI: 10.1039/d1ra03008f.


Toward a hemorrhagic trauma severity score: fusing five physiological biomarkers.

Bhat A, Podstawczyk D, Walther B, Aggas J, Machado-Aranda D, Ward K J Transl Med. 2020; 18(1):348.

PMID: 32928219 PMC: 7490913. DOI: 10.1186/s12967-020-02516-4.


Quantitative TLC-SERS detection of histamine in seafood with support vector machine analysis.

Tan A, Zhao Y, Sivashanmugan K, Squire K, Wang A Food Control. 2019; 103:111-118.

PMID: 31827314 PMC: 6905648. DOI: 10.1016/j.foodcont.2019.03.032.


SVM-RFE: selection and visualization of the most relevant features through non-linear kernels.

Sanz H, Valim C, Vegas E, Oller J, Reverter F BMC Bioinformatics. 2018; 19(1):432.

PMID: 30453885 PMC: 6245920. DOI: 10.1186/s12859-018-2451-4.