
Variable Selection in Multivariate Calibration Based on Clustering of Variable Concept

Overview
Journal: Anal Chim Acta
Publisher: Elsevier
Specialty: Chemistry
Date: 2015 Dec 26
PMID: 26703255
Citations: 1
Abstract

We recently proposed a variable selection algorithm for classification problems based on the clustering of variables concept (CLoVA). Here the same idea is applied to a regression problem, and the results are compared with conventional variable selection strategies for PLS. The basic idea is that the instrument channels are grouped into clusters by a clustering algorithm, and the spectral data of each cluster are then subjected to PLS regression. Several real data sets (Cargill corn, Biscuit dough, ACE QSAR, Soy, and Tablet) were used to evaluate the influence of variable clustering on the prediction performance of PLS. In almost all cases, the statistical parameters, especially the prediction error, show the superiority of CLoVA-PLS over the other variable selection strategies. Finally, synergy clustering of variables (sCLoVA-PLS), which uses combinations of clusters, is proposed as an efficient modification of the CLoVA algorithm. The statistical parameters obtained indicate that variable clustering can separate the useful variables from the redundant ones, and that a stable model can then be built on the informative clusters.
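The procedure described in the abstract (cluster the instrument channels, fit PLS on each cluster, then optionally try combinations of clusters) can be illustrated with a minimal NumPy sketch. This is not the authors' implementation. The abstract does not say which clustering algorithm, preprocessing, number of clusters, or selection criterion was used, so everything here is an assumption made for illustration: plain k-means on standardized channel profiles, a basic NIPALS PLS1 fit, interleaved 5-fold cross-validated RMSE as the score, and the function names `cluster_variables`, `pls1_fit`, `cv_rmse`, `clova_pls`, and `sclova_pls`.

```python
from itertools import combinations

import numpy as np


def cluster_variables(X, n_clusters, n_iter=50, seed=0):
    """Assumed clustering step: plain k-means on standardized columns of X."""
    rng = np.random.default_rng(seed)
    Z = (X - X.mean(axis=0)) / (X.std(axis=0) + 1e-12)
    V = Z.T  # one row per variable (instrument channel)
    centers = V[rng.choice(V.shape[0], n_clusters, replace=False)]
    for _ in range(n_iter):
        dist = ((V[:, None, :] - centers[None, :, :]) ** 2).sum(axis=2)
        labels = dist.argmin(axis=1)
        for k in range(n_clusters):
            if np.any(labels == k):
                centers[k] = V[labels == k].mean(axis=0)
    return labels


def pls1_fit(X, y, n_components):
    """Basic NIPALS PLS1; returns regression coefficients and the means."""
    x_mean, y_mean = X.mean(axis=0), y.mean()
    Xr, yr = X - x_mean, y - y_mean
    W, P, q = [], [], []
    for _ in range(n_components):
        w = Xr.T @ yr
        norm = np.linalg.norm(w)
        if norm < 1e-12:  # nothing left to explain
            break
        w = w / norm
        t = Xr @ w
        tt = t @ t
        p = Xr.T @ t / tt
        c = yr @ t / tt
        Xr = Xr - np.outer(t, p)  # deflate X
        yr = yr - c * t           # deflate y
        W.append(w), P.append(p), q.append(c)
    if not W:
        return np.zeros(X.shape[1]), x_mean, y_mean
    W, P, q = np.array(W).T, np.array(P).T, np.array(q)
    coef = W @ np.linalg.solve(P.T @ W, q)
    return coef, x_mean, y_mean


def cv_rmse(X, y, n_components, n_folds=5):
    """Cross-validated RMSE with interleaved folds (an assumed criterion)."""
    idx = np.arange(len(y))
    sq_err = []
    for f in range(n_folds):
        test = idx % n_folds == f
        coef, xm, ym = pls1_fit(X[~test], y[~test], n_components)
        pred = (X[test] - xm) @ coef + ym
        sq_err.append((pred - y[test]) ** 2)
    return float(np.sqrt(np.concatenate(sq_err).mean()))


def clova_pls(X, y, n_clusters=3, n_components=2):
    """Fit PLS on each variable cluster and report the best-scoring one."""
    labels = cluster_variables(X, n_clusters)
    scores = {}
    for k in np.unique(labels):
        cols = np.flatnonzero(labels == k)
        scores[int(k)] = cv_rmse(X[:, cols], y, min(n_components, len(cols)))
    best = min(scores, key=scores.get)
    return labels, scores, best


def sclova_pls(X, y, labels, n_components=2):
    """Synergy variant (assumed form): score every union of two or more clusters."""
    ids = [int(k) for k in np.unique(labels)]
    scores = {}
    for r in range(2, len(ids) + 1):
        for combo in combinations(ids, r):
            cols = np.flatnonzero(np.isin(labels, combo))
            scores[combo] = cv_rmse(X[:, cols], y, min(n_components, len(cols)))
    return scores
```

Calling `clova_pls(X, y)` on a samples-by-channels matrix `X` and a response vector `y` returns the cluster label of each channel, the cross-validated RMSE per cluster, and the label of the lowest-error cluster. Passing those labels to `sclova_pls` scores the cluster combinations in the same way. In practice the number of clusters and PLS components would need tuning, which this sketch leaves out.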

Citing Articles

Multiblock variable influence on orthogonal projections (MB-VIOP) for enhanced interpretation of total, global, local and unique variations in OnPLS models.

Galindo-Prieto B, Geladi P, Trygg J. BMC Bioinformatics. 2021; 22(1):176.

PMID: 33812384 PMC: 8019512. DOI: 10.1186/s12859-021-04015-9.