» Articles » PMID: 38404905

Numerically Stable Locality-preserving Partial Least Squares Discriminant Analysis for Efficient Dimensionality Reduction and Classification of High-dimensional Data

Overview
Journal Heliyon
Specialty Social Sciences
Date 2024 Feb 26
PMID 38404905
Authors
Affiliations
Soon will be listed here.
Abstract

Dimensionality reduction plays a pivotal role in preparing high-dimensional data for classification and discrimination tasks by eliminating redundant features and enhancing the efficiency of classifiers. The effectiveness of a dimensionality reduction algorithm hinges on its numerical stability. When data projections are numerically stable, they lead to enhanced class separability in the lower-dimensional embedding, consequently yielding higher classification accuracy. This paper investigates the numerical attributes of dimensionality reduction and discriminant subspace learning, with a specific focus on Locality-Preserving Partial Least Squares Discriminant Analysis (LPPLS-DA). High-dimensional data frequently introduce singularity in the scatter matrices, posing a significant challenge. To tackle this issue, the paper explores two robust implementations of LPPLS-DA. These approaches not only optimize data projections but also capture more discriminative features, resulting in a marked improvement in classification accuracy. Empirical evidence supports these findings through numerical experiments conducted on synthetic and spectral datasets. The results demonstrate the superior performance of the proposed methods when compared to several state-of-the-art dimensionality reduction techniques in terms of both classification accuracy and dimension reduction.

References
1.
Li H, Jiang T, Zhang K . Efficient and robust feature extraction by maximum margin criterion. IEEE Trans Neural Netw. 2006; 17(1):157-65. DOI: 10.1109/TNN.2005.860852. View

2.
Tapp H, Defernez M, Kemsley E . FTIR spectroscopy and multivariate analysis can distinguish the geographic origin of extra virgin olive oils. J Agric Food Chem. 2003; 51(21):6110-5. DOI: 10.1021/jf030232s. View

3.
Song W, Wang H, Maguire P, Nibouche O . Nearest clusters based partial least squares discriminant analysis for the classification of spectral data. Anal Chim Acta. 2018; 1009:27-38. DOI: 10.1016/j.aca.2018.01.023. View

4.
Nguyen D, Rocke D . Tumor classification by partial least squares using microarray gene expression data. Bioinformatics. 2002; 18(1):39-50. DOI: 10.1093/bioinformatics/18.1.39. View

5.
Kuligowski J, Quintas G, Herwig C, Lendl B . A rapid method for the differentiation of yeast cells grown under carbon and nitrogen-limited conditions by means of partial least squares discriminant analysis employing infrared micro-spectroscopic data of entire yeast cells. Talanta. 2012; 99:566-73. PMC: 3460240. DOI: 10.1016/j.talanta.2012.06.036. View