» Articles » PMID: 15531609

Classification Using Partial Least Squares with Penalized Logistic Regression

Overview
Journal Bioinformatics
Specialty Biology
Date 2004 Nov 9
PMID 15531609
Citations 31
Authors
Affiliations
Soon will be listed here.
Abstract

Motivation: One important aspect of data-mining of microarray data is to discover the molecular variation among cancers. In microarray studies, the number n of samples is relatively small compared to the number p of genes per sample (usually in thousands). It is known that standard statistical methods in classification are efficient (i.e. in the present case, yield successful classifiers) particularly when n is (far) larger than p. This naturally calls for the use of a dimension reduction procedure together with the classification one.

Results: In this paper, the question of classification in such a high-dimensional setting is addressed. We view the classification problem as a regression one with few observations and many predictor variables. We propose a new method combining partial least squares (PLS) and Ridge penalized logistic regression. We review the existing methods based on PLS and/or penalized likelihood techniques, outline their interest in some cases and theoretically explain their sometimes poor behavior. Our procedure is compared with these other classifiers. The predictive performance of the resulting classification rule is illustrated on three data sets: Leukemia, Colon and Prostate.

Citing Articles

Electroencephalography functional connectivity-A biomarker for painful polyneuropathy.

Topaz L, Frid A, Granovsky Y, Zubidat R, Crystal S, Buxbaum C Eur J Neurol. 2022; 30(1):204-214.

PMID: 36148823 PMC: 10092565. DOI: 10.1111/ene.15575.


Hide and seek shark teeth in Random Forests: machine learning applied to populations.

Berio F, Bayle Y, Baum D, Goudemand N, Debiais-Thibaud M PeerJ. 2022; 10:e13575.

PMID: 35811817 PMC: 9261926. DOI: 10.7717/peerj.13575.


Genome-Wide Association Study Statistical Models: A Review.

Yoosefzadeh-Najafabadi M, Eskandari M, Belzile F, Torkamaneh D Methods Mol Biol. 2022; 2481:43-62.

PMID: 35641758 DOI: 10.1007/978-1-0716-2237-7_4.


Multivariate Time Series Analysis of Temperatures in the Archaeological Museum of L'Almoina (Valencia, Spain).

Ramirez S, Zarzo M, Garcia-Diego F Sensors (Basel). 2021; 21(13).

PMID: 34206737 PMC: 8271729. DOI: 10.3390/s21134377.


Decreased effective connection from the parahippocampal gyrus to the prefrontal cortex in Internet gaming disorder: A MVPA and spDCM study.

Wang Z, Dong H, Du X, Zhang J, Dong G J Behav Addict. 2020; 9(1):105-115.

PMID: 32359234 PMC: 8935187. DOI: 10.1556/2006.2020.00012.