» Articles » PMID: 32968333

Sparse Linear Discriminant Analysis Using the Prior-Knowledge-Guided Block Covariance Matrix

Overview
Publisher Elsevier
Specialty Chemistry
Date 2020 Sep 24
PMID 32968333
Authors
Affiliations
Soon will be listed here.
Abstract

There are two key challenges when using a linear discriminant analysis in the high-dimensional setting, including singularity of the covariance matrix and difficulty of interpreting the resulting classifier. Although several methods have been proposed to address these problems, they focused only on identifying a parsimonious set of variables maximizing classification accuracy. However, most methods did not consider dependency between variables and efficacy of selected variables appropriately. To address these limitations, here we propose a new approach that directly estimates the sparse discriminant vector without a need of estimating the whole inverse covariance matrix, by formulating a quadratic optimization problem. Furthermore, this approach also allows to integrate external information to guide the structure of covariance matrix. We evaluated the proposed model with simulation studies. We then applied it to the transcriptomic study that aims to identify genomic markers predictive of the response to cancer immunotherapy, where the covariance matrix was constructed based on the prior knowledge available in the pathway database.

References
1.
Pang H, Tong T, Ng M . Block-diagonal discriminant analysis and its bias-corrected rules. Stat Appl Genet Mol Biol. 2013; 12(3):347-59. PMC: 3856189. DOI: 10.1515/sagmb-2012-0017. View

2.
Witten D, Tibshirani R . Penalized classification using Fisher's linear discriminant. J R Stat Soc Series B Stat Methodol. 2012; 73(5):753-772. PMC: 3272679. DOI: 10.1111/j.1467-9868.2011.00783.x. View

3.
Bien J, Tibshirani R . Sparse estimation of a covariance matrix. Biometrika. 2012; 98(4):807-820. PMC: 3413177. DOI: 10.1093/biomet/asr054. View

4.
Ye J, Li T, Xiong T, Janardan R . Using uncorrelated discriminant analysis for tissue classification with gene expression data. IEEE/ACM Trans Comput Biol Bioinform. 2006; 1(4):181-90. DOI: 10.1109/TCBB.2004.45. View

5.
Tibshirani R, Hastie T, Narasimhan B, Chu G . Diagnosis of multiple cancer types by shrunken centroids of gene expression. Proc Natl Acad Sci U S A. 2002; 99(10):6567-72. PMC: 124443. DOI: 10.1073/pnas.082099299. View