
Sparse Linear Discriminant Analysis for Multiview Structured Data

Overview
Journal: Biometrics
Specialty: Public Health
Date: 2021 Mar 19
PMID: 33739448
Citations: 13
Abstract

Classification methods that leverage the strengths of data from multiple sources (multiview data) simultaneously have enormous potential to yield more powerful findings than two-step methods that perform association followed by classification. We propose two methods for joint association and classification studies: sparse integrative discriminant analysis (SIDA) and SIDA with incorporation of network information (SIDANet). Both methods consider the overall association between the views and the separation within each view when choosing discriminant vectors that are associated and optimally separate subjects into different classes. SIDANet is among the first methods to incorporate prior structural information in joint association and classification studies. It uses the normalized Laplacian of a graph to smooth the coefficients of predictor variables, thus encouraging the selection of predictors that are connected. We demonstrate the effectiveness of our methods on a set of synthetic datasets and explore their use in identifying potential nontraditional risk factors that discriminate healthy patients at low versus high risk of developing atherosclerotic cardiovascular disease within 10 years. Our findings underscore the benefit of joint association and classification methods when the goal is both to correlate multiview data and to perform classification.
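
As an illustration of the Laplacian smoothing idea mentioned in the abstract, here is a minimal sketch in plain Python. It is not the authors' SIDANet implementation. It only shows how a normalized graph Laplacian, L = I - D^{-1/2} A D^{-1/2}, yields a quadratic penalty on a coefficient vector that is small when connected predictors have similar degree-scaled coefficients. The function names, the toy graph, and the coefficient vectors are hypothetical, and the graph is assumed to be undirected with no self-loops.

```python
import math

def normalized_laplacian(adj):
    """Return L = I - D^{-1/2} A D^{-1/2} for a symmetric adjacency matrix.

    Assumes an undirected graph without self-loops, given as a list of lists.
    Isolated nodes (degree 0) get an all-zero row and column.
    """
    n = len(adj)
    deg = [sum(row) for row in adj]
    lap = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(n):
            if i == j:
                lap[i][j] = 1.0 if deg[i] > 0 else 0.0
            elif adj[i][j] != 0 and deg[i] > 0 and deg[j] > 0:
                lap[i][j] = -adj[i][j] / math.sqrt(deg[i] * deg[j])
    return lap

def smoothness_penalty(beta, lap):
    """Quadratic form beta^T L beta; large when connected coefficients disagree."""
    n = len(beta)
    return sum(beta[i] * lap[i][j] * beta[j] for i in range(n) for j in range(n))

# Toy graph (hypothetical): predictors 0-1-2 form a path, predictor 3 is isolated.
adj = [
    [0, 1, 0, 0],
    [1, 0, 1, 0],
    [0, 1, 0, 0],
    [0, 0, 0, 0],
]
lap = normalized_laplacian(adj)

# Coefficients proportional to sqrt(degree) vary smoothly over the path.
smooth = [1.0, math.sqrt(2.0), 1.0, 0.0]
# Flipping the sign of the middle coefficient makes neighbours disagree.
rough = [1.0, -math.sqrt(2.0), 1.0, 0.0]

penalty_smooth = smoothness_penalty(smooth, lap)
penalty_rough = smoothness_penalty(rough, lap)
print(penalty_smooth < penalty_rough)
```

In a penalized discriminant analysis, a term of this form would typically be added to the objective with a tuning parameter, alongside a sparsity penalty. How SIDANet combines these terms is described in the paper itself and is not reproduced here.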

Citing Articles

Scalable randomized kernel methods for multiview data integration and prediction with application to Coronavirus disease.

Safo S, Lu H. Biostatistics. 2025; 26(1).

PMID: 39973130 PMC: 11839864. DOI: 10.1093/biostatistics/kxaf001.


NMFProfiler: a multi-omics integration method for samples stratified in groups.

Mercadie A, Gravier E, Josse G, Fournier I, Viode C, Vialaneix N. Bioinformatics. 2025; 41(2).

PMID: 39921890 PMC: 11855281. DOI: 10.1093/bioinformatics/btaf066.


HIP: a method for high-dimensional multi-view data integration and prediction accounting for subgroup heterogeneity.

Butts J, Verace L, Wendt C, Bowler R, Hersh C, Long Q. Brief Bioinform. 2024; 25(6).

PMID: 39344710 PMC: 11440091. DOI: 10.1093/bib/bbae470.


Molecular Evaluation of the mRNA Expression of the , , , and Genes Linked to Fluconazole Resistance in in a Colombian Population.

Cardenas Parra L, Rojas Rodriguez A, Perez Cardenas J, Perez-Agudelo J. J Fungi (Basel). 2024; 10(7).

PMID: 39057394 PMC: 11277825. DOI: 10.3390/jof10070509.


Deep IDA: a deep learning approach for integrative discriminant analysis of multi-omics data with feature ranking-an application to COVID-19.

Wang J, Safo S. Bioinform Adv. 2024; 4(1):vbae060.

PMID: 39027641 PMC: 11256945. DOI: 10.1093/bioadv/vbae060.

