» Articles » PMID: 40041743

Principal Variables Analysis for Non-Gaussian Data

Overview
Date 2025 Mar 5
PMID 40041743
Authors
Affiliations
Soon will be listed here.
Abstract

Principal variables analysis (PVA) is a technique for selecting a subset of variables that capture as much of the information in a dataset as possible. Existing approaches for PVA are based on the Pearson correlation matrix, which is not well-suited to describing the relationships between non-Gaussian variables. We propose a generalized approach to PVA enabling the use of different types of correlation, and we explore using Spearman, Gaussian copula, and polychoric correlations as alternatives to Pearson correlation. We compare performance in simulation studies varying the form of the true multivariate distribution over a range of possibilities. Our results show that on continuous non-Gaussian data, using generalized PVA with Gaussian copula or Spearman correlations provides a major improvement in performance compared to Pearson. On ordinal data, generalized PVA with polychoric correlations outperforms the rest by a wide margin. We apply generalized PVA to a dataset of 102 clinical variables measured on individuals with X-linked dystonia parkinsonism (XDP), a neurodegenerative disorder involving symptoms of both dystonia and parkinsonism. We find that using different types of correlation yields substantively different sets of principal variables; for example, parkinsonism-related metrics appear more explanatory than dystonia-related metrics on the observed data. Supplementary materials are available online.

References
1.
Aneichyk T, Hendriks W, Yadav R, Shin D, Gao D, Vaine C . Dissecting the Causal Mechanism of X-Linked Dystonia-Parkinsonism by Integrating Genome and Transcriptome Assembly. Cell. 2018; 172(5):897-909.e21. PMC: 5831509. DOI: 10.1016/j.cell.2018.02.011. View

2.
Makino S, Kaji R, Ando S, Tomizawa M, Yasuno K, Goto S . Reduced neuron-specific expression of the TAF1 gene is associated with X-linked dystonia-parkinsonism. Am J Hum Genet. 2007; 80(3):393-406. PMC: 1821114. DOI: 10.1086/512129. View

3.
Lee L, Rivera C, Teleg R, Dantes M, Pasco P, Jamora R . The unique phenomenology of sex-linked dystonia parkinsonism (XDP, DYT3, "Lubag"). Int J Neurosci. 2010; 121 Suppl 1:3-11. DOI: 10.3109/00207454.2010.526728. View

4.
Lee L, PASCASIO F, Fuentes F, Viterbo G . Torsion dystonia in Panay, Philippines. Adv Neurol. 1976; 14:137-51. View

5.
Beale E, KENDALL M, Mann D . The discarding of variables in multivariate analysis. Biometrika. 1967; 54(3):357-66. View