Estimating False Discovery Proportion Under Arbitrary Covariance Dependence
Overview
Authors
Affiliations
Multiple hypothesis testing is a fundamental problem in high dimensional inference, with wide applications in many scientific fields. In genome-wide association studies, tens of thousands of tests are performed simultaneously to find if any SNPs are associated with some traits and those tests are correlated. When test statistics are correlated, false discovery control becomes very challenging under arbitrary dependence. In the current paper, we propose a novel method based on principal factor approximation, which successfully subtracts the common dependence and weakens significantly the correlation structure, to deal with an arbitrary dependence structure. We derive an approximate expression for false discovery proportion (FDP) in large scale multiple testing when a common threshold is used and provide a consistent estimate of realized FDP. This result has important applications in controlling FDR and FDP. Our estimate of realized FDP compares favorably with Efron (2007)'s approach, as demonstrated in the simulated examples. Our approach is further illustrated by some real data applications. We also propose a dependence-adjusted procedure, which is more powerful than the fixed threshold procedure.
Asymptotic uncertainty of false discovery proportion.
Mei M, Yu T, Jiang Y Biometrics. 2024; 80(1).
PMID: 38497826 PMC: 10946230. DOI: 10.1093/biomtc/ujae015.
Optimal Estimation of Genetic Relatedness in High-dimensional Linear Models.
Guo Z, Wang W, Cai T, Li H J Am Stat Assoc. 2024; 114(525):358-369.
PMID: 38434789 PMC: 10907007. DOI: 10.1080/01621459.2017.1407774.
Rastaghi S, Saki A, Tabesh H BMC Bioinformatics. 2024; 25(1):57.
PMID: 38317067 PMC: 10840263. DOI: 10.1186/s12859-024-05678-w.
Transfer learning with false negative control improves polygenic risk prediction.
Jeng X, Hu Y, Venkat V, Lu T, Tzeng J PLoS Genet. 2023; 19(11):e1010597.
PMID: 38011285 PMC: 10723713. DOI: 10.1371/journal.pgen.1010597.
Mixture prior for sparse signals with dependent covariance structure.
Wang L, Liao Z PLoS One. 2023; 18(4):e0284284.
PMID: 37104465 PMC: 10138223. DOI: 10.1371/journal.pone.0284284.