Robust Bayesian Variable Selection for Gene-environment Interactions
Overview
Affiliations
Gene-environment (G× E) interactions have important implications to elucidate the etiology of complex diseases beyond the main genetic and environmental effects. Outliers and data contamination in disease phenotypes of G× E studies have been commonly encountered, leading to the development of a broad spectrum of robust regularization methods. Nevertheless, within the Bayesian framework, the issue has not been taken care of in existing studies. We develop a fully Bayesian robust variable selection method for G× E interaction studies. The proposed Bayesian method can effectively accommodate heavy-tailed errors and outliers in the response variable while conducting variable selection by accounting for structural sparsity. In particular, for the robust sparse group selection, the spike-and-slab priors have been imposed on both individual and group levels to identify important main and interaction effects robustly. An efficient Gibbs sampler has been developed to facilitate fast computation. Extensive simulation studies, analysis of diabetes data with single-nucleotide polymorphism measurements from the Nurses' Health Study, and The Cancer Genome Atlas melanoma data with gene expression measurements demonstrate the superior performance of the proposed method over multiple competing alternatives.
Sun N, Han Q, Wang Y, Sun M, Sun Z, Sun H BMC Bioinformatics. 2025; 26(1):58.
PMID: 39966697 PMC: 11834309. DOI: 10.1186/s12859-025-06077-5.
Fan K, Subedi S, Yang G, Lu X, Ren J, Wu C Entropy (Basel). 2024; 26(9).
PMID: 39330127 PMC: 11430850. DOI: 10.3390/e26090794.
The spike-and-slab quantile LASSO for robust variable selection in cancer genomics studies.
Liu Y, Ren J, Ma S, Wu C Stat Med. 2024; 43(26):4928-4983.
PMID: 39260448 PMC: 11585335. DOI: 10.1002/sim.10196.
The Bayesian Regularized Quantile Varying Coefficient Model.
Zhou F, Ren J, Ma S, Wu C Comput Stat Data Anal. 2024; 187.
PMID: 38746689 PMC: 11090482. DOI: 10.1016/j.csda.2023.107808.
Hierarchical False Discovery Rate Control for High-dimensional Survival Analysis with Interactions.
Liang W, Zhang Q, Ma S Comput Stat Data Anal. 2023; 192.
PMID: 38098875 PMC: 10718515. DOI: 10.1016/j.csda.2023.107906.