Weibull Regression with Bayesian Variable Selection to Identify Prognostic Tumour Markers of Breast Cancer Survival
Overview
Science
Affiliations
As data-rich medical datasets are becoming routinely collected, there is a growing demand for regression methodology that facilitates variable selection over a large number of predictors. Bayesian variable selection algorithms offer an attractive solution, whereby a sparsity inducing prior allows inclusion of sets of predictors simultaneously, leading to adjusted effect estimates and inference of which covariates are most important. We present a new implementation of Bayesian variable selection, based on a Reversible Jump MCMC algorithm, for survival analysis under the Weibull regression model. A realistic simulation study is presented comparing against an alternative LASSO-based variable selection strategy in datasets of up to 20,000 covariates. Across half the scenarios, our new method achieved identical sensitivity and specificity to the LASSO strategy, and a marginal improvement otherwise. Runtimes were comparable for both approaches, taking approximately a day for 20,000 covariates. Subsequently, we present a real data application in which 119 protein-based markers are explored for association with breast cancer survival in a case cohort of 2287 patients with oestrogen receptor-positive disease. Evidence was found for three independent prognostic tumour markers of survival, one of which is novel. Our new approach demonstrated the best specificity.
Reeder H, Haneuse S, Lee K Stat Methods Med Res. 2024; 33(8):1412-1423.
PMID: 39053572 PMC: 11833807. DOI: 10.1177/09622802241262523.
Linked shrinkage to improve estimation of interaction effects in regression models.
van de Wiel M, Amestoy M, Hoogland J Epidemiol Methods. 2024; 13(1):20230039.
PMID: 38989109 PMC: 11232106. DOI: 10.1515/em-2023-0039.
Adaptive MCMC for Bayesian Variable Selection in Generalised Linear Models and Survival Models.
Liang X, Livingstone S, Griffin J Entropy (Basel). 2023; 25(9).
PMID: 37761609 PMC: 10528396. DOI: 10.3390/e25091310.
Yeh C, Liao G, Emura T Biomedicines. 2023; 11(3).
PMID: 36979776 PMC: 10045003. DOI: 10.3390/biomedicines11030797.
Controlled variable selection in Weibull mixture cure models for high-dimensional data.
Fu H, Nicolet D, Mrozek K, Stone R, Eisfeld A, Byrd J Stat Med. 2022; 41(22):4340-4366.
PMID: 35792553 PMC: 9545322. DOI: 10.1002/sim.9513.