» Articles » PMID: 25193065

Weibull Regression with Bayesian Variable Selection to Identify Prognostic Tumour Markers of Breast Cancer Survival

Overview
Publisher Sage Publications
Specialties Public Health
Science
Date 2014 Sep 7
PMID 25193065
Citations 15
Authors
Affiliations
Soon will be listed here.
Abstract

As data-rich medical datasets are becoming routinely collected, there is a growing demand for regression methodology that facilitates variable selection over a large number of predictors. Bayesian variable selection algorithms offer an attractive solution, whereby a sparsity inducing prior allows inclusion of sets of predictors simultaneously, leading to adjusted effect estimates and inference of which covariates are most important. We present a new implementation of Bayesian variable selection, based on a Reversible Jump MCMC algorithm, for survival analysis under the Weibull regression model. A realistic simulation study is presented comparing against an alternative LASSO-based variable selection strategy in datasets of up to 20,000 covariates. Across half the scenarios, our new method achieved identical sensitivity and specificity to the LASSO strategy, and a marginal improvement otherwise. Runtimes were comparable for both approaches, taking approximately a day for 20,000 covariates. Subsequently, we present a real data application in which 119 protein-based markers are explored for association with breast cancer survival in a case cohort of 2287 patients with oestrogen receptor-positive disease. Evidence was found for three independent prognostic tumour markers of survival, one of which is novel. Our new approach demonstrated the best specificity.

Citing Articles

Group lasso priors for Bayesian accelerated failure time models with left-truncated and interval-censored data.

Reeder H, Haneuse S, Lee K Stat Methods Med Res. 2024; 33(8):1412-1423.

PMID: 39053572 PMC: 11833807. DOI: 10.1177/09622802241262523.


Linked shrinkage to improve estimation of interaction effects in regression models.

van de Wiel M, Amestoy M, Hoogland J Epidemiol Methods. 2024; 13(1):20230039.

PMID: 38989109 PMC: 11232106. DOI: 10.1515/em-2023-0039.


Adaptive MCMC for Bayesian Variable Selection in Generalised Linear Models and Survival Models.

Liang X, Livingstone S, Griffin J Entropy (Basel). 2023; 25(9).

PMID: 37761609 PMC: 10528396. DOI: 10.3390/e25091310.


Sensitivity Analysis for Survival Prognostic Prediction with Gene Selection: A Copula Method for Dependent Censoring.

Yeh C, Liao G, Emura T Biomedicines. 2023; 11(3).

PMID: 36979776 PMC: 10045003. DOI: 10.3390/biomedicines11030797.


Controlled variable selection in Weibull mixture cure models for high-dimensional data.

Fu H, Nicolet D, Mrozek K, Stone R, Eisfeld A, Byrd J Stat Med. 2022; 41(22):4340-4366.

PMID: 35792553 PMC: 9545322. DOI: 10.1002/sim.9513.


References
1.
Haibe-Kains B, Desmedt C, Loi S, Culhane A, Bontempi G, Quackenbush J . A three-gene model to robustly identify breast cancer molecular subtypes. J Natl Cancer Inst. 2012; 104(4):311-25. PMC: 3283537. DOI: 10.1093/jnci/djr545. View

2.
Tachmazidou I, Johnson M, De Iorio M . Bayesian variable selection for survival regression in genetics. Genet Epidemiol. 2010; 34(7):689-701. DOI: 10.1002/gepi.20530. View

3.
Buuren S, Boshuizen H, Knook D . Multiple imputation of missing blood pressure covariates in survival analysis. Stat Med. 1999; 18(6):681-94. DOI: 10.1002/(sici)1097-0258(19990330)18:6<681::aid-sim71>3.0.co;2-r. View

4.
Zoller M . CD44: can a cancer-initiating cell profit from an abundantly expressed molecule?. Nat Rev Cancer. 2011; 11(4):254-67. DOI: 10.1038/nrc3023. View

5.
Wishart G, Azzato E, Greenberg D, Rashbass J, Kearins O, Lawrence G . PREDICT: a new UK prognostic model that predicts survival following surgery for invasive breast cancer. Breast Cancer Res. 2010; 12(1):R1. PMC: 2880419. DOI: 10.1186/bcr2464. View