High-dimensional Cox Models: the Choice of Penalty As Part of the Model Building Process

Overview

Journal Biom J

Specialty Public Health

Date 2010 Feb 19

PMID 20166132

Citations 29

Authors

Axel Benner

Manuela Zucknick

Thomas Hielscher

Carina Ittrich

Ulrich Mansmann

Affiliations

Soon will be listed here.

Abstract

The Cox proportional hazards regression model is the most popular approach to model covariate information for survival times. In this context, the development of high-dimensional models where the number of covariates is much larger than the number of observations (p>>n) is an ongoing challenge. A practicable approach is to use ridge penalized Cox regression in such situations. Beside focussing on finding the best prediction rule, one is often interested in determining a subset of covariates that are the most important ones for prognosis. This could be a gene set in the biostatistical analysis of microarray data. Covariate selection can then, for example, be done by L(1)-penalized Cox regression using the lasso (Tibshirani (1997). Statistics in Medicine 16, 385-395). Several approaches beyond the lasso, that incorporate covariate selection, have been developed in recent years. This includes modifications of the lasso as well as nonconvex variants such as smoothly clipped absolute deviation (SCAD) (Fan and Li (2001). Journal of the American Statistical Association 96, 1348-1360; Fan and Li (2002). The Annals of Statistics 30, 74-99). The purpose of this article is to implement them practically into the model building process when analyzing high-dimensional data with the Cox proportional hazards model. To evaluate penalized regression models beyond the lasso, we included SCAD variants and the adaptive lasso (Zou (2006). Journal of the American Statistical Association 101, 1418-1429). We compare them with "standard" applications such as ridge regression, the lasso, and the elastic net. Predictive accuracy, features of variable selection, and estimation bias will be studied to assess the practical use of these methods. We observed that the performance of SCAD and adaptive lasso is highly dependent on nontrivial preselection procedures. A practical solution to this problem does not yet exist. Since there is high risk of missing relevant covariates when using SCAD or adaptive lasso applied after an inappropriate initial selection step, we recommend to stay with lasso or the elastic net in actual data applications. But with respect to the promising results for truly sparse models, we see some advantage of SCAD and adaptive lasso, if better preselection procedures would be available. This requires further methodological research.

Citing Articles

Predictive Models for Long Term Survival of AML Patients Treated with Venetoclax and Azacitidine or 7+3 Based on Post Treatment Events and Responses: Retrospective Cohort Study.

Islam N, Reuben J, Dale J, Coates J, Sapiah K, Markson F JMIR Cancer. 2024; 10:e54740.

PMID: 39167784 PMC: 11375398. DOI: 10.2196/54740.

Learning from vertically distributed data across multiple sites: An efficient privacy-preserving algorithm for Cox proportional hazards model with variable selection.

Miao G, Yu L, Yang J, Bennett D, Zhao J, Wu S J Biomed Inform. 2023; 149():104581.

PMID: 38142903 PMC: 10996392. DOI: 10.1016/j.jbi.2023.104581.

Comparison of models for stroke-free survival prediction in patients with CADASIL.

Chhoa H, Chabriat H, Chevret S, Biard L Sci Rep. 2023; 13(1):22443.

PMID: 38105268 PMC: 10725863. DOI: 10.1038/s41598-023-49552-w.

Penalized variable selection in multi-parameter regression survival modeling.

Jaouimaa F, Ha I, Burke K Stat Methods Med Res. 2023; 32(12):2455-2471.

PMID: 37823396 PMC: 10710000. DOI: 10.1177/09622802231203322.

Target Genes of c-MYC and MYCN with Prognostic Power in Neuroblastoma Exhibit Different Expressions during Sympathoadrenal Development.

Yuan Y, Alzrigat M, Rodriguez-Garcia A, Wang X, Sjoberg Bexelius T, Johnsen J Cancers (Basel). 2023; 15(18).

PMID: 37760568 PMC: 10527308. DOI: 10.3390/cancers15184599.