REGULARIZATION FOR COX'S PROPORTIONAL HAZARDS MODEL WITH NP-DIMENSIONALITY
Overview
Authors
Affiliations
High throughput genetic sequencing arrays with thousands of measurements per sample and a great amount of related censored clinical data have increased demanding need for better measurement specific model selection. In this paper we establish strong oracle properties of non-concave penalized methods for non-polynomial (NP) dimensional data with censoring in the framework of Cox's proportional hazards model. A class of folded-concave penalties are employed and both LASSO and SCAD are discussed specifically. We unveil the question under which dimensionality and correlation restrictions can an oracle estimator be constructed and grasped. It is demonstrated that non-concave penalties lead to significant reduction of the "irrepresentable condition" needed for LASSO model selection consistency. The large deviation result for martingales, bearing interests of its own, is developed for characterizing the strong oracle property. Moreover, the non-concave regularized estimator, is shown to achieve asymptotically the information bound of the oracle estimator. A coordinate-wise algorithm is developed for finding the grid of solution paths for penalized hazard regression problems, and its performance is evaluated on simulated and gene association study examples.
Gene-environment interaction analysis under the Cox model.
Fang K, Li J, Xu Y, Ma S, Zhang Q Ann Inst Stat Math. 2025; 75(6):931-948.
PMID: 39990259 PMC: 11843211. DOI: 10.1007/s10463-023-00871-9.
EFFICIENT ESTIMATION OF THE MAXIMAL ASSOCIATION BETWEEN MULTIPLE PREDICTORS AND A SURVIVAL OUTCOME.
Huang T, Luedtke A, McKeague I Ann Stat. 2024; 51(5):1965-1988.
PMID: 38405375 PMC: 10888526. DOI: 10.1214/23-aos2313.
Testing and Confidence Intervals for High Dimensional Proportional Hazards Model.
Fang E, Ning Y, Liu H J R Stat Soc Series B Stat Methodol. 2023; 79(5):1415-1437.
PMID: 37854943 PMC: 10584375. DOI: 10.1111/rssb.12224.
Liu Y, Li G J Comput Biol. 2023; 30(6):663-677.
PMID: 37140454 PMC: 10282795. DOI: 10.1089/cmb.2022.0416.
High-Dimensional Survival Analysis: Methods and Applications.
Salerno S, Li Y Annu Rev Stat Appl. 2023; 10(1):25-49.
PMID: 36968638 PMC: 10038209. DOI: 10.1146/annurev-statistics-032921-022127.