» Articles » PMID: 39888901

On the Estimation of Inverse-probability-of-censoring Weights for the Evaluation of Survival Prediction Error

Overview
Journal PLoS One
Date 2025 Jan 31
PMID 39888901
Authors
Affiliations
Soon will be listed here.
Abstract

Inverse probability weighting (IPW) is a popular method for making inferences regarding unobserved or unobservable data of a target population based on observed data. This paper considers IPW applied to right-censored time-to-event data. We investigate the behavior of the inverse-probability-of-censoring weighted (IPCW) Brier score, which is frequently used to assess the predictive performance of time-to-event models. A key requirement of the IPCW Brier score is the estimation of the censoring distribution, which is needed to compute the weights. The established paradigm of splitting a dataset into a training and a test set for model fitting and evaluation raises the question which of these datasets to use in order to fit the censoring model. There seems to be considerable disagreement between authors with regards to this issue, and no standard has been established so far. To shed light on this important question, we conducted a comprehensive experimental study exploring various data scenarios and estimation schemes. We found that it is generally of little importance which dataset is used to model the censoring distribution. However, in some circumstances, such as in the case of a covariate-dependent censoring process, a small sample size, or when dealing with noisy data, it may be advisable to use the test set instead of the training set to model the censoring distribution. A detailed set of practical recommendations concludes our paper.

References
1.
Gerds T, Schumacher M . Efron-type measures of prediction error for survival analysis. Biometrics. 2007; 63(4):1283-7. DOI: 10.1111/j.1541-0420.2007.00832.x. View

2.
Friedman J, Hastie T, Tibshirani R . Regularization Paths for Generalized Linear Models via Coordinate Descent. J Stat Softw. 2010; 33(1):1-22. PMC: 2929880. View

3.
Jiang W, Sun H, Peng Y . Prediction accuracy for the cure probabilities in mixture cure models. Stat Methods Med Res. 2017; 26(5):2029-2041. DOI: 10.1177/0962280217708673. View

4.
Uno H, Cai T, Pencina M, DAgostino R, Wei L . On the C-statistics for evaluating overall adequacy of risk prediction procedures with censored survival data. Stat Med. 2011; 30(10):1105-17. PMC: 3079915. DOI: 10.1002/sim.4154. View

5.
Sonabend R, Kiraly F, Bender A, Bischl B, Lang M . mlr3proba: an R package for machine learning in survival analysis. Bioinformatics. 2021; 37(17):2789-2791. PMC: 8428574. DOI: 10.1093/bioinformatics/btab039. View