» Articles » PMID: 36224712

Accommodating Population Differences when Validating Risk Prediction Models

Overview
Journal Stat Med
Publisher Wiley
Specialty Public Health
Date 2022 Oct 12
PMID 36224712
Authors
Affiliations
Soon will be listed here.
Abstract

Validation of risk prediction models in independent data provides a more rigorous assessment of model performance than internal assessment, for example, done by cross-validation in the data used for model development. However, several differences between the populations that gave rise to the training and the validation data can lead to seemingly poor performance of a risk model. In this paper we formalize the notions of "similarity" or "relatedness" of the training and validation data, and define reproducibility and transportability. We address the impact of different distributions of model predictors and differences in verifying the disease status or outcome on measures of calibration, accuracy and discrimination of a model. When individual level information from both the training and validation data sets is available, we propose and study weighted versions of the validation metrics that adjust for differences in the risk factor distributions and in outcome verification between the training and validation data to provide a more comprehensive assessment of model performance. We provide conditions on the risk model and the populations that gave rise to the training and validation data that ensure a model's reproducibility or transportability, and show how to check these conditions using weighted and unweighted performance measures. We illustrate the method by developing and validating a model that predicts the risk of developing prostate cancer using data from two large prostate cancer screening trials.

Citing Articles

A constrained maximum likelihood approach to developing well-calibrated models for predicting binary outcomes.

Cao Y, Ma W, Zhao G, McCarthy A, Chen J Lifetime Data Anal. 2024; 30(3):624-648.

PMID: 38717617 PMC: 11634939. DOI: 10.1007/s10985-024-09628-9.

References
1.
Ankerst D, Boeck A, Freedland S, Thompson I, Cronin A, Roobol M . Evaluating the PCPT risk calculator in ten international biopsy cohorts: results from the Prostate Biopsy Collaborative Group. World J Urol. 2012; 30(2):181-7. PMC: 3616370. DOI: 10.1007/s00345-011-0818-5. View

2.
Andriole G, Crawford E, Grubb 3rd R, Buys S, Chia D, Church T . Mortality results from a randomized prostate-cancer screening trial. N Engl J Med. 2009; 360(13):1310-9. PMC: 2944770. DOI: 10.1056/NEJMoa0810696. View

3.
Begg C, Greenes R . Assessment of diagnostic tests when disease verification is subject to selection bias. Biometrics. 1983; 39(1):207-15. View

4.
Ankerst D, Hoefler J, Bock S, Goodman P, Vickers A, Hernandez J . Prostate Cancer Prevention Trial risk calculator 2.0 for the prediction of low- vs high-grade prostate cancer. Urology. 2014; 83(6):1362-7. PMC: 4035700. DOI: 10.1016/j.urology.2014.02.035. View

5.
Song X, Yu A, Kellum J, Waitman L, Matheny M, Simpson S . Cross-site transportability of an explainable artificial intelligence model for acute kidney injury prediction. Nat Commun. 2020; 11(1):5668. PMC: 7653032. DOI: 10.1038/s41467-020-19551-w. View