Accommodating Population Differences when Validating Risk Prediction Models

Overview

Journal Stat Med

Publisher Wiley

Specialty Public Health

Date 2022 Oct 12

PMID 36224712

Authors

Ruth M Pfeiffer

Yiyao Chen

Mitchell H Gail

Donna P Ankerst


Abstract

Validation of risk prediction models in independent data provides a more rigorous assessment of model performance than internal assessment, for example, done by cross-validation in the data used for model development. However, several differences between the populations that gave rise to the training and the validation data can lead to seemingly poor performance of a risk model. In this paper we formalize the notions of "similarity" or "relatedness" of the training and validation data, and define reproducibility and transportability. We address the impact of different distributions of model predictors and differences in verifying the disease status or outcome on measures of calibration, accuracy and discrimination of a model. When individual level information from both the training and validation data sets is available, we propose and study weighted versions of the validation metrics that adjust for differences in the risk factor distributions and in outcome verification between the training and validation data to provide a more comprehensive assessment of model performance. We provide conditions on the risk model and the populations that gave rise to the training and validation data that ensure a model's reproducibility or transportability, and show how to check these conditions using weighted and unweighted performance measures. We illustrate the method by developing and validating a model that predicts the risk of developing prostate cancer using data from two large prostate cancer screening trials.
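The idea of weighting validation metrics can be illustrated with a minimal sketch. It is not the authors' estimator: it only shows generic importance weighting, in which each validation subject is weighted by the ratio of the training to the validation predictor distribution before an observed-to-expected ratio (calibration) and an AUC (discrimination) are computed. The function names, the restriction to a single categorical predictor, and the toy data are all assumptions made for illustration, and the sketch does not address differences in outcome verification.

```python
import numpy as np

def category_weights(x_train, x_val):
    """Hypothetical helper: weights p_train(x) / p_val(x) for one categorical
    predictor, evaluated at each validation subject's category."""
    x_train, x_val = np.asarray(x_train), np.asarray(x_val)
    cats, inverse, counts = np.unique(x_val, return_inverse=True, return_counts=True)
    p_val = counts / len(x_val)                      # category shares in validation data
    p_train = np.array([np.mean(x_train == c) for c in cats])  # shares in training data
    return (p_train / p_val)[inverse]

def weighted_oe_ratio(y, risk, w):
    """Weighted observed-to-expected ratio: sum(w * y) / sum(w * risk)."""
    y, risk, w = (np.asarray(a, dtype=float) for a in (y, risk, w))
    return np.sum(w * y) / np.sum(w * risk)

def weighted_auc(y, risk, w):
    """Weighted AUC: weighted share of (case, non-case) pairs in which the
    case has the higher predicted risk; ties count one half."""
    y = np.asarray(y)
    risk, w = np.asarray(risk, dtype=float), np.asarray(w, dtype=float)
    r1, w1 = risk[y == 1], w[y == 1]
    r0, w0 = risk[y == 0], w[y == 0]
    diff = r1[:, None] - r0[None, :]
    score = (diff > 0) + 0.5 * (diff == 0)
    pair_w = w1[:, None] * w0[None, :]               # weight of each case/non-case pair
    return np.sum(pair_w * score) / np.sum(pair_w)

# Toy data (made up): one binary predictor, outcomes, and model-predicted risks.
x_train = [0, 0, 0, 1]
x_val = [0, 1, 1, 1]
y_val = [1, 1, 0, 0]
risk_val = [0.9, 0.3, 0.5, 0.1]

w = category_weights(x_train, x_val)
print("unweighted O/E:", weighted_oe_ratio(y_val, risk_val, np.ones(4)))
print("weighted O/E:  ", weighted_oe_ratio(y_val, risk_val, w))
print("unweighted AUC:", weighted_auc(y_val, risk_val, np.ones(4)))
print("weighted AUC:  ", weighted_auc(y_val, risk_val, w))
```

Comparing the weighted and unweighted values gives a rough sense of how much of an apparent performance gap could be attributed to a different predictor distribution rather than to the model itself. With several or continuous predictors, the weights would typically have to be estimated by a model rather than by category frequencies.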

Citing Articles

A constrained maximum likelihood approach to developing well-calibrated models for predicting binary outcomes.

Cao Y, Ma W, Zhao G, McCarthy A, Chen J. Lifetime Data Anal. 2024; 30(3):624-648.

PMID: 38717617 PMC: 11634939. DOI: 10.1007/s10985-024-09628-9.
