» Articles » PMID: 37598153

Sample Size Requirements Are Not Being Considered in Studies Developing Prediction Models for Binary Outcomes: a Systematic Review

Overview
Publisher Biomed Central
Date 2023 Aug 19
PMID 37598153
Authors
Affiliations
Soon will be listed here.
Abstract

Background: Having an appropriate sample size is important when developing a clinical prediction model. We aimed to review how sample size is considered in studies developing a prediction model for a binary outcome.

Methods: We searched PubMed for studies published between 01/07/2020 and 30/07/2020 and reviewed the sample size calculations used to develop the prediction models. Using the available information, we calculated the minimum sample size that would be needed to estimate overall risk and minimise overfitting in each study and summarised the difference between the calculated and used sample size.

Results: A total of 119 studies were included, of which nine studies provided sample size justification (8%). The recommended minimum sample size could be calculated for 94 studies: 73% (95% CI: 63-82%) used sample sizes lower than required to estimate overall risk and minimise overfitting including 26% studies that used sample sizes lower than required to estimate overall risk only. A similar number of studies did not meet the ≥ 10EPV criteria (75%, 95% CI: 66-84%). The median deficit of the number of events used to develop a model was 75 [IQR: 234 lower to 7 higher]) which reduced to 63 if the total available data (before any data splitting) was used [IQR:225 lower to 7 higher]. Studies that met the minimum required sample size had a median c-statistic of 0.84 (IQR:0.80 to 0.9) and studies where the minimum sample size was not met had a median c-statistic of 0.83 (IQR: 0.75 to 0.9). Studies that met the ≥ 10 EPP criteria had a median c-statistic of 0.80 (IQR: 0.73 to 0.84).

Conclusions: Prediction models are often developed with no sample size calculation, as a consequence many are too small to precisely estimate the overall risk. We encourage researchers to justify, perform and report sample size calculations when developing a prediction model.

Citing Articles

Risk prediction tools for pressure injury occurrence: an umbrella review of systematic reviews reporting model development and validation methods.

Hillier B, Scandrett K, Coombe A, Hernandez-Boussard T, Steyerberg E, Takwoingi Y Diagn Progn Res. 2025; 9(1):2.

PMID: 39806510 PMC: 11730812. DOI: 10.1186/s41512-024-00182-4.


Lipoprotein(a), high-sensitivity c-reactive protein, homocysteine and cardiovascular disease in the Multi-Ethnic Study of Atherosclerosis.

Nomura S, Bhatia H, Garg P, Karger A, Guan W, Cao J Am J Prev Cardiol. 2025; 21():100903.

PMID: 39802678 PMC: 11722194. DOI: 10.1016/j.ajpc.2024.100903.


Overlooked and underpowered: a meta-research addressing sample size in radiomics prediction models for binary outcomes.

Zhong J, Liu X, Lu J, Yang J, Zhang G, Mao S Eur Radiol. 2025; 35(3):1146-1156.

PMID: 39789271 PMC: 11835977. DOI: 10.1007/s00330-024-11331-0.


Machine Learning and Metabolomics Predict Mesenchymal Stem Cell Osteogenic Differentiation in 2D and 3D Cultures.

Klontzas M, Vernardis S, Batsali A, Papadogiannis F, Panoskaltsis N, Mantalaris A J Funct Biomater. 2024; 15(12).

PMID: 39728167 PMC: 11680063. DOI: 10.3390/jfb15120367.


Prognostic models for prediction of perioperative allogeneic red blood cell transfusion in adult cardiac surgery: A systematic review and meta-analysis.

Van den Eynde R, Vrancken A, Foubert R, Tuand K, Vandendriessche T, Schrijvers A Transfusion. 2024; 65(2):397-409.

PMID: 39726297 PMC: 11826302. DOI: 10.1111/trf.18108.


References
1.
Riley R, Snell K, Martin G, Whittle R, Archer L, Sperrin M . Penalization and shrinkage methods produced unreliable clinical prediction models especially when sample size was small. J Clin Epidemiol. 2020; 132:88-96. PMC: 8026952. DOI: 10.1016/j.jclinepi.2020.12.005. View

2.
Raseta M, Bazarova A, Wright H, Parrott A, Nayak S . A novel toolkit for the prediction of clinical outcomes following mechanical thrombectomy. Clin Radiol. 2020; 75(10):795.e15-795.e21. DOI: 10.1016/j.crad.2020.06.026. View

3.
van Smeden M, Moons K, de Groot J, Collins G, Altman D, Eijkemans M . Sample size for binary logistic prediction models: Beyond events per variable criteria. Stat Methods Med Res. 2018; 28(8):2455-2474. PMC: 6710621. DOI: 10.1177/0962280218784726. View

4.
Moons K, Wolff R, Riley R, Whiting P, Westwood M, Collins G . PROBAST: A Tool to Assess Risk of Bias and Applicability of Prediction Model Studies: Explanation and Elaboration. Ann Intern Med. 2019; 170(1):W1-W33. DOI: 10.7326/M18-1377. View

5.
Riley R, Snell K, Ensor J, Burke D, Harrell Jr F, Moons K . Minimum sample size for developing a multivariable prediction model: PART II - binary and time-to-event outcomes. Stat Med. 2018; 38(7):1276-1296. PMC: 6519266. DOI: 10.1002/sim.7992. View