Estimation of Minimal Data Sets Sizes for Machine Learning Predictions in Digital Mental Health Interventions

Overview

Journal NPJ Digit Med

Specialty Medical Informatics

Date 2024 Dec 18

PMID 39695276

Authors

Kirsten Zantvoort

Barbara Nacke

Dennis Gorlich

Silvan Hornstein

Corinna Jacobi

Burkhardt Funk

Affiliations

Soon will be listed here.

Abstract

Artificial intelligence promises to revolutionize mental health care, but small dataset sizes and lack of robust methods raise concerns about result generalizability. To provide insights on minimal necessary data set sizes, we explore domain-specific learning curves for digital intervention dropout predictions based on 3654 users from a single study (ISRCTN13716228, 26/02/2016). Prediction performance is analyzed based on dataset size (N = 100-3654), feature groups (F = 2-129), and algorithm choice (from Naive Bayes to Neural Networks). The results substantiate the concern that small datasets (N ≤ 300) overestimate predictive power. For uninformative feature groups, in-sample prediction performance was negatively correlated with dataset size. Sophisticated models overfitted in small datasets but maximized holdout test results in larger datasets. While N = 500 mitigated overfitting, performance did not converge until N = 750-1500. Consequently, we propose minimum dataset sizes of N = 500-1000. As such, this study offers an empirical reference for researchers designing or interpreting AI studies on Digital Mental Health Intervention data.

Citing Articles

Prediction models for treatment response in migraine: a systematic review and meta-analysis.

Chen Q, Zhang J, Cao B, Hu Y, Kong Y, Li B J Headache Pain. 2025; 26(1):32.

PMID: 39939885 PMC: 11817351. DOI: 10.1186/s10194-025-01972-x.

References

Andaur Navarro C, Damen J, Takada T, Nijman S, Dhiman P, Ma J . Risk of bias in studies on prediction models developed using supervised machine learning techniques: systematic review. BMJ. 2021; 375:n2281. PMC: 8527348. DOI: 10.1136/bmj.n2281. View

van Smeden M, Moons K, de Groot J, Collins G, Altman D, Eijkemans M . Sample size for binary logistic prediction models: Beyond events per variable criteria. Stat Methods Med Res. 2018; 28(8):2455-2474. PMC: 6710621. DOI: 10.1177/0962280218784726. View

Chekroud A, Hawrilenko M, Loho H, Bondar J, Gueorguieva R, Hasan A . Illusory generalizability of clinical prediction models. Science. 2024; 383(6679):164-167. DOI: 10.1126/science.adg8538. View

Harris C, Millman K, van der Walt S, Gommers R, Virtanen P, Cournapeau D . Array programming with NumPy. Nature. 2020; 585(7825):357-362. PMC: 7759461. DOI: 10.1038/s41586-020-2649-2. View

Bricker J, Miao Z, Mull K, Santiago-Torres M, Vock D . Can a Single Variable Predict Early Dropout From Digital Health Interventions? Comparison of Predictive Models From Two Large Randomized Trials. J Med Internet Res. 2023; 25:e43629. PMC: 9898835. DOI: 10.2196/43629. View

Nacke B, Beintner I, Gorlich D, Vollert B, Schmidt-Hantke J, Hutter K . everyBody-Tailored online health promotion and eating disorder prevention for women: Study protocol of a dissemination trial. Internet Interv. 2019; 16:20-25. PMC: 6364518. DOI: 10.1016/j.invent.2018.02.008. View

Donkin L, Hickie I, Christensen H, Naismith S, Neal B, Cockayne N . Rethinking the dose-response relationship between usage and outcome in an online intervention for depression: randomized controlled trial. J Med Internet Res. 2013; 15(10):e231. PMC: 3806549. DOI: 10.2196/jmir.2771. View

Zantvoort K, Scharfenberger J, Boss L, Lehr D, Funk B . Finding the Best Match - a Case Study on the (Text-)Feature and Model Choice in Digital Mental Health Interventions. J Healthc Inform Res. 2023; 7(4):447-479. PMC: 10620349. DOI: 10.1007/s41666-023-00148-z. View

Linardon J, Shatte A, Messer M, Firth J, Fuller-Tyszkiewicz M . E-mental health interventions for the treatment and prevention of eating disorders: An updated systematic review and meta-analysis. J Consult Clin Psychol. 2020; 88(11):994-1007. DOI: 10.1037/ccp0000575. View

10.

Pedersen D, Mansourvar M, Sortso C, Schmidt T . Predicting Dropouts From an Electronic Health Platform for Lifestyle Interventions: Analysis of Methods and Predictors. J Med Internet Res. 2019; 21(9):e13617. PMC: 6753691. DOI: 10.2196/13617. View

11.

Hornstein S, Zantvoort K, Lueken U, Funk B, Hilbert K . Personalization strategies in digital mental health interventions: a systematic review and conceptual framework for depressive symptoms. Front Digit Health. 2023; 5:1170002. PMC: 10239832. DOI: 10.3389/fdgth.2023.1170002. View

12.

DeLong E, Delong D, Clarke-Pearson D . Comparing the areas under two or more correlated receiver operating characteristic curves: a nonparametric approach. Biometrics. 1988; 44(3):837-45. View

13.

Beintner I, Vollert B, Zarski A, Bolinski F, Musiat P, Gorlich D . Adherence Reporting in Randomized Controlled Trials Examining Manualized Multisession Online Interventions: Systematic Review of Practices and Proposal for Reporting Standards. J Med Internet Res. 2019; 21(8):e14181. PMC: 6713038. DOI: 10.2196/14181. View

14.

Spitzer R, Kroenke K, Williams J, Lowe B . A brief measure for assessing generalized anxiety disorder: the GAD-7. Arch Intern Med. 2006; 166(10):1092-7. DOI: 10.1001/archinte.166.10.1092. View

15.

Bremer V, Chow P, Funk B, Thorndike F, Ritterband L . Developing a Process for the Analysis of User Journeys and the Prediction of Dropout in Digital Health Interventions: Machine Learning Approach. J Med Internet Res. 2020; 22(10):e17738. PMC: 7657718. DOI: 10.2196/17738. View

16.

Bush K, Kivlahan D, McDonell M, Fihn S, Bradley K . The AUDIT alcohol consumption questions (AUDIT-C): an effective brief screening test for problem drinking. Ambulatory Care Quality Improvement Project (ACQUIP). Alcohol Use Disorders Identification Test. Arch Intern Med. 1998; 158(16):1789-95. DOI: 10.1001/archinte.158.16.1789. View

17.

Shatte A, Hutchinson D, Teague S . Machine learning in mental health: a scoping review of methods and applications. Psychol Med. 2019; 49(9):1426-1448. DOI: 10.1017/S0033291719000151. View

18.

Giesemann J, Delgadillo J, Schwartz B, Bennemann B, Lutz W . Predicting dropout from psychological treatment using different machine learning algorithms, resampling methods, and sample sizes. Psychother Res. 2023; 33(6):683-695. DOI: 10.1080/10503307.2022.2161432. View

19.

Jacobi C, Vollert B, Hutter K, von Bloh P, Eiterich N, Gorlich D . Indicated Web-Based Prevention for Women With Anorexia Nervosa Symptoms: Randomized Controlled Efficacy Trial. J Med Internet Res. 2022; 24(6):e35947. PMC: 9204567. DOI: 10.2196/35947. View

20.

Carey K, Neal D, Collins S . A psychometric analysis of the self-regulation questionnaire. Addict Behav. 2004; 29(2):253-60. DOI: 10.1016/j.addbeh.2003.08.001. View