» Articles » PMID: 39361401

Machine Learning-Based Prediction of Neurodegenerative Disease in Patients With Type 2 Diabetes by Derivation and Validation in 2 Independent Korean Cohorts: Model Development and Validation Study

Overview
Publisher JMIR Publications
Date 2024 Oct 3
PMID 39361401
Authors
Affiliations
Soon will be listed here.
Abstract

Background: Several machine learning (ML) prediction models for neurodegenerative diseases (NDs) in type 2 diabetes mellitus (T2DM) have recently been developed. However, the predictive power of these models is limited by the lack of multiple risk factors.

Objective: This study aimed to assess the validity and use of an ML model for predicting the 3-year incidence of ND in patients with T2DM.

Methods: We used data from 2 independent cohorts-the discovery cohort (1 hospital; n=22,311) and the validation cohort (2 hospitals; n=2915)-to predict ND. The outcome of interest was the presence or absence of ND at 3 years. We selected different ML-based models with hyperparameter tuning in the discovery cohort and conducted an area under the receiver operating characteristic curve (AUROC) analysis in the validation cohort.

Results: The study dataset included 22,311 (discovery) and 2915 (validation) patients with T2DM recruited between 2008 and 2022. ND was observed in 133 (0.6%) and 15 patients (0.5%) in the discovery and validation cohorts, respectively. The AdaBoost model had a mean AUROC of 0.82 (95% CI 0.79-0.85) in the discovery dataset. When this result was applied to the validation dataset, the AdaBoost model exhibited the best performance among the models, with an AUROC of 0.83 (accuracy of 78.6%, sensitivity of 78.6%, specificity of 78.6%, and balanced accuracy of 78.6%). The most influential factors in the AdaBoost model were age and cardiovascular disease.

Conclusions: This study shows the use and feasibility of ML for assessing the incidence of ND in patients with T2DM and suggests its potential for use in screening patients. Further international studies are required to validate these findings.

Citing Articles

Prediction model for type 2 diabetes mellitus and its association with mortality using machine learning in three independent cohorts from South Korea, Japan, and the UK: a model development and validation study.

Lee H, Hwang S, Park S, Choi Y, Lee S, Park J EClinicalMedicine. 2025; 80:103069.

PMID: 39896872 PMC: 11787438. DOI: 10.1016/j.eclinm.2025.103069.


Predictive model for abdominal liposuction volume in patients with obesity using machine learning in a longitudinal multi-center study in Korea.

Sang H, Park J, Kim S, Lee M, Lee H, Lee S Sci Rep. 2024; 14(1):29791.

PMID: 39616163 PMC: 11608244. DOI: 10.1038/s41598-024-79654-y.

References
1.
Rhee S, Sung J, Kim S, Cho I, Lee S, Chang H . Development and Validation of a Deep Learning Based Diabetes Prediction System Using a Nationwide Population-Based Cohort. Diabetes Metab J. 2021; 45(4):515-525. PMC: 8369223. DOI: 10.4093/dmj.2020.0081. View

2.
Driver J . Inverse association between cancer and neurodegenerative disease: review of the epidemiologic and biological evidence. Biogerontology. 2014; 15(6):547-57. DOI: 10.1007/s10522-014-9523-2. View

3.
Xu W, Atti A, Gatz M, Pedersen N, Johansson B, Fratiglioni L . Midlife overweight and obesity increase late-life dementia risk: a population-based twin study. Neurology. 2011; 76(18):1568-74. PMC: 3100125. DOI: 10.1212/WNL.0b013e3182190d09. View

4.
Kim J, Lee H, Lee J, Rhee S, Shin J, Lee S . Quantification of identifying cognitive impairment using olfactory-stimulated functional near-infrared spectroscopy with machine learning: a post hoc analysis of a diagnostic trial and validation of an external additional trial. Alzheimers Res Ther. 2023; 15(1):127. PMC: 10362671. DOI: 10.1186/s13195-023-01268-9. View

5.
Meamar R, Maracy M, Chitsaz A, Aghaye Ghazvini M, Izadi M, Tanhaei A . Association between serum biochemical levels, related to bone metabolism and Parkinson's disease. J Res Med Sci. 2013; 18(Suppl 1):S39-42. PMC: 3743317. View