» Articles » PMID: 39924304

Development and Validation of a Machine Learning Algorithm for Predicting Diabetes Retinopathy in Patients With Type 2 Diabetes: Algorithm Development Study

Overview
Journal JMIR Med Inform
Publisher JMIR Publications
Date 2025 Feb 9
PMID 39924304
Authors
Affiliations
Soon will be listed here.
Abstract

Background: Diabetic retinopathy (DR) is the leading cause of preventable blindness worldwide. Machine learning (ML) systems can enhance DR in community-based screening. However, predictive power models for usability and performance are still being determined.

Objective: This study used data from 3 university hospitals in South Korea to conduct a simple and accurate assessment of ML-based risk prediction for the development of DR that can be universally applied to adults with type 2 diabetes mellitus (T2DM).

Methods: DR was predicted using data from 2 independent electronic medical records: a discovery cohort (one hospital, n=14,694) and a validation cohort (2 hospitals, n=1856). The primary outcome was the presence of DR at 3 years. Different ML-based models were selected through hyperparameter tuning in the discovery cohort, and the area under the receiver operating characteristic (ROC) curve was analyzed in both cohorts.

Results: Among 14,694 patients screened for inclusion, 348 (2.37%) were diagnosed with DR. For DR, the extreme gradient boosting (XGBoost) system had an accuracy of 75.13% (95% CI 74.10-76.17), a sensitivity of 71.00% (95% CI 66.83-75.17), and a specificity of 75.23% (95% CI 74.16-76.31) in the original dataset. Among the validation datasets, XGBoost had an accuracy of 65.14%, a sensitivity of 64.96%, and a specificity of 65.15%. The most common feature in the XGBoost model is dyslipidemia, followed by cancer, hypertension, chronic kidney disease, neuropathy, and cardiovascular disease.

Conclusions: This approach shows the potential to enhance patient outcomes by enabling timely interventions in patients with T2DM, improving our understanding of contributing factors, and reducing DR-related complications. The proposed prediction model is expected to be both competitive and cost-effective, particularly for primary care settings in South Korea.

Citing Articles

Circulating CCN6/WISP3 in type 2 diabetes mellitus patients and its correlation with insulin resistance and inflammation: statistical and machine learning analyses.

Afrisham R, Jadidi Y, Moradi N, Ayyoubzadeh S, Fadaei R, Kiani Ghalesardi O BMC Med Inform Decis Mak. 2025; 25(1):114.

PMID: 40050813 PMC: 11887242. DOI: 10.1186/s12911-025-02957-1.

References
1.
Ting D, Yim-Lui Cheung C, Lim G, Wei Tan G, Quang N, Gan A . Development and Validation of a Deep Learning System for Diabetic Retinopathy and Related Eye Diseases Using Retinal Images From Multiethnic Populations With Diabetes. JAMA. 2017; 318(22):2211-2223. PMC: 5820739. DOI: 10.1001/jama.2017.18152. View

2.
Saeedi P, Petersohn I, Salpea P, Malanda B, Karuranga S, Unwin N . Global and regional diabetes prevalence estimates for 2019 and projections for 2030 and 2045: Results from the International Diabetes Federation Diabetes Atlas, 9 edition. Diabetes Res Clin Pract. 2019; 157:107843. DOI: 10.1016/j.diabres.2019.107843. View

3.
Hong J, Surapaneni A, Daya N, Selvin E, Coresh J, Grams M . Retinopathy and Risk of Kidney Disease in Persons With Diabetes. Kidney Med. 2021; 3(5):808-815.e1. PMC: 8515075. DOI: 10.1016/j.xkme.2021.04.018. View

4.
Zhou Y, Wang C, Shi K, Yin X . Relationship between dyslipidemia and diabetic retinopathy: A systematic review and meta-analysis. Medicine (Baltimore). 2018; 97(36):e12283. PMC: 6133445. DOI: 10.1097/MD.0000000000012283. View

5.
Lima V, Cavalieri G, Lima M, Nazario N, Lima G . Risk factors for diabetic retinopathy: a case-control study. Int J Retina Vitreous. 2016; 2:21. PMC: 5088444. DOI: 10.1186/s40942-016-0047-6. View