» Articles » PMID: 39650520

Integrated Bagging-RF Learning Model for Diabetes Diagnosis in Middle-aged and Elderly Population

Overview
Date 2024 Dec 9
PMID 39650520
Authors
Affiliations
Soon will be listed here.
Abstract

As the population ages, the increase in the number of middle-aged and older adults with diabetes poses new challenges to the allocation of resources in the healthcare system. Developing accurate diabetes prediction models is a critical public health strategy to improve the efficient use of healthcare resources and ensure timely and effective treatment. In order to improve the identification of diabetes in middle-aged and older patients, a Bagging-RF model is proposed. In the study, two diabetes datasets on Kaggle were first preprocessed, including unique heat coding, outlier removal, and age screening, after which the data were categorized into three age groups, 50-60, 60-70, and 70-80, and balanced using the SMOTE technique. Then, the machine learning classifiers were trained using the Bagging-RF integrated model with eight other machine learning classifiers. Finally, the model's performance was evaluated by accuracy, 1 score, and other metrics. The results showed that the Bagging-RF model outperformed the other eight machine learning classifiers, exhibiting 97.35%, 95.55%, 95.14% accuracy and 97.35%, 97.35%, 95.14% 1 Score at the Diabetes Prediction Dataset for diabetes prediction for the three age groups of 50-60, 60-70, and 70-80; and 97.03%, 94.90%, 93.70% accuracy and 97.03%, 94.90%, 93.70% 1 Score at the Diabetes Prediction Dataset. 95.55%, 95.13% 1 Score; and 97.03%, 94.90%, 93.70% accuracy; and 97.03%, 94.89%, 93.70% 1 Score at Diabetes Prediction Dataset. In addition, while other integrated learning models, such as ET, RF, Adaboost, and XGB, fail to outperform Bagging-RF, they also show excellent performance.

References
1.
Shaukat Z, Zafar W, Ahmad W, Ul Haq I, Husnain G, Al-Adhaileh M . Revolutionizing Diabetes Diagnosis: Machine Learning Techniques Unleashed. Healthcare (Basel). 2023; 11(21). PMC: 10648466. DOI: 10.3390/healthcare11212864. View

2.
Jiang L, Xia Z, Zhu R, Gong H, Wang J, Li J . Diabetes risk prediction model based on community follow-up data using machine learning. Prev Med Rep. 2023; 35:102358. PMC: 10465943. DOI: 10.1016/j.pmedr.2023.102358. View

3.
Xu Q, Peng Y, Tan J, Zhao W, Yang M, Tian J . Prediction of Atrial Fibrillation in Hospitalized Elderly Patients With Coronary Heart Disease and Type 2 Diabetes Mellitus Using Machine Learning: A Multicenter Retrospective Study. Front Public Health. 2022; 10:842104. PMC: 8931193. DOI: 10.3389/fpubh.2022.842104. View

4.
Biesheuvel C, Vergouwe Y, Steyerberg E, Grobbee D, Moons K . Polytomous logistic regression analysis could be applied more often in diagnostic research. J Clin Epidemiol. 2008; 61(2):125-34. DOI: 10.1016/j.jclinepi.2007.03.002. View

5.
Olisah C, Smith L, Smith M . Diabetes mellitus prediction and diagnosis from a data preprocessing and machine learning perspective. Comput Methods Programs Biomed. 2022; 220:106773. DOI: 10.1016/j.cmpb.2022.106773. View