A Hybrid Super Ensemble Learning Model for the Early-stage Prediction of Diabetes Risk
Overview
Medical Informatics
Diabetes mellitus has become a rapidly growing chronic health problem worldwide, with a noticeable increase in cases over the last two decades. Recent advances in ensemble machine learning methods play an important role in the early detection of diabetes mellitus. These methods are both faster and less costly than traditional methods. This study proposes a new super ensemble learning model to enable early diagnosis of diabetes mellitus. Super learner is a cross-validation-based approach that improves predictions by combining the prediction results of more than one machine learning algorithm. As the result of a case study, the proposed super learner model was built with four base learners (logistic regression, decision tree, random forest, gradient boosting) and a meta learner (support vector machines). Three different datasets were used to measure the robustness of the proposed model. Chi-square was selected as the optimal feature selection technique among five candidate techniques, and hyper-parameters were tuned with GridSearch. The proposed super learner model achieved the best accuracy in the detection of diabetes mellitus compared to the base learners on the early-stage diabetes risk prediction (99.6%), PIMA (92%), and diabetes 130-US hospitals (98%) datasets, respectively. This study shows that super learner algorithms can be used effectively in the detection of diabetes mellitus. The high and convincing statistical scores also indicate the robustness of the proposed super learner model.
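The architecture described in the abstract can be approximated with a stacking ensemble. The following is a minimal sketch, not the authors' implementation: it assumes scikit-learn, uses a synthetic dataset from `make_classification` in place of the three diabetes datasets, and picks illustrative values for the parameter grid, the number of folds, and the estimator sizes. A `MinMaxScaler` step is added as an assumption, because the chi-square score requires non-negative features.

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import (
    GradientBoostingClassifier,
    RandomForestClassifier,
    StackingClassifier,
)
from sklearn.feature_selection import SelectKBest, chi2
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import GridSearchCV, train_test_split
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import MinMaxScaler
from sklearn.svm import SVC
from sklearn.tree import DecisionTreeClassifier

# Synthetic stand-in data (assumption): the study's datasets are not used here.
X, y = make_classification(
    n_samples=300, n_features=12, n_informative=6, random_state=0
)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.25, stratify=y, random_state=0
)

# The four base learners named in the abstract; sizes are illustrative.
base_learners = [
    ("lr", LogisticRegression(max_iter=1000)),
    ("dt", DecisionTreeClassifier(random_state=0)),
    ("rf", RandomForestClassifier(n_estimators=25, random_state=0)),
    ("gb", GradientBoostingClassifier(n_estimators=25, random_state=0)),
]

# The meta learner (SVM) is trained on cross-validated base-learner predictions.
super_learner = StackingClassifier(
    estimators=base_learners, final_estimator=SVC(), cv=5
)

pipeline = Pipeline(
    [
        ("scale", MinMaxScaler()),  # chi2 needs non-negative inputs
        ("select", SelectKBest(score_func=chi2)),
        ("stack", super_learner),
    ]
)

# Small illustrative grid (assumption): number of kept features and SVM C.
param_grid = {
    "select__k": [6, 12],
    "stack__final_estimator__C": [0.1, 1.0],
}
search = GridSearchCV(pipeline, param_grid, cv=3, scoring="accuracy")
search.fit(X_train, y_train)

test_accuracy = search.score(X_test, y_test)
print(search.best_params_, round(test_accuracy, 3))
```

Placing feature selection inside the pipeline means the chi-square scores are recomputed on each training fold during the grid search, which avoids leaking information from the validation folds. Accuracy on the synthetic data says nothing about the figures reported in the abstract.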