
A Hybrid Super Ensemble Learning Model for the Early-stage Prediction of Diabetes Risk

Overview
Publisher Springer
Date 2023 Jan 5
PMID 36602674
Abstract

Diabetes mellitus has become a rapidly growing chronic health problem worldwide, with a noticeable increase in cases over the last two decades. Recent advances in ensemble machine learning methods play an important role in the early detection of diabetes mellitus. These methods are both faster and less costly than traditional methods. This study proposes a new super ensemble learning model to enable early diagnosis of diabetes mellitus. Super learner is a cross-validation-based approach that makes better predictions by combining the prediction results of more than one machine learning algorithm. The proposed super learner model, developed through a case study, consists of four base learners (logistic regression, decision tree, random forest, gradient boosting) and a meta-learner (support vector machine). Three different datasets were used to measure the robustness of the proposed model. Chi-square was identified as the optimal feature selection technique among five candidates, and hyperparameters were tuned with GridSearch. The proposed super learner model achieved the best accuracy in detecting diabetes mellitus compared with the base learners on the early-stage diabetes risk prediction (99.6%), PIMA (92%), and Diabetes 130-US hospitals (98%) datasets, respectively. This study shows that super learner algorithms can be used effectively in the detection of diabetes mellitus, and the high statistical scores obtained support the robustness of the proposed model.
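
The architecture described in the abstract can be approximated with scikit-learn. The sketch below is an illustration under stated assumptions, not the authors' implementation: it uses `StackingClassifier` (stacked generalization with cross-validated base-learner predictions, which is closely related to the super learner idea), synthetic data from `make_classification` in place of the three study datasets, and arbitrary hyperparameter values. Because the chi-square test requires non-negative inputs, features are rescaled with `MinMaxScaler` before `SelectKBest`; that preprocessing step is also an assumption.

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import (
    GradientBoostingClassifier,
    RandomForestClassifier,
    StackingClassifier,
)
from sklearn.feature_selection import SelectKBest, chi2
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import GridSearchCV, train_test_split
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import MinMaxScaler
from sklearn.svm import SVC
from sklearn.tree import DecisionTreeClassifier

# Synthetic stand-in data (assumption): the study's datasets are not used here.
X, y = make_classification(
    n_samples=300, n_features=10, n_informative=5, random_state=0
)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.25, stratify=y, random_state=0
)

# The four base learners named in the abstract; settings are illustrative.
base_learners = [
    ("lr", LogisticRegression(max_iter=1000)),
    ("dt", DecisionTreeClassifier(random_state=0)),
    ("rf", RandomForestClassifier(n_estimators=50, random_state=0)),
    ("gb", GradientBoostingClassifier(n_estimators=50, random_state=0)),
]

# The SVM meta-learner is trained on out-of-fold predictions of the base learners.
stack = StackingClassifier(estimators=base_learners, final_estimator=SVC(), cv=5)

pipeline = Pipeline(
    [
        ("scale", MinMaxScaler()),                 # chi2 needs non-negative features
        ("select", SelectKBest(score_func=chi2)),  # chi-square feature selection
        ("stack", stack),
    ]
)

# Grid search over the number of selected features (candidate values are assumptions).
param_grid = {"select__k": [5, 8]}
search = GridSearchCV(pipeline, param_grid, cv=3, scoring="accuracy")
search.fit(X_train, y_train)

test_accuracy = search.score(X_test, y_test)
print("best k:", search.best_params_["select__k"])
print("held-out accuracy:", round(test_accuracy, 3))
```

Placing feature selection inside the pipeline means it is refit on each training fold during the grid search, which avoids leaking information from validation folds into the selected feature set. Accuracy on synthetic data says nothing about the figures reported in the abstract.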

Citing Articles

New AI explained and validated deep learning approaches to accurately predict diabetes.

Shaheen I, Javaid N, Alrajeh N, Asim Y, Akber S Med Biol Eng Comput. 2025; .

PMID: 40035798 DOI: 10.1007/s11517-025-03338-6.


Type 2 diabetes prediction method based on dual-teacher knowledge distillation and feature enhancement.

Zhao J, Gao H, Sun L, Shi L, Kuang Z, Wang H Sci Rep. 2025; 15(1):133.

PMID: 39747427 PMC: 11696117. DOI: 10.1038/s41598-024-83902-6.


Integrated bagging-RF learning model for diabetes diagnosis in middle-aged and elderly population.

Shi Y, Sun J PeerJ Comput Sci. 2024; 10:e2436.

PMID: 39650520 PMC: 11623014. DOI: 10.7717/peerj-cs.2436.


Analyzing classification and feature selection strategies for diabetes prediction across diverse diabetes datasets.

Kaliappan J, Saravana Kumar I, Sundaravelan S, Anesh T, Rithik R, Singh Y Front Artif Intell. 2024; 7:1421751.

PMID: 39233892 PMC: 11371799. DOI: 10.3389/frai.2024.1421751.


The role of artificial intelligence in disease prediction: using ensemble model to predict disease mellitus.

Du Q, Wang D, Zhang Y Front Med (Lausanne). 2024; 11:1425305.

PMID: 39170045 PMC: 11335546. DOI: 10.3389/fmed.2024.1425305.