» Articles » PMID: 36306578

Explainable Diabetes Classification Using Hybrid Bayesian-optimized TabNet Architecture

Overview
Journal Comput Biol Med
Publisher Elsevier
Date 2022 Oct 28
PMID 36306578
Authors
Affiliations
Soon will be listed here.
Abstract

Diabetes is a deadly chronic disease that occurs when the pancreas is not able to produce ample insulin or when the body cannot use insulin effectively. If undetected, it may lead to a host of health complications. Hence, accurate and explainable early-stage detection of diabetes is essential for the proper administration of treatment options in leading a healthy and productive life. For this, we developed an interpretable TabNet model tuned via Bayesian optimization (BO). To achieve model-specific interpretability, the attention mechanism of TabNet architecture was used, which offered the local and global model explanations on the influence of the attributes on the outcomes. The model was further explained locally and globally using more robust model-agnostic LIME and SHAP eXplainable Artificial Intelligence (XAI) tools. The proposed model outperformed all benchmarked models by obtaining high accuracy of 92.2% and 99.4% using the Pima Indians diabetes dataset (PIDD) and the early-stage diabetes risk prediction dataset (ESDRPD), respectively. Based on the XAI results, it was clear that the most influential attribute for diabetes classification using PIDD and ESDRPD were Insulin and Polyuria, respectively. The feature importance values registered for insulin was 0.301 (PIDD) and for polyuria 0.206 was registered (ESDRPD). The high accuracy and ancillary interpretability of our objective model is expected to increase end-users trust and confidence in early-stage detection of diabetes.

Citing Articles

Prediction of cancer cell line-specific synergistic drug combinations based on multi-omics data.

Chen J, Han H, Li L, Chen Z, Liu X, Li T PeerJ. 2025; 13:e19078.

PMID: 40028209 PMC: 11869890. DOI: 10.7717/peerj.19078.


Example dependent cost sensitive learning based selective deep ensemble model for customer credit scoring.

Xiao J, Li S, Tian Y, Huang J, Jiang X, Wang S Sci Rep. 2025; 15(1):6000.

PMID: 39966605 PMC: 11836470. DOI: 10.1038/s41598-025-89880-7.


An Ensemble Learning Approach Based on TabNet and Machine Learning Models for Cheating Detection in Educational Tests.

Zhen Y, Zhu X Educ Psychol Meas. 2024; 84(4):780-809.

PMID: 39055097 PMC: 11268385. DOI: 10.1177/00131644231191298.


Machine learning analysis of thermophysical and thermohydraulic properties in ethylene glycol- and glycerol-based SiO nanofluids.

Akilu S, Sharma K, Baheta A, Kanti P, Paramasivam P Sci Rep. 2024; 14(1):14829.

PMID: 38937518 PMC: 11211413. DOI: 10.1038/s41598-024-65411-8.


Personalized venlafaxine dose prediction using artificial intelligence technology: a retrospective analysis based on real-world data.

Liu Y, Yu Z, Ye X, Zhang J, Hao X, Gao F Int J Clin Pharm. 2024; 46(4):926-936.

PMID: 38733475 DOI: 10.1007/s11096-024-01729-7.