PMID: 39330905

Enhancing the Interpretability of Malaria and Typhoid Diagnosis with Explainable AI and Large Language Models

Abstract

Malaria and typhoid fever are prevalent diseases in tropical regions, and both are exacerbated by unclear protocols, drug resistance, and environmental factors. Prompt and accurate diagnosis is crucial to improving accessibility and reducing mortality rates. Traditional diagnostic methods struggle to capture the complexities of these diseases because their symptoms overlap. Although machine learning (ML) models offer accurate predictions, they operate as "black boxes" whose decision-making processes are not interpretable, making it difficult for healthcare providers to understand how conclusions are reached. This study employs explainable AI (XAI) techniques such as Local Interpretable Model-agnostic Explanations (LIME) and Large Language Models (LLMs) such as GPT to clarify diagnostic results for healthcare workers, building trust and transparency in medical diagnostics by describing which symptoms had the greatest impact on the model's decisions and providing clear, understandable explanations. The models were implemented in Google Colab and Visual Studio Code because of their rich library and extension support. Results showed that the Random Forest (RF) model outperformed the other tested models; important features were identified with the LIME plots, and ChatGPT 3.5 had a comparative advantage over the other LLMs evaluated. The study integrates RF, LIME, and GPT into a mobile app to enhance the interpretability and transparency of the malaria and typhoid diagnosis system. Despite its promising results, the system's performance is constrained by the quality of the dataset. In addition, while LIME and GPT improve transparency, they may complicate real-time deployment because of their computational demands and the need for an internet connection to maintain relevance and accuracy. The findings suggest that AI-driven diagnostic systems can significantly enhance healthcare delivery in resource-limited settings, and future work can explore the applicability of this framework to other medical conditions and datasets.
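The abstract describes a three-stage pipeline (Random Forest classification, LIME feature attribution, GPT-generated explanations) without implementation details. The sketch below is a minimal illustration of how such a pipeline could be wired together; the symptom feature names, placeholder data, model settings, and prompt wording are hypothetical assumptions and not the study's actual dataset or configuration.

```python
# Minimal sketch of an RF + LIME + GPT pipeline of the kind described in the abstract.
# Feature names, data, and prompt wording are illustrative placeholders only.
import numpy as np
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import train_test_split
from lime.lime_tabular import LimeTabularExplainer
from openai import OpenAI  # assumes the openai>=1.0 Python client

# Hypothetical binary symptom matrix: rows are patients, columns are symptoms.
feature_names = ["fever", "headache", "abdominal_pain", "chills", "rose_spots", "sweating"]
class_names = ["malaria", "typhoid"]
X = np.random.randint(0, 2, size=(500, len(feature_names)))
y = np.random.randint(0, 2, size=500)  # placeholder labels for illustration

X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, random_state=42)

# 1) Random Forest classifier (the best-performing model reported in the study).
rf = RandomForestClassifier(n_estimators=200, random_state=42)
rf.fit(X_train, y_train)

# 2) LIME explains one prediction by locally weighting the symptoms.
explainer = LimeTabularExplainer(
    X_train, feature_names=feature_names, class_names=class_names, mode="classification"
)
explanation = explainer.explain_instance(X_test[0], rf.predict_proba, num_features=5)
weights = explanation.as_list()  # list of (feature condition, weight) pairs

# 3) Feed the LIME weights to an LLM to produce a plain-language explanation.
predicted = class_names[rf.predict(X_test[:1])[0]]
prompt = (
    f"A diagnostic model predicted {predicted} for a patient. The most influential "
    f"symptoms and their LIME weights were: {weights}. Explain this result to a "
    "community health worker in two or three plain sentences."
)
client = OpenAI()  # requires OPENAI_API_KEY in the environment
reply = client.chat.completions.create(
    model="gpt-3.5-turbo", messages=[{"role": "user", "content": prompt}]
)
print(reply.choices[0].message.content)
```

Passing the LIME weights directly into the prompt keeps the LLM's narrative grounded in the classifier's actual feature attributions rather than letting it speculate about symptoms.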
