» Articles » PMID: 35071611

Performance Analysis of Data Mining Algorithms for Diagnosing COVID-19

Overview
Specialty Medical Education
Date 2022 Jan 24
PMID 35071611
Authors
Affiliations
Soon will be listed here.
Abstract

Background: An outbreak of atypical pneumonia termed COVID-19 has widely spread all over the world since the beginning of 2020. In this regard, designing a prediction system for the early detection of COVID-19 is a critical issue in mitigating virus spread. In this study, we have applied selected machine learning techniques to select the best predictive models based on their performance.

Materials And Methods: The data of 435 suspicious cases with COVID-19 which were recorded from the Imam Khomeini Hospital database between May 9, 2020 and December 20, 2020, have been taken into consideration. The Chi-square method was used to determine the most important features in diagnosing the COVID-19; eight selected data mining algorithms including multilayer perceptron (MLP), J-48, Bayesian Net (Bayes Net), logistic regression, K-star, random forest, Ada-boost, and sequential minimal optimization (SMO) were applied in data mining. Finally, the most appropriate diagnostic model for COVID-19 was obtained based on comparing the performance of the selected algorithms.

Results: As the result of using the Chi-square method, 21 variables were identified as the most important diagnostic criteria in COVID-19. The results of evaluating the eight selected data mining algorithms showed that the J-48 with true-positive rate = 0.85, false-positive rate = 0.173, precision = 0.85, recall = 0.85, F-score = 0.85, Matthews Correlation Coefficient = 0.68, and area under the receiver operator characteristics = 0.68, respectively, had the higher performance than the other algorithms.

Conclusion: The results of evaluating the performance criteria showed that the J-48 can be considered as a suitable computational prediction model for diagnosing COVID-19 disease.

Citing Articles

Towards Improved XAI-Based Epidemiological Research into the Next Potential Pandemic.

Khalili H, Wimmer M Life (Basel). 2024; 14(7).

PMID: 39063538 PMC: 11278356. DOI: 10.3390/life14070783.


COVID-19 infection inference with graph neural networks.

Song K, Park H, Lee J, Kim A, Jung J Sci Rep. 2023; 13(1):11469.

PMID: 37454206 PMC: 10349841. DOI: 10.1038/s41598-023-38314-3.


Prediction of line heating deformation on sheet metal based on an ISSA-ELM model.

Li L, Qi S, Zhou H, Wang L Sci Rep. 2023; 13(1):1252.

PMID: 36690795 PMC: 9869312. DOI: 10.1038/s41598-023-28538-8.


Supervised Machine Learning Approach to COVID-19 Detection Based on Clinical Data.

Yazdani A, Zahmatkeshan M, Ravangard R, Sharifian R, Shirdeli M Med J Islam Repub Iran. 2022; 36:110.

PMID: 36447543 PMC: 9700415. DOI: 10.47176/mjiri.36.110.

References
1.
Torrealba-Rodriguez O, Conde-Gutierrez R, Hernandez-Javier A . Modeling and prediction of COVID-19 in Mexico applying mathematical and computational models. Chaos Solitons Fractals. 2020; 138:109946. PMC: 7256618. DOI: 10.1016/j.chaos.2020.109946. View

2.
Gao Y, Cai G, Fang W, Li H, Wang S, Chen L . Machine learning based early warning system enables accurate mortality risk prediction for COVID-19. Nat Commun. 2020; 11(1):5033. PMC: 7538910. DOI: 10.1038/s41467-020-18684-2. View

3.
Shereen M, Khan S, Kazmi A, Bashir N, Siddique R . COVID-19 infection: Origin, transmission, and characteristics of human coronaviruses. J Adv Res. 2020; 24:91-98. PMC: 7113610. DOI: 10.1016/j.jare.2020.03.005. View

4.
Shaban W, Rabie A, Saleh A, Abo-Elsoud M . A new COVID-19 Patients Detection Strategy (CPDS) based on hybrid feature selection and enhanced KNN classifier. Knowl Based Syst. 2020; 205:106270. PMC: 7368426. DOI: 10.1016/j.knosys.2020.106270. View

5.
Zhao Z, Chen A, Hou W, Graham J, Li H, Richman P . Prediction model and risk scores of ICU admission and mortality in COVID-19. PLoS One. 2020; 15(7):e0236618. PMC: 7392248. DOI: 10.1371/journal.pone.0236618. View