Performance Analysis of Data Mining Algorithms for Diagnosing COVID-19
Overview
Affiliations
Background: An outbreak of atypical pneumonia termed COVID-19 has widely spread all over the world since the beginning of 2020. In this regard, designing a prediction system for the early detection of COVID-19 is a critical issue in mitigating virus spread. In this study, we have applied selected machine learning techniques to select the best predictive models based on their performance.
Materials And Methods: The data of 435 suspicious cases with COVID-19 which were recorded from the Imam Khomeini Hospital database between May 9, 2020 and December 20, 2020, have been taken into consideration. The Chi-square method was used to determine the most important features in diagnosing the COVID-19; eight selected data mining algorithms including multilayer perceptron (MLP), J-48, Bayesian Net (Bayes Net), logistic regression, K-star, random forest, Ada-boost, and sequential minimal optimization (SMO) were applied in data mining. Finally, the most appropriate diagnostic model for COVID-19 was obtained based on comparing the performance of the selected algorithms.
Results: As the result of using the Chi-square method, 21 variables were identified as the most important diagnostic criteria in COVID-19. The results of evaluating the eight selected data mining algorithms showed that the J-48 with true-positive rate = 0.85, false-positive rate = 0.173, precision = 0.85, recall = 0.85, F-score = 0.85, Matthews Correlation Coefficient = 0.68, and area under the receiver operator characteristics = 0.68, respectively, had the higher performance than the other algorithms.
Conclusion: The results of evaluating the performance criteria showed that the J-48 can be considered as a suitable computational prediction model for diagnosing COVID-19 disease.
Towards Improved XAI-Based Epidemiological Research into the Next Potential Pandemic.
Khalili H, Wimmer M Life (Basel). 2024; 14(7).
PMID: 39063538 PMC: 11278356. DOI: 10.3390/life14070783.
COVID-19 infection inference with graph neural networks.
Song K, Park H, Lee J, Kim A, Jung J Sci Rep. 2023; 13(1):11469.
PMID: 37454206 PMC: 10349841. DOI: 10.1038/s41598-023-38314-3.
Prediction of line heating deformation on sheet metal based on an ISSA-ELM model.
Li L, Qi S, Zhou H, Wang L Sci Rep. 2023; 13(1):1252.
PMID: 36690795 PMC: 9869312. DOI: 10.1038/s41598-023-28538-8.
Supervised Machine Learning Approach to COVID-19 Detection Based on Clinical Data.
Yazdani A, Zahmatkeshan M, Ravangard R, Sharifian R, Shirdeli M Med J Islam Repub Iran. 2022; 36:110.
PMID: 36447543 PMC: 9700415. DOI: 10.47176/mjiri.36.110.