» Articles » PMID: 33764883

Prediction and Feature Importance Analysis for Severity of COVID-19 in South Korea Using Artificial Intelligence: Model Development and Validation

Overview
Publisher JMIR Publications
Date 2021 Mar 25
PMID 33764883
Citations 18
Authors
Affiliations
Soon will be listed here.
Abstract

Background: The number of deaths from COVID-19 continues to surge worldwide. In particular, if a patient's condition is sufficiently severe to require invasive ventilation, it is more likely to lead to death than to recovery.

Objective: The goal of our study was to analyze the factors related to COVID-19 severity in patients and to develop an artificial intelligence (AI) model to predict the severity of COVID-19 at an early stage.

Methods: We developed an AI model that predicts severity based on data from 5601 COVID-19 patients from all national and regional hospitals across South Korea as of April 2020. The clinical severity of COVID-19 was divided into two categories: low and high severity. The condition of patients in the low-severity group corresponded to no limit of activity, oxygen support with nasal prong or facial mask, and noninvasive ventilation. The condition of patients in the high-severity group corresponded to invasive ventilation, multi-organ failure with extracorporeal membrane oxygenation required, and death. For the AI model input, we used 37 variables from the medical records, including basic patient information, a physical index, initial examination findings, clinical findings, comorbid diseases, and general blood test results at an early stage. Feature importance analysis was performed with AdaBoost, random forest, and eXtreme Gradient Boosting (XGBoost); the AI model for predicting COVID-19 severity among patients was developed with a 5-layer deep neural network (DNN) with the 20 most important features, which were selected based on ranked feature importance analysis of 37 features from the comprehensive data set. The selection procedure was performed using sensitivity, specificity, accuracy, balanced accuracy, and area under the curve (AUC).

Results: We found that age was the most important factor for predicting disease severity, followed by lymphocyte level, platelet count, and shortness of breath or dyspnea. Our proposed 5-layer DNN with the 20 most important features provided high sensitivity (90.2%), specificity (90.4%), accuracy (90.4%), balanced accuracy (90.3%), and AUC (0.96).

Conclusions: Our proposed AI model with the selected features was able to predict the severity of COVID-19 accurately. We also made a web application so that anyone can access the model. We believe that sharing the AI model with the public will be helpful in validating and improving its performance.

Citing Articles

A Taxonomy and Archetypes of AI-Based Health Care Services: Qualitative Study.

Blass M, Gimpel H, Karnebogen P J Med Internet Res. 2024; 26:e53986.

PMID: 39602787 PMC: 11635336. DOI: 10.2196/53986.


Correlation between oxygenation function and laboratory indicators in COVID-19 patients based on non-enhanced chest CT images and construction of an artificial intelligence prediction model.

Kong W, Liu Y, Li W, Yang K, Yu L, Jiao G Front Microbiol. 2024; 15:1495432.

PMID: 39569002 PMC: 11576442. DOI: 10.3389/fmicb.2024.1495432.


Development and validation of predictive models for mortality of cases with COVID-19 (Omicron BA.5.2.48 and B.7.14): a retrospective study.

Li P, Yang H, Wu J, Ma Y, Hou A, Chen J BMJ Open. 2024; 14(10):e082616.

PMID: 39384246 PMC: 11474906. DOI: 10.1136/bmjopen-2023-082616.


Application of the Unbalanced Ensemble Algorithm for Prognostic Prediction Outcomes of All-Cause Mortality in Coronary Heart Disease Patients Comorbid with Hypertension.

Zan J, Dong X, Yang H, Yan J, He Z, Tian J Risk Manag Healthc Policy. 2024; 17:1921-1936.

PMID: 39135612 PMC: 11317517. DOI: 10.2147/RMHP.S472398.


Limitations of the Cough Sound-Based COVID-19 Diagnosis Artificial Intelligence Model and its Future Direction: Longitudinal Observation Study.

Kim J, Choi Y, Lee Y, Yeo S, Kim K, Kim M J Med Internet Res. 2024; 26:e51640.

PMID: 38319694 PMC: 10879967. DOI: 10.2196/51640.


References
1.
Wu Z, McGoogan J . Characteristics of and Important Lessons From the Coronavirus Disease 2019 (COVID-19) Outbreak in China: Summary of a Report of 72 314 Cases From the Chinese Center for Disease Control and Prevention. JAMA. 2020; 323(13):1239-1242. DOI: 10.1001/jama.2020.2648. View

2.
Brinati D, Campagner A, Ferrari D, Locatelli M, Banfi G, Cabitza F . Detection of COVID-19 Infection from Routine Blood Exams with Machine Learning: A Feasibility Study. J Med Syst. 2020; 44(8):135. PMC: 7326624. DOI: 10.1007/s10916-020-01597-4. View

3.
Kaminski B, Jakubczyk M, Szufel P . A framework for sensitivity analysis of decision trees. Cent Eur J Oper Res. 2018; 26(1):135-159. PMC: 5767274. DOI: 10.1007/s10100-017-0479-6. View

4.
Zhu J, Shen B, Abbasi A, Hoshmand-Kochi M, Li H, Duong T . Deep transfer learning artificial intelligence accurately stages COVID-19 lung disease severity on portable chest radiographs. PLoS One. 2020; 15(7):e0236621. PMC: 7386587. DOI: 10.1371/journal.pone.0236621. View

5.
Li X, Xu S, Yu M, Wang K, Tao Y, Zhou Y . Risk factors for severity and mortality in adult COVID-19 inpatients in Wuhan. J Allergy Clin Immunol. 2020; 146(1):110-118. PMC: 7152876. DOI: 10.1016/j.jaci.2020.04.006. View