» Articles » PMID: 36609221

Machine Learning to Analyse Omic-data for COVID-19 Diagnosis and Prognosis

Overview
Publisher Biomed Central
Specialty Biology
Date 2023 Jan 7
PMID 36609221
Authors
Affiliations
Soon will be listed here.
Abstract

Background: With the global spread of COVID-19, the world has seen many patients, including many severe cases. The rapid development of machine learning (ML) has made significant disease diagnosis and prediction achievements. Current studies have confirmed that omics data at the host level can reflect the development process and prognosis of the disease. Since early diagnosis and effective treatment of severe COVID-19 patients remains challenging, this research aims to use omics data in different ML models for COVID-19 diagnosis and prognosis. We used several ML models on omics data of a large number of individuals to first predict whether patients are COVID-19 positive or negative, followed by the severity of the disease.

Results: On the COVID-19 diagnosis task, we got the best AUC of 0.99 with our multilayer perceptron model and the highest F1-score of 0.95 with our logistic regression (LR) model. For the severity prediction task, we achieved the highest accuracy of 0.76 with an LR model. Beyond classification and predictive modeling, our study founds ML models performed better on integrated multi-omics data, rather than single omics. By comparing top features from different omics dataset, we also found the robustness of our model, with a wider range of applicability in diverse dataset related to COVID-19. Additionally, we have found that omics-based models performed better than image or physiological feature-based models, proving the importance of the omics-based dataset for future model development.

Conclusions: This study diagnoses COVID-19 positive cases and predicts accurate severity levels. It lowers the dependence on clinical data and professional judgment, by leveraging the utilization of state-of-the-art models. our model showed wider applicability across different omics dataset, which is highly transferable in other respiratory or similar diseases. Hospital and public health care mechanisms can optimize the distribution of medical resources and improve the robustness of the medical system.

Citing Articles

Bioinformatics and molecular biology tools for diagnosis, prevention, treatment and prognosis of COVID-19.

Meira D, Zetum A, Casotti M, Campos da Silva D, de Araujo B, Vicente C Heliyon. 2025; 10(14):e34393.

PMID: 39816364 PMC: 11734128. DOI: 10.1016/j.heliyon.2024.e34393.


Efficient clinical decision-making process via AI-based multimodal data fusion: A COVID-19 case study.

Moris D, de Moura J, Marcos P, Miguez Rey E, Novo J, Ortega M Heliyon. 2024; 10(20):e38642.

PMID: 39640748 PMC: 11619951. DOI: 10.1016/j.heliyon.2024.e38642.


The two-stage molecular scenery of SARS-CoV-2 infection with implications to disease severity: An in-silico quest.

Potamias G, Gkoublia P, Kanterakis A Front Immunol. 2023; 14:1251067.

PMID: 38077337 PMC: 10699200. DOI: 10.3389/fimmu.2023.1251067.


AI and data science for smart emergency, crisis and disaster resilience.

Cao L Int J Data Sci Anal. 2023; 15(3):231-246.

PMID: 37035277 PMC: 10041487. DOI: 10.1007/s41060-023-00393-w.


SARS-CoV-2 Diagnosis Using Transcriptome Data: A Machine Learning Approach.

Jeyananthan P SN Comput Sci. 2023; 4(3):218.

PMID: 36844504 PMC: 9936926. DOI: 10.1007/s42979-023-01703-6.

References
1.
Brinati D, Campagner A, Ferrari D, Locatelli M, Banfi G, Cabitza F . Detection of COVID-19 Infection from Routine Blood Exams with Machine Learning: A Feasibility Study. J Med Syst. 2020; 44(8):135. PMC: 7326624. DOI: 10.1007/s10916-020-01597-4. View

2.
Aktar S, Ahamad M, Rashed-Al-Mahfuz M, Azad A, Uddin S, Kamal A . Machine Learning Approach to Predicting COVID-19 Disease Severity Based on Clinical Blood Test Data: Statistical Analysis and Model Development. JMIR Med Inform. 2021; 9(4):e25884. PMC: 8045777. DOI: 10.2196/25884. View

3.
Shanbehzadeh M, Nopour R, Kazemi-Arpanahi H . Developing an artificial neural network for detecting COVID-19 disease. J Educ Health Promot. 2022; 11:2. PMC: 8893090. DOI: 10.4103/jehp.jehp_387_21. View

4.
Harrell Jr F, Lee K, Mark D . Multivariable prognostic models: issues in developing models, evaluating assumptions and adequacy, and measuring and reducing errors. Stat Med. 1996; 15(4):361-87. DOI: 10.1002/(SICI)1097-0258(19960229)15:4<361::AID-SIM168>3.0.CO;2-4. View

5.
Baillie J . Translational genomics. Targeting the host immune response to fight infection. Science. 2014; 344(6186):807-8. DOI: 10.1126/science.1255074. View