» Articles » PMID: 38242107

Novel Graph-based Machine-learning Technique for Viral Infectious Diseases: Application to Influenza and Hepatitis Diseases

Overview
Journal Ann Med
Publisher Informa Healthcare
Specialty General Medicine
Date 2024 Jan 19
PMID 38242107
Authors
Affiliations
Soon will be listed here.
Abstract

Background: Most infectious diseases are caused by viruses, fungi, bacteria and parasites. Their ability to easily infect humans and trigger large-scale epidemics makes them a public health concern. Methods for early detection of these diseases have been developed; however, they are hindered by the absence of a unified, interoperable and reusable model. This study seeks to create a holistic and real-time model for swift, preliminary detection of infectious diseases using symptoms and additional clinical data.

Materials And Methods: In this study, we present a medical knowledge graph (MKG) that leverages multiple data sources to analyse connections between different nodes. Medical ontologies were used to enhance the MKG. We applied various graph algorithms to extract key features. The performance of multiple machine-learning (ML) techniques for influenza and hepatitis detection was assessed, selecting multi-layer perceptron (MLP) and random forest (RF) models due to their superior outcomes. The hyperparameters of both graph-based ML models were automatically fine-tuned.

Results: Both the graph-based MLP and RF models showcased the least loss and error rates, along with the most specific, accurate recall, precision and 1 scores. Their Matthews correlation coefficients were also optimal. When compared with existing ML techniques and findings from the literature, these graph-based ML models manifested superior detection accuracy.

Conclusions: The graph-based MLP and RF models effectively diagnosed influenza and hepatitis, respectively. This underlines the potential of graph data science in enhancing ML model performance and uncovering concealed relationships in the MKG.

Citing Articles

Distinguishable topology of the task-evoked functional genome networks in HIV-1 reservoirs.

Wisniewski J, Wiecek K, Ali H, Pyrc K, Kula-Pacurar A, Wagner M iScience. 2024; 27(11):111222.

PMID: 39559761 PMC: 11570469. DOI: 10.1016/j.isci.2024.111222.


Generative AI-based knowledge graphs for the illustration and development of mHealth self-management content.

Blanchard M, Venerito V, Ming Azevedo P, Hugle T Front Digit Health. 2024; 6:1466211.

PMID: 39434919 PMC: 11491428. DOI: 10.3389/fdgth.2024.1466211.

References
1.
Mosharaf M, Reza M, Kibria M, Ahmed F, Kabir M, Hasan S . Computational identification of host genomic biomarkers highlighting their functions, pathways and regulators that influence SARS-CoV-2 infections and drug repurposing. Sci Rep. 2022; 12(1):4279. PMC: 8915158. DOI: 10.1038/s41598-022-08073-8. View

2.
Chen Q, Allot A, Lu Z . LitCovid: an open database of COVID-19 literature. Nucleic Acids Res. 2020; 49(D1):D1534-D1540. PMC: 7778958. DOI: 10.1093/nar/gkaa952. View

3.
Dai S, Han L . Influenza surveillance with Baidu index and attention-based long short-term memory model. PLoS One. 2023; 18(1):e0280834. PMC: 9870163. DOI: 10.1371/journal.pone.0280834. View

4.
Weng C, Chen L, Lin C, Chen H, Lee H, Ling T . Association between the risk of lung cancer and influenza: A population-based nested case-control study. Int J Infect Dis. 2019; 88:8-13. DOI: 10.1016/j.ijid.2019.07.030. View

5.
Apweiler R, Bairoch A, Wu C, Barker W, Boeckmann B, Ferro S . UniProt: the Universal Protein knowledgebase. Nucleic Acids Res. 2003; 32(Database issue):D115-9. PMC: 308865. DOI: 10.1093/nar/gkh131. View