A Fast, Resource Efficient, and Reliable Rule-based System for COVID-19 Symptom Identification
Overview
Authors
Affiliations
Objective: With COVID-19, there was a need for a rapidly scalable annotation system that facilitated real-time integration with clinical decision support systems (CDS). Current annotation systems suffer from a high-resource utilization and poor scalability limiting real-world integration with CDS. A potential solution to mitigate these issues is to use the rule-based gazetteer developed at our institution.
Materials And Methods: Performance, resource utilization, and runtime of the rule-based gazetteer were compared with five annotation systems: BioMedICUS, cTAKES, MetaMap, CLAMP, and MedTagger.
Results: This rule-based gazetteer was the fastest, had a low resource footprint, and similar performance for weighted microaverage and macroaverage measures of precision, recall, and f1-score compared to other annotation systems.
Discussion: Opportunities to increase its performance include fine-tuning lexical rules for symptom identification. Additionally, it could run on multiple compute nodes for faster runtime.
Conclusion: This rule-based gazetteer overcame key technical limitations facilitating real-time symptomatology identification for COVID-19 and integration of unstructured data elements into our CDS. It is ideal for large-scale deployment across a wide variety of healthcare settings for surveillance of acute COVID-19 symptoms for integration into prognostic modeling. Such a system is currently being leveraged for monitoring of postacute sequelae of COVID-19 (PASC) progression in COVID-19 survivors. This study conducted the first in-depth analysis and developed a rule-based gazetteer for COVID-19 symptom extraction with the following key features: low processor and memory utilization, faster runtime, and similar weighted microaverage and macroaverage measures for precision, recall, and f1-score compared to industry-standard annotation systems.
Ameri A, Ameri A, Salmanizadeh F, Bahaadinbeigy K Health Sci Rep. 2024; 7(2):e1919.
PMID: 38384976 PMC: 10879639. DOI: 10.1002/hsr2.1919.
Sun J, Peng L, Li T, Adila D, Zaiman Z, Melton-Meaux G Radiol Artif Intell. 2022; 4(4):e210217.
PMID: 35923381 PMC: 9344211. DOI: 10.1148/ryai.210217.
Wang L, Foer D, MacPhaul E, Lo Y, Bates D, Zhou L J Biomed Inform. 2021; 125:103951.
PMID: 34785382 PMC: 8590503. DOI: 10.1016/j.jbi.2021.103951.