» Articles » PMID: 38887523

Towards Automating Systematic Reviews on Immunization Using an Advanced Natural Language Processing-based Extraction System

Overview
Date 2024 Jun 18
PMID 38887523
Authors
Affiliations
Soon will be listed here.
Abstract

Evidence-informed decision making is based on the premise that the entirety of information on a topic is collected and analyzed. Systematic reviews allow for data from different studies to be rigorously assessed according to PICO principles (population, intervention, control, outcomes). However, conducting a systematic review is generally a slow process that is a significant drain on resources. The fundamental problem is that the current approach to creating a systematic review cannot scale to meet the challenges resulting from the massive body of unstructured evidence. For this reason, the Public Health Agency of Canada has been examining the automation of different stages of evidence synthesis to increase efficiencies. In this article, we present an overview of an initial version of a novel machine learning-based system that is powered by recent advances in natural language processing (NLP), such as BioBERT, with further optimizations completed using a new immunization-specific document database. The resulting optimized NLP model at the core of this system is able to identify and extract PICO-related fields from publications on immunization with an average accuracy of 88% across five classes of text. Functionality is provided through a straightforward web interface.

Citing Articles

COVID-19 vaccine evidence monitoring assisted by artificial Intelligence: An emergency system implemented by the Public Health Agency of Canada to capture and describe the trajectory of evolving pandemic vaccine literature.

Lim S, Hersi M, Krishnan R, Montroy J, Rook B, Farrah K Vaccine X. 2024; 21:100575.

PMID: 39555243 PMC: 11564917. DOI: 10.1016/j.jvacx.2024.100575.

References
1.
Marshall I, Wallace B . Toward systematic review automation: a practical guide to using machine learning tools in research synthesis. Syst Rev. 2019; 8(1):163. PMC: 6621996. DOI: 10.1186/s13643-019-1074-9. View

2.
Nye B, Li J, Patel R, Yang Y, Marshall I, Nenkova A . A Corpus with Multi-Level Annotations of Patients, Interventions and Outcomes to Support Language Processing for Medical Literature. Proc Conf Assoc Comput Linguist Meet. 2018; 2018:197-207. PMC: 6174533. View

3.
Lau J . Editorial: Systematic review automation thematic series. Syst Rev. 2019; 8(1):70. PMC: 6410513. DOI: 10.1186/s13643-019-0974-z. View

4.
Borah R, Brown A, Capers P, Kaiser K . Analysis of the time and workers needed to conduct systematic reviews of medical interventions using data from the PROSPERO registry. BMJ Open. 2017; 7(2):e012545. PMC: 5337708. DOI: 10.1136/bmjopen-2016-012545. View

5.
Jonnalagadda S, Goyal P, Huffman M . Automating data extraction in systematic reviews: a systematic review. Syst Rev. 2015; 4:78. PMC: 4514954. DOI: 10.1186/s13643-015-0066-7. View