
Text Mining to Support Abstract Screening for Knowledge Syntheses: a Semi-automated Workflow

Overview
Journal Syst Rev
Publisher BioMed Central
Date 2021 May 27
PMID 34039433
Citations 14
Abstract

Background: Current text mining tools supporting abstract screening in systematic reviews are not widely used, in part because they lack sensitivity and precision. We set out to develop an accessible, semi-automated "workflow" to conduct abstract screening for systematic reviews and other knowledge synthesis methods.

Methods: We adopted widely recommended text-mining and machine-learning methods to (1) process title-abstracts into numerical training data and (2) train a classification model to predict eligible abstracts. The predicted abstracts are screened by human reviewers for ("true") eligibility, and the newly eligible abstracts are used to identify similar abstracts with near-neighbor methods, which are then also screened. These abstracts and their eligibility results are used to update the classification model, and the steps above are iterated until no new eligible abstracts are identified. The workflow was implemented in R and evaluated using a systematic review of insulin formulations for type 1 diabetes (14,314 abstracts) and a scoping review of knowledge-synthesis methods (17,200 abstracts). Workflow performance was evaluated against the recommended practice of independent screening of each abstract by 2 reviewers. Standard measures were examined: sensitivity (the proportion of truly eligible abstracts correctly identified), specificity (the proportion of truly ineligible abstracts correctly excluded), precision (the proportion of abstracts screened as eligible that were truly eligible), F1-score (the harmonic mean of sensitivity and precision), and accuracy (the proportion of abstracts correctly classified as eligible or ineligible). Workload reduction was measured as the hours of human screening the workflow saved, given that only a subset of abstracts required human screening.
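The Methods describe an iterative predict-screen-expand loop. The sketch below is a minimal Python/scikit-learn illustration of that loop, not the authors' R implementation; the TF-IDF features, logistic-regression classifier, cosine near-neighbor search, and the helper names (semi_automated_screening, screen_fn, seed_labels) are assumptions introduced only for this example.

import numpy as np
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.linear_model import LogisticRegression
from sklearn.neighbors import NearestNeighbors

def semi_automated_screening(abstracts, screen_fn, seed_labels, n_neighbors=5):
    """abstracts: list of title-abstract strings.
    screen_fn: callable taking an abstract index and returning True/False;
               it stands in for the human reviewers' eligibility decision.
    seed_labels: dict {index: bool} from an initial manually screened sample
                 (must contain at least one eligible and one ineligible abstract).
    """
    # Step 1: process title-abstracts into numerical training data
    # (TF-IDF is used here; the paper's exact feature processing may differ).
    X = TfidfVectorizer(stop_words="english", max_features=20000).fit_transform(abstracts)
    nn = NearestNeighbors(n_neighbors=n_neighbors, metric="cosine").fit(X)
    labels = dict(seed_labels)  # human-verified decisions accumulated so far

    while True:
        # Step 2: train a classification model on everything screened so far.
        screened = sorted(labels)
        clf = LogisticRegression(max_iter=1000).fit(
            X[screened], [labels[i] for i in screened])

        # Step 3: predict eligibility for the abstracts not yet screened.
        unscreened = [i for i in range(len(abstracts)) if i not in labels]
        if not unscreened:
            break
        preds = clf.predict(X[unscreened])
        predicted_eligible = [i for i, p in zip(unscreened, preds) if p]

        # Step 4: human reviewers screen the predicted abstracts for "true" eligibility.
        newly_eligible = []
        for i in predicted_eligible:
            labels[i] = screen_fn(i)
            if labels[i]:
                newly_eligible.append(i)

        # Step 5: find near neighbors of the newly eligible abstracts and screen them too.
        if newly_eligible:
            _, neighbor_idx = nn.kneighbors(X[newly_eligible])
            for i in set(neighbor_idx.ravel().tolist()) - set(labels):
                labels[i] = screen_fn(i)
                if labels[i]:
                    newly_eligible.append(i)

        # Step 6: stop once an iteration yields no new eligible abstracts.
        if not newly_eligible:
            break

    return labels  # index -> human-verified eligibility for every screened abstract

In practice, screen_fn would be replaced by exporting the predicted-eligible and near-neighbor records to the review team and re-importing their decisions before the next iteration.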

Results: For the systematic review and the scoping review, respectively, the workflow attained 88%/89% sensitivity, 99%/99% specificity, 71%/72% precision, a 79%/79% F1-score, 98%/97% accuracy, and a 63%/55% workload reduction, with 12%/11% fewer abstracts requiring full-text retrieval and screening and 0%/1.5% of studies missed relative to the completed reviews.
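As an arithmetic check (using the rounded percentages above, so small discrepancies are possible), the reported F1-score follows from the sensitivity $S$ and precision $P$ for the systematic review:

$$F_1 = \frac{2\,P\,S}{P + S} = \frac{2 \times 0.71 \times 0.88}{0.71 + 0.88} \approx 0.79,$$

consistent with the reported 79%.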

Conclusion: The workflow was a sensitive, precise, and efficient alternative to the recommended practice of screening abstracts with 2 reviewers. All eligible studies were identified in the first case, while 6 studies (1.5%) were missed in the second; these would likely not have affected the review's conclusions. We have described the workflow in language accessible to reviewers with limited exposure to natural language processing and machine learning, and have made the code available to reviewers.

Citing Articles

Trial characteristics, geographic distribution, and selected methodological issues of 1425 infertility trials published from 2012 to 2023: a systematic review.

Feng Q, Li W, Crispin J, Longobardi S, D'Hooghe T, Mol B. Hum Reprod Open. 2025; 2025(1):hoaf004.

PMID: 39980657 PMC: 11842059. DOI: 10.1093/hropen/hoaf004.


Human-Comparable Sensitivity of Large Language Models in Identifying Eligible Studies Through Title and Abstract Screening: 3-Layer Strategy Using GPT-3.5 and GPT-4 for Systematic Reviews.

Matsui K, Utsumi T, Aoki Y, Maruki T, Takeshima M, Takaesu Y. J Med Internet Res. 2024; 26:e52758.

PMID: 39151163 PMC: 11364944. DOI: 10.2196/52758.


Machine learning enables automated screening for systematic reviews and meta-analysis in urology.

Menold H, Wieland V, Haney C, Uysal D, Wessels F, Cacciamani G. World J Urol. 2024; 42(1):396.

PMID: 38985296 PMC: 11236840. DOI: 10.1007/s00345-024-05078-y.


Automation of systematic reviews of biomedical literature: a scoping review of studies indexed in PubMed.

Toth B, Berek L, Gulacsi L, Pentek M, Zrubka Z. Syst Rev. 2024; 13(1):174.

PMID: 38978132 PMC: 11229257. DOI: 10.1186/s13643-024-02592-3.


Title and abstract screening for literature reviews using large language models: an exploratory study in the biomedical domain.

Dennstadt F, Zink J, Putora P, Hastings J, Cihoric N. Syst Rev. 2024; 13(1):158.

PMID: 38879534 PMC: 11180407. DOI: 10.1186/s13643-024-02575-4.

