
Automated Confidence Ranked Classification of Randomized Controlled Trial Articles: an Aid to Evidence-based Medicine

Overview
Date 2015 Feb 7
PMID 25656516
Citations 23
Abstract

Objective: For many literature review tasks, including systematic review (SR) and other aspects of evidence-based medicine, it is important to know whether an article describes a randomized controlled trial (RCT). Current manual annotation is not complete or flexible enough for the SR process. In this work, highly accurate machine learning predictive models were built that include confidence predictions of whether an article is an RCT.

Materials and Methods: The LibSVM classifier was used with forward selection of potential feature sets on a large human-related subset of MEDLINE to create a classification model requiring only the citation, abstract, and MeSH terms for each article.
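As an illustration of forward selection over feature sets, the following is a minimal sketch and not the authors' implementation. The feature-set names, the `toy_score` function, and its numbers are hypothetical stand-ins; in practice the score would come from training and validating a classifier (for example an SVM) on each candidate combination.

```python
from typing import Callable, FrozenSet, List, Sequence, Tuple


def forward_select(
    candidates: Sequence[str],
    score: Callable[[FrozenSet[str]], float],
    min_gain: float = 0.0,
) -> Tuple[List[str], float]:
    """Greedy forward selection: repeatedly add the feature set that
    improves the score the most, stopping when no addition gains more
    than min_gain."""
    selected: List[str] = []
    remaining = list(candidates)
    best = score(frozenset())
    while remaining:
        # Score every one-step extension of the current selection.
        trials = [(score(frozenset(selected + [c])), c) for c in remaining]
        top_score, top = max(trials, key=lambda t: t[0])
        if top_score - best <= min_gain:
            break  # no candidate improves the score enough
        selected.append(top)
        remaining.remove(top)
        best = top_score
    return selected, best


# Hypothetical scoring function (integer per-mille units, made-up values).
# A real pipeline would return a validation metric from a trained model.
TOY_GAINS = {"title": 50, "abstract": 100, "mesh": 80, "journal": 0}


def toy_score(features: FrozenSet[str]) -> int:
    return 700 + sum(TOY_GAINS[f] for f in features)


selected, best = forward_select(list(TOY_GAINS), toy_score)
print(selected, best)
```

Greedy selection evaluates only one-step extensions at each round, so it is far cheaper than trying every subset, at the cost of possibly missing combinations that only help jointly.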

Results: The model achieved an area under the receiver operating characteristic curve of 0.973 and mean squared error of 0.013 on the held-out year 2011 data. Accurate confidence estimates were confirmed on a manually reviewed set of test articles. A second model not requiring MeSH terms was also created, and performed almost as well.
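The two reported metrics can be computed from per-article confidence scores roughly as follows. This is a minimal sketch under stated assumptions: the labels and scores below are invented toy values, not data from the study, and the function names are illustrative. The area under the ROC curve is computed through its rank interpretation (the probability that a randomly chosen RCT scores higher than a randomly chosen non-RCT), and the mean squared error compares each confidence with the 0/1 label.

```python
from typing import Sequence


def roc_auc(labels: Sequence[int], scores: Sequence[float]) -> float:
    """AUC via pairwise comparison: fraction of (positive, negative)
    pairs in which the positive has the higher score; ties count half."""
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative")
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))


def mean_squared_error(labels: Sequence[int], scores: Sequence[float]) -> float:
    """Average squared gap between predicted confidence and the 0/1 label."""
    return sum((s - y) ** 2 for y, s in zip(labels, scores)) / len(labels)


# Toy example (hypothetical values): 1 = RCT, 0 = not an RCT.
labels = [1, 1, 0, 0, 0]
scores = [0.9, 0.4, 0.5, 0.1, 0.0]

auc = roc_auc(labels, scores)
mse = mean_squared_error(labels, scores)
print(f"AUC = {auc:.3f}, MSE = {mse:.3f}")
```

The AUC reflects only how well articles are ranked, while the mean squared error also penalizes confidences that are poorly calibrated, which is why reporting both is informative for a confidence-valued tagger. The pairwise loop is quadratic and intended for clarity, not for MEDLINE-scale data.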

Discussion: Both models accurately rank and predict article RCT confidence. Using the model and the manually reviewed samples, it is estimated that about 8000 (3%) additional RCTs can be identified in MEDLINE, and that 5% of articles tagged as RCTs in MEDLINE may not be identified.

Conclusion: Retagging human-related studies with a continuously valued RCT confidence is potentially more useful for article ranking and review than a simple yes/no prediction. The automated RCT tagging tool should offer significant savings of time and effort during the process of writing SRs, and is a key component of a multistep text mining pipeline that we are building to streamline SR workflow. In addition, the model may be useful for identifying errors in MEDLINE publication types. The RCT confidence predictions described here have been made available to users as a web service with a user query form front end at: http://arrowsmith.psych.uic.edu/cgi-bin/arrowsmith_uic/RCT_Tagger.cgi.

Citing Articles

COVID-19-related research data availability and quality according to the FAIR principles: A meta-research study.

Sofi-Mahmudi A, Raittio E, Khazaei Y, Ashraf J, Schwendicke F, Uribe S PLoS One. 2024; 19(11):e0313991.

PMID: 39556553 PMC: 11573139. DOI: 10.1371/journal.pone.0313991.


Automation of systematic reviews of biomedical literature: a scoping review of studies indexed in PubMed.

Toth B, Berek L, Gulacsi L, Pentek M, Zrubka Z Syst Rev. 2024; 13(1):174.

PMID: 38978132 PMC: 11229257. DOI: 10.1186/s13643-024-02592-3.


How to optimize the systematic review process using AI tools.

Fabiano N, Gupta A, Bhambra N, Luu B, Wong S, Maaz M JCPP Adv. 2024; 4(2):e12234.

PMID: 38827982 PMC: 11143948. DOI: 10.1002/jcv2.12234.


Insights into the nutritional prevention of macular degeneration based on a comparative topic modeling approach.

Jacaruso L PeerJ Comput Sci. 2024; 10:e1940.

PMID: 38660183 PMC: 11042009. DOI: 10.7717/peerj-cs.1940.


Bat4RCT: A suite of benchmark data and baseline methods for text classification of randomized controlled trials.

Kim J, Kim J, Lee A, Kim J PLoS One. 2023; 18(3):e0283342.

PMID: 36961852 PMC: 10038262. DOI: 10.1371/journal.pone.0283342.

