Integrating Natural Language Processing and Machine Learning Algorithms to Categorize Oncologic Response in Radiology Reports

Overview

Journal J Digit Imaging

Publisher Springer

Specialties Medical Informatics
Radiology

Date 2017 Oct 29

PMID 29079959

Citations 30

Authors

Po-Hao Chen

Hanna Zafar

Maya Galperin-Aizenberg

Tessa Cook

Affiliations

Soon will be listed here.

Abstract

A significant volume of medical data remains unstructured. Natural language processing (NLP) and machine learning (ML) techniques have shown to successfully extract insights from radiology reports. However, the codependent effects of NLP and ML in this context have not been well-studied. Between April 1, 2015 and November 1, 2016, 9418 cross-sectional abdomen/pelvis CT and MR examinations containing our internal structured reporting element for cancer were separated into four categories: Progression, Stable Disease, Improvement, or No Cancer. We combined each of three NLP techniques with five ML algorithms to predict the assigned label using the unstructured report text and compared the performance of each combination. The three NLP algorithms included term frequency-inverse document frequency (TF-IDF), term frequency weighting (TF), and 16-bit feature hashing. The ML algorithms included logistic regression (LR), random decision forest (RDF), one-vs-all support vector machine (SVM), one-vs-all Bayes point machine (BPM), and fully connected neural network (NN). The best-performing NLP model consisted of tokenized unigrams and bigrams with TF-IDF. Increasing N-gram length yielded little to no added benefit for most ML algorithms. With all parameters optimized, SVM had the best performance on the test dataset, with 90.6 average accuracy and F score of 0.813. The interplay between ML and NLP algorithms and their effect on interpretation accuracy is complex. The best accuracy is achieved when both algorithms are optimized concurrently.

Citing Articles

[Transformation of free-text radiology reports into structured data].

Graf M, Bressem K, Adams L Radiologie (Heidelb). 2025; .

PMID: 39934245 DOI: 10.1007/s00117-025-01422-4.

Artificial Intelligence Applications in Lymphoma Diagnosis and Management: Opportunities, Challenges, and Future Directions.

Shen M, Jiang Z J Multidiscip Healthc. 2024; 17:5329-5339.

PMID: 39582879 PMC: 11583773. DOI: 10.2147/JMDH.S485724.

BERT-based natural language processing analysis of French CT reports: Application to the measurement of the positivity rate for pulmonary embolism.

Jupin-Delevaux E, Djahnine A, Talbot F, Richard A, Gouttard S, Mansuy A Res Diagn Interv Imaging. 2024; 6:100027.

PMID: 39077547 PMC: 11265488. DOI: 10.1016/j.redii.2023.100027.

ESR paper on structured reporting in radiology-update 2023.

Insights Imaging. 2023; 14(1):199.

PMID: 37995019 PMC: 10667169. DOI: 10.1186/s13244-023-01560-0.

Artificial Intelligence to Improve Patient Understanding of Radiology Reports.

Amin K, Khosla P, Doshi R, Chheang S, Forman H Yale J Biol Med. 2023; 96(3):407-417.

PMID: 37780992 PMC: 10524809. DOI: 10.59249/NKOY5498.

References

Horng S, Sontag D, Halpern Y, Jernite Y, Shapiro N, Nathanson L . Creating an automated trigger for sepsis clinical decision support at emergency department triage using machine learning. PLoS One. 2017; 12(4):e0174708. PMC: 5383046. DOI: 10.1371/journal.pone.0174708. View

Therasse P, Arbuck S, Eisenhauer E, Wanders J, Kaplan R, Rubinstein L . New guidelines to evaluate the response to treatment in solid tumors. European Organization for Research and Treatment of Cancer, National Cancer Institute of the United States, National Cancer Institute of Canada. J Natl Cancer Inst. 2000; 92(3):205-16. DOI: 10.1093/jnci/92.3.205. View

Wei W, Marmor R, Singh S, Wang S, Demner-Fushman D, Kuo T . Finding Related Publications: Extending the Set of Terms Used to Assess Article Similarity. AMIA Jt Summits Transl Sci Proc. 2016; 2016:225-34. PMC: 5001748. View

Schwartz L, Panicek D, Berk A, Li Y, Hricak H . Improving communication of diagnostic radiology findings through structured reporting. Radiology. 2011; 260(1):174-81. PMC: 3121011. DOI: 10.1148/radiol.11101913. View

Lipton Z, Elkan C, Naryanaswamy B . Optimal Thresholding of Classifiers to Maximize F1 Measure. Mach Learn Knowl Discov Databases. 2015; 8725:225-239. PMC: 4442797. DOI: 10.1007/978-3-662-44851-9_15. View

Hassanpour S, Langlotz C . Unsupervised Topic Modeling in a Large Free Text Radiology Report Repository. J Digit Imaging. 2015; 29(1):59-62. PMC: 4722022. DOI: 10.1007/s10278-015-9823-3. View

Liu X, Song M, Tao D, Liu Z, Zhang L, Chen C . Random forest construction with robust semisupervised node splitting. IEEE Trans Image Process. 2014; 24(1):471-83. DOI: 10.1109/TIP.2014.2378017. View

Hripcsak G, Rothschild A . Agreement, the f-measure, and reliability in information retrieval. J Am Med Inform Assoc. 2005; 12(3):296-8. PMC: 1090460. DOI: 10.1197/jamia.M1733. View

Kocbek S, Cavedon L, Martinez D, Bain C, Manus C, Haffari G . Text mining electronic hospital records to automatically classify admissions against disease: Measuring the impact of linking data sources. J Biomed Inform. 2016; 64:158-167. DOI: 10.1016/j.jbi.2016.10.008. View

10.

Hassanpour S, Langlotz C . Information extraction from multi-institutional radiology reports. Artif Intell Med. 2015; 66:29-39. PMC: 5221793. DOI: 10.1016/j.artmed.2015.09.007. View

11.

Morid M, Fiszman M, Raja K, Jonnalagadda S, Del Fiol G . Classification of clinically useful sentences in clinical evidence resources. J Biomed Inform. 2016; 60:14-22. PMC: 4836984. DOI: 10.1016/j.jbi.2016.01.003. View

12.

Yim W, Yetisgen M, Harris W, Kwan S . Natural Language Processing in Oncology: A Review. JAMA Oncol. 2016; 2(6):797-804. DOI: 10.1001/jamaoncol.2016.0213. View

13.

Lakhani P, Kim W, Langlotz C . Automated detection of critical results in radiology reports. J Digit Imaging. 2011; 25(1):30-6. PMC: 3264731. DOI: 10.1007/s10278-011-9426-6. View

14.

Polak S, Mendyk A . Artificial neural networks as an engine of Internet based hypertension prediction tool. Stud Health Technol Inform. 2005; 103:61-9. View

15.

Wang J, Zhang J, An Y, Lin H, Yang Z, Zhang Y . Biomedical event trigger detection by dependency-based word embedding. BMC Med Genomics. 2016; 9 Suppl 2:45. PMC: 4980775. DOI: 10.1186/s12920-016-0203-8. View

16.

Tong W, Xie Q, Hong H, Shi L, Fang H, Perkins R . Using decision forest to classify prostate cancer samples on the basis of SELDI-TOF MS data: assessing chance correlation and prediction confidence. Environ Health Perspect. 2004; 112(16):1622-7. PMC: 1247659. DOI: 10.1289/txg.7109. View

17.

Zafar H, Chadalavada S, Kahn Jr C, Cook T, Sloan C, Lalevic D . Code Abdomen: An Assessment Coding Scheme for Abdominal Imaging Findings Possibly Representing Cancer. J Am Coll Radiol. 2015; 12(9):947-50. PMC: 4852851. DOI: 10.1016/j.jacr.2015.04.005. View

18.

Lakhani P, Kim W, Langlotz C . Automated extraction of critical test values and communications from unstructured radiology reports: an analysis of 9.3 million reports from 1990 to 2011. Radiology. 2012; 265(3):809-18. DOI: 10.1148/radiol.12112438. View

19.

Cai T, Giannopoulos A, Yu S, Kelil T, Ripley B, Kumamaru K . Natural Language Processing Technologies in Radiology Research and Clinical Applications. Radiographics. 2016; 36(1):176-91. PMC: 4734053. DOI: 10.1148/rg.2016150080. View