» Articles » PMID: 27124593

Natural Language Processing in Oncology: A Review

Overview
Journal JAMA Oncol
Specialty Oncology
Date 2016 Apr 29
PMID 27124593
Citations 121
Authors
Affiliations
Soon will be listed here.
Abstract

Importance: Natural language processing (NLP) has the potential to accelerate translation of cancer treatments from the laboratory to the clinic and will be a powerful tool in the era of personalized medicine. This technology can harvest important clinical variables trapped in the free-text narratives within electronic medical records.

Observations: Natural language processing can be used as a tool for oncological evidence-based research and quality improvement. Oncologists interested in applying NLP for clinical research can play pivotal roles in building NLP systems and, in doing so, contribute to both oncological and clinical NLP research. Herein, we provide an introduction to NLP and its potential applications in oncology, a description of specific tools available, and a review on the state of the current technology with respect to cancer case identification, staging, and outcomes quantification.

Conclusions And Relevance: More automated means of leveraging unstructured data from daily clinical practice is crucial as therapeutic options and access to individual-level health information increase. Research-minded oncologists may push the avenues of evidence-based research by taking advantage of the new technologies available with clinical NLP. As continued progress is made with applying NLP toward oncological research, incremental gains will lead to large impacts, building a cost-effective infrastructure for advancing cancer care.

Citing Articles

Developing and Validating an Automatic Support System for Tumor Coding in Pathology Reports in Spanish.

Villena F, Baez P, Penafiel S, Rojas M, Paredes I, Dunstan J JCO Clin Cancer Inform. 2025; 9:e2400124.

PMID: 39993248 PMC: 11872266. DOI: 10.1200/CCI.24.00124.


The Transformative Potential of Large Language Models in Mining Electronic Health Records Data: Content Analysis.

Wals Zurita A, Miras Del Rio H, Ugarte Ruiz de Aguirre N, Nebrera Navarro C, Rubio Jimenez M, Munoz Carmona D JMIR Med Inform. 2025; 13():e58457.

PMID: 39746191 PMC: 11739723. DOI: 10.2196/58457.


Utilizing a domain-specific large language model for LI-RADS v2018 categorization of free-text MRI reports: a feasibility study.

Matute-Gonzalez M, Darnell A, Comas-Cufi M, Pazo J, Soler A, Saborido B Insights Imaging. 2024; 15(1):280.

PMID: 39576290 PMC: 11584817. DOI: 10.1186/s13244-024-01850-1.


Enhancing Thoracic Surgery with AI: A Review of Current Practices and Emerging Trends.

Aleem M, Khan J, Younes A, Sabbah B, Saleh W, Migliore M Curr Oncol. 2024; 31(10):6232-6244.

PMID: 39451768 PMC: 11506543. DOI: 10.3390/curroncol31100464.


Harnessing Natural Language Processing to Assess Quality of End-of-Life Care for Children With Cancer.

Lindsay M, de Oliveira S, Sciacca K, Lindvall C, Ananth P JCO Clin Cancer Inform. 2024; 8:e2400134.

PMID: 39265122 PMC: 11407740. DOI: 10.1200/CCI.24.00134.