
Automated Extraction of Patient-Centered Outcomes After Breast Cancer Treatment: An Open-Source Large Language Model-Based Toolkit

Overview
Date 2024 Aug 21
PMID 39167746
Abstract

Purpose: Patient-centered outcomes (PCOs) are pivotal in cancer treatment, as they directly reflect patients' quality of life. Although multiple studies suggest that breast cancer-related morbidity and survival are influenced by treatment side effects and adherence to long-term treatment, such data are generally available only on a small scale or from a single center. The primary challenge in collecting these data is that the outcomes are captured as free text in clinical narratives written by clinicians.

Materials and Methods: Given the complexity of PCO documentation in these narratives, computerized methods are necessary to unlock the wealth of information buried in the unstructured text notes that often document PCOs. Inspired by the success of large language models (LLMs), we examined the adaptability of three LLMs (GPT-2, BioGPT, and PMC-LLaMA) on PCO tasks across three institutions (Mayo Clinic, Emory University Hospital, and Stanford University). We developed an open-source framework for fine-tuning LLMs that can directly extract five categories of PCOs from clinic notes.
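One way to picture the fine-tuning setup described above is to serialize each annotated note and its labels into a single training string for a causal language model. The sketch below is illustrative only and is not the toolkit's actual code: the abstract does not name the five PCO categories, so `PCO_CATEGORIES` holds placeholders, and `LabeledNote`, `to_training_text`, and the "Note:/Outcomes:" layout are hypothetical. The only fixed detail is `<|endoftext|>`, which is GPT-2's end-of-text token.

```python
from dataclasses import dataclass, field

# Placeholder labels: the abstract states there are five PCO categories
# but does not name them, so these names are assumptions.
PCO_CATEGORIES = ["category_1", "category_2", "category_3", "category_4", "category_5"]


@dataclass
class LabeledNote:
    """A clinic note with per-category annotations (hypothetical structure)."""
    text: str
    labels: dict = field(default_factory=dict)  # category name -> bool


def to_training_text(note: LabeledNote, eos: str = "<|endoftext|>") -> str:
    """Serialize a note and its labels into one string for causal-LM fine-tuning.

    Categories missing from `note.labels` are treated as not documented.
    """
    answers = "; ".join(
        f"{cat}: {'yes' if note.labels.get(cat, False) else 'no'}"
        for cat in PCO_CATEGORIES
    )
    return f"Note: {note.text.strip()}\nOutcomes: {answers}{eos}"


# Synthetic example note, not taken from any dataset.
example = LabeledNote("  Patient reports fatigue.  ", {"category_2": True})
print(to_training_text(example))
```

At inference time, a model fine-tuned on such strings would be given everything up to "Outcomes:" and asked to generate the remainder, which is then parsed back into per-category labels.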

Results: We found that these LLMs without fine-tuning (zero-shot) struggle with challenging PCO extraction tasks, displaying almost random performance, even when given some task-specific examples (few-shot learning). Our fine-tuned, task-specific models perform notably better than their non-fine-tuned counterparts. Moreover, the fine-tuned GPT-2 model performed significantly better than the other two, larger LLMs.
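The difference between the zero-shot and few-shot settings mentioned above comes down to whether worked examples are placed in the prompt ahead of the note being classified. The following sketch shows that distinction under assumed conventions: the abstract does not give the prompts used in the study, so the instruction wording, the `build_prompt` helper, and the "Note:/Answer:" layout are all hypothetical.

```python
from typing import Iterable, Tuple


def build_prompt(
    note: str,
    category: str,
    examples: Iterable[Tuple[str, str]] = (),
) -> str:
    """Build a yes/no extraction prompt for one PCO category.

    With no examples this is a zero-shot prompt; with (note, answer)
    pairs in `examples` it becomes a few-shot prompt.
    """
    instruction = (
        f"Does the following clinic text document the outcome '{category}'? "
        "Answer yes or no."
    )
    parts = [instruction]
    for ex_note, ex_answer in examples:
        parts.append(f"Note: {ex_note}\nAnswer: {ex_answer}")
    # The final block is left open so the model completes the answer.
    parts.append(f"Note: {note}\nAnswer:")
    return "\n\n".join(parts)


# Synthetic notes for illustration only.
zero_shot = build_prompt("Patient denies new symptoms.", "category_1")
few_shot = build_prompt(
    "Patient denies new symptoms.",
    "category_1",
    examples=[("Reports daily fatigue.", "yes"), ("Doing well overall.", "no")],
)
print(few_shot)
```

Fine-tuning, by contrast, updates the model weights on labeled notes rather than relying on examples supplied in the prompt.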

Conclusion: Our findings indicate that although LLMs serve as effective general-purpose models for tasks across various domains, they require fine-tuning when applied to the clinical domain. Our proposed approach has the potential to lead to more efficient, adaptable models for PCO information extraction, reducing reliance on extensive computational resources while still delivering superior performance on specific tasks.

Citing Articles

Large language models in cancer: potentials, risks, and safeguards.

Zitu M, Le T, Duong T, Haddadan S, Garcia M, Amorrortu R BJR Artif Intell. 2025; 2(1):ubae019.

PMID: 39777117 PMC: 11703354. DOI: 10.1093/bjrai/ubae019.
