» Articles » PMID: 30914179

A Systematic Review of Natural Language Processing and Text Mining of Symptoms from Electronic Patient-authored Text Data

Overview
Date 2019 Mar 28
PMID 30914179
Citations 56
Authors
Affiliations
Soon will be listed here.
Abstract

Objective: In this systematic review, we aim to synthesize the literature on the use of natural language processing (NLP) and text mining as they apply to symptom extraction and processing in electronic patient-authored text (ePAT).

Materials And Methods: A comprehensive literature search of 1964 articles from PubMed and EMBASE was narrowed to 21 eligible articles. Data related to purpose, text source, number of users and/or posts, evaluation metrics, and quality indicators were recorded.

Results: Pain (n = 18) and fatigue and sleep disturbance (n = 18) were the most frequently evaluated symptom clinical content categories. Studies accessed ePAT from sources such as Twitter and online community forums or patient portals focused on diseases, including diabetes, cancer, and depression. Fifteen studies used NLP as a primary methodology. Studies reported evaluation metrics including the precision, recall, and F-measure for symptom-specific research questions.

Discussion: NLP and text mining have been used to extract and analyze patient-authored symptom data in a wide variety of online communities. Though there are computational challenges with accessing ePAT, the depth of information provided directly from patients offers new horizons for precision medicine, characterization of sub-clinical symptoms, and the creation of personal health libraries as outlined by the National Library of Medicine.

Conclusion: Future research should consider the needs of patients expressed through ePAT and its relevance to symptom science. Understanding the role that ePAT plays in health communication and real-time assessment of symptoms, through the use of NLP and text mining, is critical to a patient-centered health system.

Citing Articles

Government plans in the 2016 and 2021 Peruvian presidential elections: A natural language processing analysis of the health chapters.

Carrillo-Larco R, Castillo-Cara M, Lovon-Melgarejo J Wellcome Open Res. 2025; 6:177.

PMID: 39931661 PMC: 11809155. DOI: 10.12688/wellcomeopenres.16867.3.


The Impact of Collaborative Documentation on Person-Centered Care: Textual Analysis of Clinical Notes.

Stanhope V, Yoo N, Matthews E, Baslock D, Hu Y JMIR Med Inform. 2024; 12:e52678.

PMID: 39302636 PMC: 11429664. DOI: 10.2196/52678.


Exploring a method for extracting concerns of multiple breast cancer patients in the domain of patient narratives using BERT and its optimization by domain adaptation using masked language modeling.

Watabe S, Watanabe T, Yada S, Aramaki E, Yajima H, Kizaki H PLoS One. 2024; 19(9):e0305496.

PMID: 39241041 PMC: 11379386. DOI: 10.1371/journal.pone.0305496.


Leveraging Artificial Intelligence to Optimize Transcranial Direct Current Stimulation for Long COVID Management: A Forward-Looking Perspective.

Rudroff T, Rainio O, Klen R Brain Sci. 2024; 14(8).

PMID: 39199522 PMC: 11353063. DOI: 10.3390/brainsci14080831.


Using natural language processing to evaluate temporal patterns in suicide risk variation among high-risk Veterans.

Levis M, Levy J, DiMambro M, Dufort V, Ludmer D, Goldberg M Psychiatry Res. 2024; 339:116097.

PMID: 39083961 PMC: 11488589. DOI: 10.1016/j.psychres.2024.116097.


References
1.
Mowery D, Smith H, Cheney T, Stoddard G, Coppersmith G, Bryan C . Understanding Depressive Symptoms and Psychosocial Stressors on Twitter: A Corpus-Based Study. J Med Internet Res. 2017; 19(2):e48. PMC: 5350450. DOI: 10.2196/jmir.6895. View

2.
Meyer A, Longhurst C, Singh H . Crowdsourcing Diagnosis for Patients With Undiagnosed Illnesses: An Evaluation of CrowdMed. J Med Internet Res. 2016; 18(1):e12. PMC: 4731679. DOI: 10.2196/jmir.4887. View

3.
Cashion A, Gill J, Hawes R, Henderson W, Saligan L . National Institutes of Health Symptom Science Model sheds light on patient symptoms. Nurs Outlook. 2016; 64(5):499-506. PMC: 5014584. DOI: 10.1016/j.outlook.2016.05.008. View

4.
Curtis J, Chen L, Higginbotham P, Nowell W, Gal-Levy R, Willig J . Social media for arthritis-related comparative effectiveness and safety research and the impact of direct-to-consumer advertising. Arthritis Res Ther. 2017; 19(1):48. PMC: 5341200. DOI: 10.1186/s13075-017-1251-y. View

5.
Sunkureddi P, Gibson D, Doogan S, Heid J, Benosman S, Park Y . Using Self-Reported Patient Experiences to Understand Patient Burden: Learnings from Digital Patient Communities in Ankylosing Spondylitis. Adv Ther. 2018; 35(3):424-437. PMC: 5859700. DOI: 10.1007/s12325-018-0669-1. View