» Articles » PMID: 34210317

Supporting the Classification of Patients in Public Hospitals in Chile by Designing, Deploying and Validating a System Based on Natural Language Processing

Overview
Publisher Biomed Central
Date 2021 Jul 2
PMID 34210317
Citations 5
Authors
Affiliations
Soon will be listed here.
Abstract

Background: In Chile, a patient needing a specialty consultation or surgery has to first be referred by a general practitioner, then placed on a waiting list. The Explicit Health Guarantees (GES in Spanish) ensures, by law, the maximum time to solve 85 health problems. Usually, a health professional manually verifies if each referral, written in natural language, corresponds or not to a GES-covered disease. An error in this classification is catastrophic for patients, as it puts them on a non-prioritized waiting list, characterized by prolonged waiting times.

Methods: To support the manual process, we developed and deployed a system that automatically classifies referrals as GES-covered or not using historical data. Our system is based on word embeddings specially trained for clinical text produced in Chile. We used a vector representation of the reason for referral and patient's age as features for training machine learning models using human-labeled historical data. We constructed a ground truth dataset combining classifications made by three healthcare experts, which was used to validate our results.

Results: The best performing model over ground truth reached an AUC score of 0.94, with a weighted F1-score of 0.85 (0.87 in precision and 0.86 in recall). During seven months of continuous and voluntary use, the system has amended 87 patient misclassifications.

Conclusion: This system is a result of a collaboration between technical and clinical experts, and the design of the classifier was custom-tailored for a hospital's clinical workflow, which encouraged the voluntary use of the platform. Our solution can be easily expanded across other hospitals since the registry is uniform in Chile.

Citing Articles

NLP modeling recommendations for restricted data availability in clinical settings.

Villena F, Bravo-Marquez F, Dunstan J BMC Med Inform Decis Mak. 2025; 25(1):116.

PMID: 40055634 PMC: 11889813. DOI: 10.1186/s12911-025-02948-2.


Demystifying the Role of Natural Language Processing (NLP) in Smart City Applications: Background, Motivation, Recent Advances, and Future Research Directions.

Tyagi N, Bhushan B Wirel Pers Commun. 2023; 130(2):857-908.

PMID: 37168438 PMC: 10019426. DOI: 10.1007/s11277-023-10312-8.


Year 2021: COVID-19, Information Extraction and BERTization among the Hottest Topics in Medical Natural Language Processing.

Grabar N, Grouin C Yearb Med Inform. 2022; 31(1):254-260.

PMID: 36463883 PMC: 9719758. DOI: 10.1055/s-0042-1742547.


Pneumonia and Pulmonary Thromboembolism Classification Using Electronic Health Records.

Siordia-Millan S, Torres-Ramos S, Salido-Ruiz R, Hernandez-Gordillo D, Perez-Gutierrez T, Roman-Godinez I Diagnostics (Basel). 2022; 12(10).

PMID: 36292225 PMC: 9601338. DOI: 10.3390/diagnostics12102536.


Training and intrinsic evaluation of lightweight word embeddings for the clinical domain in Spanish.

Chiu C, Villena F, Martin K, Nunez F, Besa C, Dunstan J Front Artif Intell. 2022; 5:970517.

PMID: 36213168 PMC: 9533099. DOI: 10.3389/frai.2022.970517.


References
1.
Pons E, Braun L, Hunink M, Kors J . Natural Language Processing in Radiology: A Systematic Review. Radiology. 2016; 279(2):329-43. DOI: 10.1148/radiol.16142770. View

2.
Yim W, Yetisgen M, Harris W, Kwan S . Natural Language Processing in Oncology: A Review. JAMA Oncol. 2016; 2(6):797-804. DOI: 10.1001/jamaoncol.2016.0213. View

3.
Roque F, Jensen P, Schmock H, Dalgaard M, Andreatta M, Hansen T . Using electronic patient records to discover disease correlations and stratify patient cohorts. PLoS Comput Biol. 2011; 7(8):e1002141. PMC: 3161904. DOI: 10.1371/journal.pcbi.1002141. View

4.
Koza W, Filippo D, Cotik V, Stricker V, Munoz M, Godoy N . Automatic Detection of Negated Findings in Radiological Reports for Spanish Language: Methodology Based on Lexicon-Grammatical Information Processing. J Digit Imaging. 2018; 32(1):19-29. PMC: 6382643. DOI: 10.1007/s10278-018-0113-8. View

5.
Cotik V, Filippo D, Castano J . An Approach for Automatic Classification of Radiology Reports in Spanish. Stud Health Technol Inform. 2015; 216:634-8. View