» Articles » PMID: 18693842

Three Approaches to Automatic Assignment of ICD-9-CM Codes to Radiology Reports

Overview
Date 2008 Aug 13
PMID 18693842
Citations 22
Authors
Affiliations
Soon will be listed here.
Abstract

We describe and evaluate three systems for automatically predicting the ICD-9-CM codes of radiology reports from short excerpts of text. The first system benefits from an open source search engine, Lucene, and takes advantage of the relevance of reports to one another based on individual words. The second uses BoosTexter, a boosting algorithm based on n-grams (sequences of consecutive words) and s-grams (sequences of non-consecutive words) extracted from the reports. The third employs a set of hand-crafted rules that capture lexical elements (short, meaningful, strings of words) derived from BoosTexter's n-grams, and that are enhanced by shallow semantic information in the form of negation, synonymy, and uncertainty. Our evaluation shows that semantic information significantly contributes to ICD-9-CM coding with lexical elements. Also, a simple hand-crafted rule-based system with lexical elements and semantic information can outperform algorithmically more complex systems, such as Lucene and BoosTexter, when these systems base their ICD-9-CM predictions only upon individual words, n-grams, or s grams.

Citing Articles

Automated vs. manual coding of neuroimaging reports via natural language processing, using the international classification of diseases, tenth revision.

McKinney A, Moore J, Campbell K, Braga T, Rykken J, Jagadeesan B Heliyon. 2024; 10(10):e30106.

PMID: 38799748 PMC: 11126795. DOI: 10.1016/j.heliyon.2024.e30106.


An overview of biomedical entity linking throughout the years.

French E, McInnes B J Biomed Inform. 2022; 137:104252.

PMID: 36464228 PMC: 9845184. DOI: 10.1016/j.jbi.2022.104252.


Classification of user queries according to a hierarchical medical procedure encoding system using an ensemble classifier.

Deng Y, Denecke K Front Artif Intell. 2022; 5:1000283.

PMID: 36406473 PMC: 9672500. DOI: 10.3389/frai.2022.1000283.


Deep-ADCA: Development and Validation of Deep Learning Model for Automated Diagnosis Code Assignment Using Clinical Notes in Electronic Medical Records.

Masud J, Shun C, Kuo C, Islam M, Yeh C, Yang H J Pers Med. 2022; 12(5).

PMID: 35629129 PMC: 9146030. DOI: 10.3390/jpm12050707.


Strategies to Address the Lack of Labeled Data for Supervised Machine Learning Training With Electronic Health Records: Case Study for the Extraction of Symptoms From Clinical Notes.

Humbert-Droz M, Mukherjee P, Gevaert O JMIR Med Inform. 2022; 10(3):e32903.

PMID: 35285805 PMC: 8961340. DOI: 10.2196/32903.


References
1.
Auerbuch M, Karson T, Ben-Ami B, Maimon O, Rokach L . Context-sensitive medical information retrieval. Stud Health Technol Inform. 2004; 107(Pt 1):282-6. View

2.
Aronson A . Effective mapping of biomedical text to the UMLS Metathesaurus: the MetaMap program. Proc AMIA Symp. 2002; :17-21. PMC: 2243666. View

3.
Chapman W, Bridewell W, Hanbury P, Cooper G, Buchanan B . A simple algorithm for identifying negated findings and diseases in discharge summaries. J Biomed Inform. 2002; 34(5):301-10. DOI: 10.1006/jbin.2001.1029. View

4.
Tsui F, Wagner M, Dato V, Chang C . Value of ICD-9 coded chief complaints for detection of epidemics. Proc AMIA Symp. 2002; :711-5. PMC: 2243339. View

5.
Hersh W, Mailhot M, Lowe H . Selective automated indexing of findings and diagnoses in radiology reports. J Biomed Inform. 2002; 34(4):262-73. DOI: 10.1006/jbin.2001.1025. View