» Articles » PMID: 25332357

Comparison of a Semi-automatic Annotation Tool and a Natural Language Processing Application for the Generation of Clinical Statement Entries

Overview
Date 2014 Oct 22
PMID 25332357
Citations 3
Authors
Affiliations
Soon will be listed here.
Abstract

Background And Objective: Electronic medical records with encoded entries should enhance the semantic interoperability of document exchange. However, it remains a challenge to encode the narrative concept and to transform the coded concepts into a standard entry-level document. This study aimed to use a novel approach for the generation of entry-level interoperable clinical documents.

Methods: Using HL7 clinical document architecture (CDA) as the example, we developed three pipelines to generate entry-level CDA documents. The first approach was a semi-automatic annotation pipeline (SAAP), the second was a natural language processing (NLP) pipeline, and the third merged the above two pipelines. We randomly selected 50 test documents from the i2b2 corpora to evaluate the performance of the three pipelines.

Results: The 50 randomly selected test documents contained 9365 words, including 588 Observation terms and 123 Procedure terms. For the Observation terms, the merged pipeline had a significantly higher F-measure than the NLP pipeline (0.89 vs 0.80, p<0.0001), but a similar F-measure to that of the SAAP (0.89 vs 0.87). For the Procedure terms, the F-measure was not significantly different among the three pipelines.

Conclusions: The combination of a semi-automatic annotation approach and the NLP application seems to be a solution for generating entry-level interoperable clinical documents.

Citing Articles

Designing an openEHR-Based Pipeline for Extracting and Standardizing Unstructured Clinical Data Using Natural Language Processing.

Wulff A, Mast M, Hassler M, Montag S, Marschollek M, Jack T Methods Inf Med. 2020; 59(S 02):e64-e78.

PMID: 33058101 PMC: 7725544. DOI: 10.1055/s-0040-1716403.


Words prediction based on N-gram model for free-text entry in electronic health records.

Yazdani A, Safdari R, Golkar A, Niakan Kalhori S Health Inf Sci Syst. 2019; 7(1):6.

PMID: 30886701 PMC: 6395458. DOI: 10.1007/s13755-019-0065-5.


A computational framework for converting textual clinical diagnostic criteria into the quality data model.

Hong N, Li D, Yu Y, Xiu Q, Liu H, Jiang G J Biomed Inform. 2016; 63:11-21.

PMID: 27444185 PMC: 5077690. DOI: 10.1016/j.jbi.2016.07.016.

References
1.
Aronson A, Bodenreider O, Chang H, Humphrey S, Mork J, Nelson S . The NLM Indexing Initiative. Proc AMIA Symp. 2000; :17-21. PMC: 2243970. View

2.
Friedman C, Shagina L, Lussier Y, Hripcsak G . Automated encoding of clinical documents based on natural language processing. J Am Med Inform Assoc. 2004; 11(5):392-402. PMC: 516246. DOI: 10.1197/jamia.M1552. View

3.
Savova G, Fan J, Ye Z, Murphy S, Zheng J, Chute C . Discovering peripheral arterial disease cases from radiology notes using natural language processing. AMIA Annu Symp Proc. 2011; 2010:722-6. PMC: 3041293. View

4.
Wu Y, Denny J, Rosenbloom S, Miller R, Giuse D, Xu H . A comparative study of current Clinical Natural Language Processing systems on handling abbreviations in discharge summaries. AMIA Annu Symp Proc. 2013; 2012:997-1003. PMC: 3540461. View

5.
Burgun A, Denier P, Bodenreider O, Botti G, Delamarre D, Pouliquen B . A Web terminology server using UMLS for the description of medical procedures. J Am Med Inform Assoc. 1997; 4(5):356-63. PMC: 61253. DOI: 10.1136/jamia.1997.0040356. View