» Articles » PMID: 7719796

Natural Language Processing and the Representation of Clinical Data

Overview
Date 1994 Mar 1
PMID 7719796
Citations 88
Authors
Affiliations
Soon will be listed here.
Abstract

Objective: Develop a representation of clinical observations and actions and a method of processing free-text patient documents to facilitate applications such as quality assurance.

Design: The Linguistic String Project (LSP) system of New York University utilizes syntactic analysis, augmented by a sublanguage grammar and an information structure that are specific to the clinical narrative, to map free-text documents into a database for querying.

Measurements: Information precision (I-P) and information recall (I-R) were measured for queries for the presence of 13 asthma-health-care quality assurance criteria in a database generated from 59 discharge letters.

Results: I-P, using counts of major errors only, was 95.7% for the 28-letter training set and 98.6% for the 31-letter test set. I-R, using counts of major omissions only, was 93.9% for the training set and 92.5% for the test set.

Citing Articles

Hot topics in artificial intelligence.

Bakken S, Poon E J Am Med Inform Assoc. 2025; 32(2):265-267.

PMID: 39836895 PMC: 11756649. DOI: 10.1093/jamia/ocae324.


JAMIA at 30: looking back and forward.

Stead W, Miller R, Ohno-Machado L, Bakken S J Am Med Inform Assoc. 2023; 31(1):1-9.

PMID: 38134400 PMC: 10746314. DOI: 10.1093/jamia/ocad215.


Critical assessment of transformer-based AI models for German clinical notes.

Lentzen M, Madan S, Lage-Rupprecht V, Kuhnel L, Fluck J, Jacobs M JAMIA Open. 2022; 5(4):ooac087.

PMID: 36380848 PMC: 9663939. DOI: 10.1093/jamiaopen/ooac087.


Evaluation of Natural Language Processing for the Identification of Crohn Disease-Related Variables in Spanish Electronic Health Records: A Validation Study for the PREMONITION-CD Project.

Montoto C, Gisbert J, Guerra I, Plaza R, Pajares Villarroya R, Moreno Almazan L JMIR Med Inform. 2022; 10(2):e30345.

PMID: 35179507 PMC: 8900906. DOI: 10.2196/30345.


Assessing the Performance of Clinical Natural Language Processing Systems: Development of an Evaluation Methodology.

Canales L, Menke S, Marchesseau S, DAgostino A, Del Rio-Bermudez C, Taberna M JMIR Med Inform. 2021; 9(7):e20492.

PMID: 34297002 PMC: 8367121. DOI: 10.2196/20492.


References
1.
Campbell K, Musen M . Representation of clinical data using SNOMED III and conceptual graphs. Proc Annu Symp Comput Appl Med Care. 1992; :354-8. PMC: 2248067. View

2.
Musen M . Dimensions of knowledge sharing and reuse. Comput Biomed Res. 1992; 25(5):435-67. DOI: 10.1016/0010-4809(92)90003-s. View

3.
Dunham G, Pacak M, Pratt A . Automatic indexing of pathology data. J Am Soc Inf Sci. 1978; 29(2):81-90. DOI: 10.1002/asi.4630290207. View

4.
SAGER N, Lyman M . Computerized language processing: implications for health care evaluation. Med Rec News. 1978; 49(3):20-1, 23-4, 26-8 passim. View

5.
Hirschman L, Story G, Marsh E, Lyman M, SAGER N . An experiment in automated health care evaluation from narrative medical records. Comput Biomed Res. 1981; 14(5):447-63. DOI: 10.1016/0010-4809(81)90021-5. View