» Articles » PMID: 30961596

Constructing a Chinese Electronic Medical Record Corpus for Named Entity Recognition on Resident Admit Notes

Overview
Publisher Biomed Central
Date 2019 Apr 10
PMID 30961596
Citations 2
Authors
Affiliations
Soon will be listed here.
Abstract

Background: Electronic Medical Records(EMRs) contain much medical information about patients. Medical named entity extracting from EMRs can provide value information to support doctors' decision making. The research on information extraction of Chinese Electronic Medical Records is still behind that has done in English.

Methods: This paper proposed a practical annotation scheme for medical entity extraction on Resident Admit Notes (RANs), and a model which can automatic extract medical entity. Nine types of clinical entities, four types of clinical relationships were defined in our annotation scheme. An end-to-end deep neural network with convolution neural network and long-short term memory units was applied in our model for the medical named entity recognition(NER).

Result: We annotated RANs in three rounds. The overall F-score of annotation consistency was up to 97.73%. And our NER model on the 255 annotated RANs achieved the best F-score of 91.08%.

Conclusion: The annotation scheme and the model for NER in this paper are effective to extract medical named entity from RANs and provide the basis for fully excavating the patient's information.

Citing Articles

Construction, evaluation, and application of an electronic medical record corpus for cerebral palsy rehabilitation.

Xiao M, Pang Q, Zhu Y, Shuai L, Jin G Digit Health. 2024; 10:20552076241286260.

PMID: 39347507 PMC: 11437554. DOI: 10.1177/20552076241286260.


Constructing fine-grained entity recognition corpora based on clinical records of traditional Chinese medicine.

Zhang T, Wang Y, Wang X, Yang Y, Ye Y BMC Med Inform Decis Mak. 2020; 20(1):64.

PMID: 32252745 PMC: 7132896. DOI: 10.1186/s12911-020-1079-2.

References
1.
Roberts A, Gaizauskas R, Hepple M, Davis N, Demetriou G, Guo Y . The CLEF corpus: semantic annotation of clinical text. AMIA Annu Symp Proc. 2008; :625-9. PMC: 2655900. View

2.
Sun W, Rumshisky A, Uzuner O . Evaluating temporal relations in clinical text: 2012 i2b2 Challenge. J Am Med Inform Assoc. 2013; 20(5):806-13. PMC: 3756273. DOI: 10.1136/amiajnl-2013-001628. View

3.
South B, Shen S, Jones M, Garvin J, Samore M, Chapman W . Developing a manually annotated clinical document corpus to identify phenotypic information for inflammatory bowel disease. BMC Bioinformatics. 2009; 10 Suppl 9:S12. PMC: 2745683. DOI: 10.1186/1471-2105-10-S9-S12. View

4.
Uzuner O, Solti I, Xia F, Cadag E . Community annotation experiment for ground truth generation for the i2b2 medication challenge. J Am Med Inform Assoc. 2010; 17(5):519-23. PMC: 2995684. DOI: 10.1136/jamia.2010.004200. View

5.
Meystre S, Haug P . Natural language processing to extract medical problems from electronic clinical documents: performance evaluation. J Biomed Inform. 2005; 39(6):589-99. DOI: 10.1016/j.jbi.2005.11.004. View