
A Domain Knowledge-Enhanced LSTM-CRF Model for Disease Named Entity Recognition

Overview
Specialty Biology
Date 2019 Jul 2
PMID 31259033
Citations 4
Abstract

Disease named entity recognition (NER) is a critical task for most biomedical natural language processing (NLP) applications. For example, extracting diseases from clinical trial text can support patient profiling and downstream applications such as matching clinical trials to eligible patients. Similarly, disease annotation in biomedical articles can help search engines index them accurately, so that clinicians can easily find relevant articles to enhance their knowledge. In this paper, we propose a domain knowledge-enhanced long short-term memory network-conditional random field (LSTM-CRF) model for disease named entity recognition, which is further augmented with a character-level convolutional neural network (CNN) and a character-level LSTM network for input embedding. Experimental results on a scientific article dataset show the effectiveness of our proposed models compared to state-of-the-art methods in disease recognition.
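The abstract does not spell out the model's internals, so the following is only a generic sketch of the CRF decoding step that LSTM-CRF taggers typically share, not the authors' implementation. It assumes a BIO tag set (`O`, `B-Disease`, `I-Disease`), and the emission and transition scores are hand-picked illustrative numbers; in a trained model the emissions would come from the LSTM and the transitions would be learned CRF parameters. The helper names `viterbi_decode` and `bio_to_spans` are hypothetical.

```python
from typing import List, Tuple

# Assumed BIO tag set for a single entity type (illustrative only).
TAGS = ["O", "B-Disease", "I-Disease"]


def viterbi_decode(emissions: List[List[float]],
                   transitions: List[List[float]]) -> List[int]:
    """Return the highest-scoring tag index sequence for one sentence.

    emissions[t][j]   : score of tag j at token t (an LSTM would produce these).
    transitions[i][j] : score of moving from tag i to tag j (CRF parameters).
    """
    if not emissions:
        return []
    n_tags = len(transitions)
    score = list(emissions[0])          # best score ending in each tag so far
    backpointers: List[List[int]] = []
    for t in range(1, len(emissions)):
        new_score, pointers = [], []
        for j in range(n_tags):
            # Best previous tag i for landing on tag j at position t.
            best_i = max(range(n_tags),
                         key=lambda i: score[i] + transitions[i][j])
            new_score.append(score[best_i] + transitions[best_i][j]
                             + emissions[t][j])
            pointers.append(best_i)
        score = new_score
        backpointers.append(pointers)
    # Follow the back-pointers from the best final tag.
    best = max(range(n_tags), key=lambda j: score[j])
    path = [best]
    for pointers in reversed(backpointers):
        best = pointers[best]
        path.append(best)
    path.reverse()
    return path


def bio_to_spans(tags: List[str]) -> List[Tuple[int, int]]:
    """Convert BIO tags to (start, end) token spans, end exclusive."""
    spans, start = [], None
    for i, tag in enumerate(tags):
        if tag.startswith("I-") and start is not None:
            continue                    # entity continues
        if start is not None:
            spans.append((start, i))    # previous entity ends here
            start = None
        if tag.startswith("B-"):
            start = i                   # new entity begins
    if start is not None:
        spans.append((start, len(tags)))
    return spans


# Made-up scores for a toy sentence; columns follow TAGS order.
tokens = ["Patients", "with", "cystic", "fibrosis", "were", "enrolled"]
emissions = [
    [2.0, 0.1, 0.0],
    [2.0, 0.0, 0.0],
    [0.5, 1.5, 1.0],
    [0.4, 0.6, 1.8],
    [2.0, 0.0, 0.1],
    [2.0, 0.0, 0.0],
]
# The large negative O -> I-Disease score discourages an entity
# that starts without a B- tag.
transitions = [
    [0.5, 0.0, -10.0],   # from O
    [0.0, -1.0, 1.0],    # from B-Disease
    [0.0, -1.0, 0.5],    # from I-Disease
]

predicted_tags = [TAGS[i] for i in viterbi_decode(emissions, transitions)]
disease_spans = bio_to_spans(predicted_tags)
print(predicted_tags)
print([" ".join(tokens[s:e]) for s, e in disease_spans])
```

With these toy scores the decoder tags "cystic fibrosis" as a single disease mention. The transition matrix is what separates a CRF output layer from independent per-token classification: it lets the model score whole tag sequences rather than individual tags.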

Citing Articles

A multi-layer soft lattice based model for Chinese clinical named entity recognition.

Guo S, Yang W, Han L, Song X, Wang G BMC Med Inform Decis Mak. 2022; 22(1):201.

PMID: 35908055 PMC: 9338545. DOI: 10.1186/s12911-022-01924-4.


Exploring deep learning methods for recognizing rare diseases and their clinical manifestations from texts.

Segura-Bedmar I, Camino-Perdones D, Guerrero-Aspizua S BMC Bioinformatics. 2022; 23(1):263.

PMID: 35794528 PMC: 9258216. DOI: 10.1186/s12859-022-04810-y.


Extracting clinical named entity for pituitary adenomas from Chinese electronic medical records.

Fang A, Hu J, Zhao W, Feng M, Fu J, Feng S BMC Med Inform Decis Mak. 2022; 22(1):72.

PMID: 35321705 PMC: 8941801. DOI: 10.1186/s12911-022-01810-z.


Named Entity Recognition and Relation Detection for Biomedical Information Extraction.

Perera N, Dehmer M, Emmert-Streib F Front Cell Dev Biol. 2020; 8:673.

PMID: 32984300 PMC: 7485218. DOI: 10.3389/fcell.2020.00673.
