» Articles » PMID: 29186323

An Attention-based BiLSTM-CRF Approach to Document-level Chemical Named Entity Recognition

Overview
Journal Bioinformatics
Specialty Biology
Date 2017 Nov 30
PMID 29186323
Citations 55
Authors
Affiliations
Soon will be listed here.
Abstract

Motivation: In biomedical research, chemical is an important class of entities, and chemical named entity recognition (NER) is an important task in the field of biomedical information extraction. However, most popular chemical NER methods are based on traditional machine learning and their performances are heavily dependent on the feature engineering. Moreover, these methods are sentence-level ones which have the tagging inconsistency problem.

Results: In this paper, we propose a neural network approach, i.e. attention-based bidirectional Long Short-Term Memory with a conditional random field layer (Att-BiLSTM-CRF), to document-level chemical NER. The approach leverages document-level global information obtained by attention mechanism to enforce tagging consistency across multiple instances of the same token in a document. It achieves better performances with little feature engineering than other state-of-the-art methods on the BioCreative IV chemical compound and drug name recognition (CHEMDNER) corpus and the BioCreative V chemical-disease relation (CDR) task corpus (the F-scores of 91.14 and 92.57%, respectively).

Availability And Implementation: Data and code are available at https://github.com/lingluodlut/Att-ChemdNER.

Contact: yangzh@dlut.edu.cn or wangleibihami@gmail.com.

Supplementary Information: Supplementary data are available at Bioinformatics online.

Citing Articles

Exploiting question-answer framework with multi-GRU to detect adverse drug reaction on social media.

Luo J, Yang A Sci Rep. 2025; 15(1):4157.

PMID: 39905141 PMC: 11794948. DOI: 10.1038/s41598-025-87724-y.


Predicting CRISPR-Cas9 off-target effects in human primary cells using bidirectional LSTM with BERT embedding.

Sari O, Liu Z, Pan Y, Shao X Bioinform Adv. 2025; 5(1):vbae184.

PMID: 39758829 PMC: 11696696. DOI: 10.1093/bioadv/vbae184.


Research on the construction of a knowledge graph for tomato leaf pests and diseases based on the named entity recognition model.

Wang K, Miao Y, Wang X, Li Y, Li F, Song H Front Plant Sci. 2024; 15:1482275.

PMID: 39574459 PMC: 11578693. DOI: 10.3389/fpls.2024.1482275.


Alzheimer's Disease Knowledge Graph Enhances Knowledge Discovery and Disease Prediction.

Yang Y, Yu K, Gao S, Yu S, Xiong D, Qin C bioRxiv. 2024; .

PMID: 39005357 PMC: 11245034. DOI: 10.1101/2024.07.03.601339.


A Deep Learning-Based Method for Preventing Data Leakage in Electric Power Industrial Internet of Things Business Data Interactions.

Miao W, Zhao X, Zhang Y, Chen S, Li X, Li Q Sensors (Basel). 2024; 24(13).

PMID: 39000847 PMC: 11243995. DOI: 10.3390/s24134069.