An Effective Neural Model Extracting Document Level Chemical-induced Disease Relations from Biomedical Literature
Overview
Authors
Affiliations
Since identifying relations between chemicals and diseases (CDR) are important for biomedical research and healthcare, the challenge proposed by BioCreative V requires automatically mining causal relationships between chemicals and diseases which may span sentence boundaries. Although most systems explore feature engineering and knowledge bases to recognize document level CDR relations, feature learning automatically is limited only in a sentence. In this work, we proposed an effective model that automatically learns document level semantic representations to extract chemical-induced disease (CID) relations from articles by combining advantages of convolutional neural network and recurrent neural network. First, to purposefully collect contexts, candidate entities existing in multiple sentences of an article were masked to make the model have ability to discern candidate entities and general terms. Next, considering the contiguity and temporality among associated sentences as well as the topic of an article, a hierarchical network architecture was designed at the document level to capture semantic information of different types of text segments in an article. Finally, a softmax classifier performed the CID recognition. Experimental results on the CDR corpus show that the proposed model achieves a good overall performance compared with other state-of-the-art methods. Although only using two types of embedding vectors, our approach can perform well for recognizing not only intra-sentential but also inter-sentential CID relations.
Exploiting document graphs for inter sentence relation extraction.
Le H, Can D, Collier N J Biomed Semantics. 2022; 13(1):15.
PMID: 35659292 PMC: 9166375. DOI: 10.1186/s13326-022-00267-3.
Li Z, Wang M, Peng D, Liu J, Xie Y, Dai Z Interdiscip Sci. 2022; 14(3):683-696.
PMID: 35391615 DOI: 10.1007/s12539-022-00511-5.
Biomedical relation extraction via knowledge-enhanced reading comprehension.
Chen J, Hu B, Peng W, Chen Q, Tang B BMC Bioinformatics. 2022; 23(1):20.
PMID: 34991458 PMC: 8734165. DOI: 10.1186/s12859-021-04534-5.
Named Entity Recognition and Relation Detection for Biomedical Information Extraction.
Perera N, Dehmer M, Emmert-Streib F Front Cell Dev Biol. 2020; 8:673.
PMID: 32984300 PMC: 7485218. DOI: 10.3389/fcell.2020.00673.
Exploiting sequence labeling framework to extract document-level relations from biomedical texts.
Li Z, Yang Z, Xiang Y, Luo L, Sun Y, Lin H BMC Bioinformatics. 2020; 21(1):125.
PMID: 32216746 PMC: 7099809. DOI: 10.1186/s12859-020-3457-2.