» Articles » PMID: 27016700

CD-REST: a System for Extracting Chemical-induced Disease Relation in Literature

Overview
Specialty Biology
Date 2016 Mar 27
PMID 27016700
Citations 33
Authors
Affiliations
Soon will be listed here.
Abstract

Mining chemical-induced disease relations embedded in the vast biomedical literature could facilitate a wide range of computational biomedical applications, such as pharmacovigilance. The BioCreative V organized a Chemical Disease Relation (CDR) Track regarding chemical-induced disease relation extraction from biomedical literature in 2015. We participated in all subtasks of this challenge. In this article, we present our participation system Chemical Disease Relation Extraction SysTem (CD-REST), an end-to-end system for extracting chemical-induced disease relations in biomedical literature. CD-REST consists of two main components: (1) a chemical and disease named entity recognition and normalization module, which employs the Conditional Random Fields algorithm for entity recognition and a Vector Space Model-based approach for normalization; and (2) a relation extraction module that classifies both sentence-level and document-level candidate drug-disease pairs by support vector machines. Our system achieved the best performance on the chemical-induced disease relation extraction subtask in the BioCreative V CDR Track, demonstrating the effectiveness of our proposed machine learning-based approaches for automatic extraction of chemical-induced disease relations in biomedical literature. The CD-REST system provides web services using HTTP POST request. The web services can be accessed fromhttp://clinicalnlptool.com/cdr The online CD-REST demonstration system is available athttp://clinicalnlptool.com/cdr/cdr.html. Database URL:http://clinicalnlptool.com/cdr;http://clinicalnlptool.com/cdr/cdr.html.

Citing Articles

A framework for integrating biomedical knowledge in Wikidata with open biological and biomedical ontologies and MeSH keywords.

Turki H, Chebil K, Dossou B, Emezue C, Owodunni A, Hadj Taieb M Heliyon. 2024; 10(19):e38448.

PMID: 39403518 PMC: 11471508. DOI: 10.1016/j.heliyon.2024.e38448.


PubTator 3.0: an AI-powered literature resource for unlocking biomedical knowledge.

Wei C, Allot A, Lai P, Leaman R, Tian S, Luo L Nucleic Acids Res. 2024; 52(W1):W540-W546.

PMID: 38572754 PMC: 11223843. DOI: 10.1093/nar/gkae235.


A metric learning-based method for biomedical entity linking.

Le N, Nguyen N Front Res Metr Anal. 2024; 8:1247094.

PMID: 38173988 PMC: 10762861. DOI: 10.3389/frma.2023.1247094.


The precision medicine process for treating rare disease using the artificial intelligence tool mediKanren.

Foksinska A, Crowder C, Crouse A, Henrikson J, Byrd W, Rosenblatt G Front Artif Intell. 2022; 5:910216.

PMID: 36248623 PMC: 9562701. DOI: 10.3389/frai.2022.910216.


Identification of Chemical-Disease Associations Through Integration of Molecular Fingerprint, Gene Ontology and Pathway Information.

Li Z, Wang M, Peng D, Liu J, Xie Y, Dai Z Interdiscip Sci. 2022; 14(3):683-696.

PMID: 35391615 DOI: 10.1007/s12539-022-00511-5.


References
1.
Chen E, Hripcsak G, Xu H, Markatou M, Friedman C . Automated acquisition of disease drug knowledge from biomedical and clinical documents: an initial study. J Am Med Inform Assoc. 2007; 15(1):87-98. PMC: 2274872. DOI: 10.1197/jamia.M2401. View

2.
Islamaj Dogan R, Leaman R, Lu Z . NCBI disease corpus: a resource for disease name recognition and concept normalization. J Biomed Inform. 2014; 47:1-10. PMC: 3951655. DOI: 10.1016/j.jbi.2013.12.006. View

3.
Leaman R, Wei C, Lu Z . tmChem: a high performance approach for chemical named entity recognition and normalization. J Cheminform. 2015; 7:S3. PMC: 4331693. DOI: 10.1186/1758-2946-7-S1-S3. View

4.
Rocktaschel T, Weidlich M, Leser U . ChemSpot: a hybrid system for chemical named entity recognition. Bioinformatics. 2012; 28(12):1633-40. DOI: 10.1093/bioinformatics/bts183. View

5.
Bodenreider O . The Unified Medical Language System (UMLS): integrating biomedical terminology. Nucleic Acids Res. 2003; 32(Database issue):D267-70. PMC: 308795. DOI: 10.1093/nar/gkh061. View