» Articles » PMID: 23323936

Negated Bio-events: Analysis and Identification

Overview
Publisher Biomed Central
Specialty Biology
Date 2013 Jan 18
PMID 23323936
Citations 8
Authors
Affiliations
Soon will be listed here.
Abstract

Background: Negation occurs frequently in scientific literature, especially in biomedical literature. It has previously been reported that around 13% of sentences found in biomedical research articles contain negation. Historically, the main motivation for identifying negated events has been to ensure their exclusion from lists of extracted interactions. However, recently, there has been a growing interest in negative results, which has resulted in negation detection being identified as a key challenge in biomedical relation extraction. In this article, we focus on the problem of identifying negated bio-events, given gold standard event annotations.

Results: We have conducted a detailed analysis of three open access bio-event corpora containing negation information (i.e., GENIA Event, BioInfer and BioNLP'09 ST), and have identified the main types of negated bio-events. We have analysed the key aspects of a machine learning solution to the problem of detecting negated events, including selection of negation cues, feature engineering and the choice of learning algorithm. Combining the best solutions for each aspect of the problem, we propose a novel framework for the identification of negated bio-events. We have evaluated our system on each of the three open access corpora mentioned above. The performance of the system significantly surpasses the best results previously reported on the BioNLP'09 ST corpus, and achieves even better results on the GENIA Event and BioInfer corpora, both of which contain more varied and complex events.

Conclusions: Recently, in the field of biomedical text mining, the development and enhancement of event-based systems has received significant interest. The ability to identify negated events is a key performance element for these systems. We have conducted the first detailed study on the analysis and identification of negated bio-events. Our proposed framework can be integrated with state-of-the-art event extraction systems. The resulting systems will be able to extract bio-events with attached polarities from textual documents, which can serve as the foundation for more elaborate systems that are able to detect mutually contradicting bio-events.

Citing Articles

A novel corpus of molecular to higher-order events that facilitates the understanding of the pathogenic mechanisms of idiopathic pulmonary fibrosis.

Nagano N, Tokunaga N, Ikeda M, Inoura H, Khoa D, Miwa M Sci Rep. 2023; 13(1):5986.

PMID: 37045907 PMC: 10092917. DOI: 10.1038/s41598-023-32915-8.


A survey on clinical natural language processing in the United Kingdom from 2007 to 2022.

Wu H, Wang M, Wu J, Francis F, Chang Y, Shavick A NPJ Digit Med. 2022; 5(1):186.

PMID: 36544046 PMC: 9770568. DOI: 10.1038/s41746-022-00730-6.


An automated approach to identify scientific publications reporting pharmacokinetic parameters.

Gonzalez Hernandez F, Carter S, Iso-Sipila J, Goldsmith P, Almousa A, Gastine S Wellcome Open Res. 2021; 6:88.

PMID: 34381873 PMC: 8343403. DOI: 10.12688/wellcomeopenres.16718.1.


Antibody Exchange: Information extraction of biological antibody donation and a web-portal to find donors and seekers.

Subramanian S, Ganapathiraju M Data (Basel). 2018; 2(4).

PMID: 30498741 PMC: 6258257. DOI: 10.3390/data2040038.


Annotation and detection of drug effects in text for pharmacovigilance.

Thompson P, Daikou S, Ueno K, Batista-Navarro R, Tsujii J, Ananiadou S J Cheminform. 2018; 10(1):37.

PMID: 30105604 PMC: 6089860. DOI: 10.1186/s13321-018-0290-y.


References
1.
Smialowski P, Pagel P, Wong P, Brauner B, Dunger I, Fobo G . The Negatome database: a reference set of non-interacting protein pairs. Nucleic Acids Res. 2009; 38(Database issue):D540-4. PMC: 2808923. DOI: 10.1093/nar/gkp1026. View

2.
Thompson P, Iqbal S, McNaught J, Ananiadou S . Construction of an annotated corpus to support biomedical information extraction. BMC Bioinformatics. 2009; 10:349. PMC: 2774701. DOI: 10.1186/1471-2105-10-349. View

3.
Chapman W, Bridewell W, Hanbury P, Cooper G, Buchanan B . A simple algorithm for identifying negated findings and diseases in discharge summaries. J Biomed Inform. 2002; 34(5):301-10. DOI: 10.1006/jbin.2001.1029. View

4.
Agarwal S, Yu H . Biomedical negation scope detection with conditional random fields. J Am Med Inform Assoc. 2010; 17(6):696-701. PMC: 3000754. DOI: 10.1136/jamia.2010.003228. View

5.
Mutalik P, Deshpande A, Nadkarni P . Use of general-purpose negation detection to augment concept indexing of medical documents: a quantitative study using the UMLS. J Am Med Inform Assoc. 2001; 8(6):598-609. PMC: 130070. DOI: 10.1136/jamia.2001.0080598. View