» Articles » PMID: 22554700

The EU-ADR Corpus: Annotated Drugs, Diseases, Targets, and Their Relationships

Overview
Journal J Biomed Inform
Publisher Elsevier
Date 2012 May 5
PMID 22554700
Citations 47
Authors
Affiliations
Soon will be listed here.
Abstract

Corpora with specific entities and relationships annotated are essential to train and evaluate text-mining systems that are developed to extract specific structured information from a large corpus. In this paper we describe an approach where a named-entity recognition system produces a first annotation and annotators revise this annotation using a web-based interface. The agreement figures achieved show that the inter-annotator agreement is much better than the agreement with the system provided annotations. The corpus has been annotated for drugs, disorders, genes and their inter-relationships. For each of the drug-disorder, drug-target, and target-disorder relations three experts have annotated a set of 100 abstracts. These annotated relationships will be used to train and evaluate text-mining software to capture these relationships in texts.

Citing Articles

Dataset of miRNA-disease relations extracted from textual data using transformer-based neural networks.

Madan S, Kuhnel L, Frohlich H, Hofmann-Apitius M, Fluck J Database (Oxford). 2024; 2024.

PMID: 39104284 PMC: 11300841. DOI: 10.1093/database/baae066.


Large Language Models and Genomics for Summarizing the Role of microRNA in Regulating mRNA Expression.

Bhasuran B, Manoharan S, Iyyappan O, Murugesan G, Prabahar A, Raja K Biomedicines. 2024; 12(7).

PMID: 39062108 PMC: 11274411. DOI: 10.3390/biomedicines12071535.


Transformers and large language models in healthcare: A review.

Nerella S, Bandyopadhyay S, Zhang J, Contreras M, Siegel S, Bumin A Artif Intell Med. 2024; 154:102900.

PMID: 38878555 PMC: 11638972. DOI: 10.1016/j.artmed.2024.102900.


A Study of Biomedical Relation Extraction Using GPT Models.

Zhang J, Wibert M, Zhou H, Peng X, Chen Q, Keloth V AMIA Jt Summits Transl Sci Proc. 2024; 2024:391-400.

PMID: 38827097 PMC: 11141827.


An in-depth evaluation of federated learning on biomedical natural language processing for information extraction.

Peng L, Luo G, Zhou S, Chen J, Xu Z, Sun J NPJ Digit Med. 2024; 7(1):127.

PMID: 38750290 PMC: 11096157. DOI: 10.1038/s41746-024-01126-4.