
RelEx: Relation Extraction Using Dependency Parse Trees

Overview
Journal: Bioinformatics
Specialty: Biology
Date: 2006 Dec 5
PMID: 17142812
Citations: 123
Abstract

Motivation: The discovery of regulatory pathways, signal cascades, metabolic processes or disease models requires knowledge of individual relations, such as physical or regulatory interactions between genes and proteins. Most interactions mentioned in the free text of biomedical publications are not yet contained in structured databases.

Results: We developed RelEx, an approach for relation extraction from free text. It applies a small number of simple rules to dependency parse trees produced by natural language preprocessing. We applied RelEx to a comprehensive set of one million MEDLINE abstracts dealing with gene and protein relations and extracted approximately 150,000 relations, with estimated precision and recall of 80% each.
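
To illustrate the general idea of applying a simple rule to a dependency parse tree, here is a minimal Python sketch. It is not the RelEx rule set: the example sentence, the hand-written edges, the dependency labels, the `RELATION_TERMS` list and all function names are assumptions made for illustration. The single rule used here reports a candidate relation whenever a relation term lies on the tree path connecting two named entities; it does not determine the direction of the relation.

```python
from collections import deque

# Toy dependency parse for "MDM2 inhibits p53 in tumor cells."
# Edges are (head, dependent, label). They are hand-written for this
# example, not the output of a real parser.
EDGES = [
    ("inhibits", "MDM2", "nsubj"),
    ("inhibits", "p53", "dobj"),
    ("inhibits", "cells", "prep_in"),
    ("cells", "tumor", "nn"),
]

# Hypothetical relation term list (the real lists are on the authors' site).
RELATION_TERMS = {"inhibits", "activates", "binds"}


def build_graph(edges):
    """Return an undirected adjacency map over the tokens of the tree."""
    graph = {}
    for head, dep, _label in edges:
        graph.setdefault(head, set()).add(dep)
        graph.setdefault(dep, set()).add(head)
    return graph


def shortest_path(graph, start, goal):
    """Breadth-first search; returns the list of tokens on the path, or None."""
    queue = deque([[start]])
    seen = {start}
    while queue:
        path = queue.popleft()
        if path[-1] == goal:
            return path
        for nxt in sorted(graph.get(path[-1], ())):
            if nxt not in seen:
                seen.add(nxt)
                queue.append(path + [nxt])
    return None


def extract_relations(edges, entities, relation_terms):
    """Emit (entity_a, term, entity_b) for each relation term found
    strictly between two entities on their connecting tree path."""
    graph = build_graph(edges)
    ents = sorted(entities)
    relations = []
    for i, a in enumerate(ents):
        for b in ents[i + 1:]:
            path = shortest_path(graph, a, b)
            if path is None:
                continue
            for token in path[1:-1]:
                if token in relation_terms:
                    relations.append((a, token, b))
    return relations


print(extract_relations(EDGES, {"MDM2", "p53"}, RELATION_TERMS))
```

A real pipeline would obtain the edges from a dependency parser and the entities from a gene and protein name recognizer, and would need further rules (for example for negation or enumerations) that this sketch leaves out.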

Availability: The natural language preprocessing tools used are free for academic research. Test sets and relation term lists are available from our website (http://www.bio.ifi.lmu.de/publications/RelEx/).

Citing Articles

An Accurate and Efficient Approach to Knowledge Extraction from Scientific Publications Using Structured Ontology Models, Graph Neural Networks, and Large Language Models.
Ivanisenko T, Demenkov P, Ivanisenko V. Int J Mol Sci. 2024; 25(21).
PMID: 39519363. PMC: 11546091. DOI: 10.3390/ijms252111811.

Learning to explain is a good biomedical few-shot learner.
Chen P, Wang J, Luo L, Lin H, Yang Z. Bioinformatics. 2024; 40(10).
PMID: 39360976. PMC: 11483110. DOI: 10.1093/bioinformatics/btae589.

Evaluating GPT and BERT models for protein-protein interaction identification in biomedical text.
Rehana H, Cam N, Basmaci M, Zheng J, Jemiyo C, He Y. Bioinform Adv. 2024; 4(1):vbae133.
PMID: 39319026. PMC: 11419952. DOI: 10.1093/bioadv/vbae133.

STRING-ing together protein complexes: corpus and methods for extracting physical protein interactions from the biomedical literature.
Mehryary F, Nastou K, Ohta T, Jensen L, Pyysalo S. Bioinformatics. 2024; 40(9).
PMID: 39276156. PMC: 11441320. DOI: 10.1093/bioinformatics/btae552.

Unsupervised literature mining approaches for extracting relationships pertaining to habitats and reproductive conditions of plant species.
Gabud R, Lapitan P, Mariano V, Mendoza E, Pampolina N, Clarino M. Front Artif Intell. 2024; 7:1371411.
PMID: 38845683. PMC: 11153722. DOI: 10.3389/frai.2024.1371411.