» Articles » PMID: 22166723

Automatic Extraction of Semantic Relations Between Medical Entities: a Rule Based Approach

Overview
Publisher Biomed Central
Date 2011 Dec 15
PMID 22166723
Citations 22
Authors
Affiliations
Soon will be listed here.
Abstract

Background: Information extraction is a complex task which is necessary to develop high-precision information retrieval tools. In this paper, we present the platform MeTAE (Medical Texts Annotation and Exploration). MeTAE allows (i) to extract and annotate medical entities and relationships from medical texts and (ii) to explore semantically the produced RDF annotations.

Results: Our annotation approach relies on linguistic patterns and domain knowledge and consists in two steps: (i) recognition of medical entities and (ii) identification of the correct semantic relation between each pair of entities. The first step is achieved by an enhanced use of MetaMap which improves the precision obtained by MetaMap by 19.59% in our evaluation. The second step relies on linguistic patterns which are built semi-automatically from a corpus selected according to semantic criteria. We evaluate our system's ability to identify medical entities of 16 types. We also evaluate the extraction of treatment relations between a treatment (e.g. medication) and a problem (e.g. disease): we obtain 75.72% precision and 60.46% recall.

Conclusions: According to our experiments, using an external sentence segmenter and noun phrase chunker may improve the precision of MetaMap-based medical entity recognition. Our pattern-based relation extraction method obtains good precision and recall w.r.t related works. A more precise comparison with related approaches remains difficult however given the differences in corpora and in the exact nature of the extracted relations. The selection of MEDLINE articles through queries related to known drug-disease pairs enabled us to obtain a more focused corpus of relevant examples of treatment relations than a more general MEDLINE query.

Citing Articles

Towards discovery: an end-to-end system for uncovering novel biomedical relations.

Almeida T, Jonker R, Antunes R, Almeida J, Matos S Database (Oxford). 2024; 2024.

PMID: 38994795 PMC: 11240158. DOI: 10.1093/database/baae057.


Synthetic data for annotation and extraction of family history information from clinical text.

Brekke P, Rama T, Pilan I, Nytro O, Ovrelid L J Biomed Semantics. 2021; 12(1):11.

PMID: 34261535 PMC: 8278746. DOI: 10.1186/s13326-021-00244-2.


Medical Knowledge Graph to Enhance Fraud, Waste, and Abuse Detection on Claim Data: Model Development and Performance Evaluation.

Sun H, Xiao J, Zhu W, He Y, Zhang S, Xu X JMIR Med Inform. 2020; 8(7):e17653.

PMID: 32706714 PMC: 7413281. DOI: 10.2196/17653.


Distant supervision for treatment relation extraction by leveraging MeSH subheadings.

Tran T, Kavuluru R Artif Intell Med. 2019; 98:18-26.

PMID: 31521249 PMC: 6748648. DOI: 10.1016/j.artmed.2019.06.002.


Comparison of Natural Language Processing Techniques in Analysis of Sparse Clinical Data: Insulin Decline by Patients.

Malmasi S, Ge W, Hosomura N, Turchin A AMIA Jt Summits Transl Sci Proc. 2019; 2019:610-619.

PMID: 31259016 PMC: 6568116.


References
1.
Rindflesch T, Bean C, Sneiderman C . Argument identification for arterial branching predications asserted in cardiac catheterization reports. Proc AMIA Symp. 2000; :704-8. PMC: 2243725. View

2.
Ahlers C, Fiszman M, Demner-Fushman D, Lang F, Rindflesch T . Extracting semantic predications from Medline citations for pharmacogenomics. Pac Symp Biocomput. 2007; :209-20. View

3.
Pratt W, Yetisgen-Yildiz M . A study of biomedical concept identification: MetaMap vs. people. AMIA Annu Symp Proc. 2004; :529-33. PMC: 1479976. View

4.
Aronson A . Effective mapping of biomedical text to the UMLS Metathesaurus: the MetaMap program. Proc AMIA Symp. 2002; :17-21. PMC: 2243666. View

5.
Pustejovsky J, Castano J, Zhang J, Kotecki M, Cochran B . Robust relational parsing over biomedical literature: extracting inhibit relations. Pac Symp Biocomput. 2002; :362-73. DOI: 10.1142/9789812799623_0034. View