» Articles » PMID: 31307134

Towards Cross-platform Interoperability for Machine-assisted Text Annotation

Overview
Journal Genomics Inform
Publisher Biomed Central
Specialty Biology
Date 2019 Jul 16
PMID 31307134
Citations 2
Authors
Affiliations
Soon will be listed here.
Abstract

In this paper we investigate cross-platform interoperability for natural language processing (NLP) and, in particular, annotation of textual resources, with an eye toward identifying the design elements of annotation models and processes that are particularly problematic for, or amenable to, enabling seamless communication across different platforms. The study is conducted in the context of a specific annotation methodology, namely machine-assisted interactive annotation (also known as human-in-the-loop annotation). This methodology requires the ability to freely combine resources from different document repositories, access a wide array of NLP tools that automatically annotate corpora for various linguistic phenomena, and use a sophisticated annotation editor that enables interactive manual annotation coupled with on-the-fly machine learning. We consider three independently developed platforms, each of which utilizes a different model for representing annotations over text, and each of which performs a different role in the process.

Citing Articles

MedTAG: a portable and customizable annotation tool for biomedical documents.

Giachelle F, Irrera O, Silvello G BMC Med Inform Decis Mak. 2021; 21(1):352.

PMID: 34922517 PMC: 8684237. DOI: 10.1186/s12911-021-01706-4.


Markup: A Web-Based Annotation Tool Powered by Active Learning.

Dobbie S, Strafford H, Pickrell W, Fonferko-Shadrach B, Jones C, Akbari A Front Digit Health. 2021; 3:598916.

PMID: 34713086 PMC: 8521860. DOI: 10.3389/fdgth.2021.598916.

References
1.
Savova G, Masanz J, Ogren P, Zheng J, Sohn S, Kipper-Schuler K . Mayo clinical Text Analysis and Knowledge Extraction System (cTAKES): architecture, component evaluation and applications. J Am Med Inform Assoc. 2010; 17(5):507-13. PMC: 2995668. DOI: 10.1136/jamia.2009.001560. View

2.
Cunningham H, Tablan V, Roberts A, Bontcheva K . Getting more out of biomedical documents with GATE's full lifecycle open source text analytics. PLoS Comput Biol. 2013; 9(2):e1002854. PMC: 3567135. DOI: 10.1371/journal.pcbi.1002854. View

3.
Furrer L, Jancso A, Colic N, Rinaldi F . OGER++: hybrid multi-type entity recognition. J Cheminform. 2019; 11(1):7. PMC: 6689863. DOI: 10.1186/s13321-018-0326-3. View