» Articles » PMID: 12603047

A Biological Named Entity Recognizer

Overview
Publisher World Scientific
Specialty Biology
Date 2003 Feb 27
PMID 12603047
Citations 27
Authors
Affiliations
Soon will be listed here.
Abstract

In this paper we describe a new named entity extraction system. Our system is based on a manually developed set of rules that rely heavily upon some crucial lexical information, linguistic constraints of English, and contextual information. This system achieves state of art results in the protein name detection task, which is what many of the current name extraction systems do. We discuss the need for detection of chemical names and show that we not only obtain a high degree of success in recognizing chemicals but that this task can help improve the precision of protein name detection as well. We use context and surrounding words for categorization of named entities and find the results obtained are encouraging.

Citing Articles

How Do Your Biomedical Named Entity Recognition Models Generalize to Novel Entities?.

Kim H, Kang J IEEE Access. 2022; 10:31513-31523.

PMID: 35582496 PMC: 9014470. DOI: 10.1109/ACCESS.2022.3157854.


ChEMU 2020: Natural Language Processing Methods Are Effective for Information Extraction From Chemical Patents.

He J, Nguyen D, Akhondi S, Druckenbrodt C, Thorne C, Hoessel R Front Res Metr Anal. 2021; 6:654438.

PMID: 33870071 PMC: 8028406. DOI: 10.3389/frma.2021.654438.


Discovery of disease- and drug-specific pathways through community structures of a literature network.

Pham M, Wilson S, Govindarajan H, Lin C, Lichtarge O Bioinformatics. 2019; 36(6):1881-1888.

PMID: 31738408 PMC: 7103064. DOI: 10.1093/bioinformatics/btz857.


OGER++: hybrid multi-type entity recognition.

Furrer L, Jancso A, Colic N, Rinaldi F J Cheminform. 2019; 11(1):7.

PMID: 30666476 PMC: 6689863. DOI: 10.1186/s13321-018-0326-3.


Automatic gene annotation using GO terms from cellular component domain.

Ding R, Qu Y, Wu C, Vijay-Shanker K BMC Med Inform Decis Mak. 2018; 18(Suppl 5):119.

PMID: 30526566 PMC: 6284271. DOI: 10.1186/s12911-018-0694-7.