» Articles » PMID: 11825149

Effective Mapping of Biomedical Text to the UMLS Metathesaurus: the MetaMap Program

Overview
Journal Proc AMIA Symp
Date 2002 Feb 5
PMID 11825149
Citations 708
Authors
Affiliations
Soon will be listed here.
Abstract

The UMLS Metathesaurus, the largest thesaurus in the biomedical domain, provides a representation of biomedical knowledge consisting of concepts classified by semantic type and both hierarchical and non-hierarchical relationships among the concepts. This knowledge has proved useful for many applications including decision support systems, management of patient records, information retrieval (IR) and data mining. Gaining effective access to the knowledge is critical to the success of these applications. This paper describes MetaMap, a program developed at the National Library of Medicine (NLM) to map biomedical text to the Metathesaurus or, equivalently, to discover Metathesaurus concepts referred to in text. MetaMap uses a knowledge intensive approach based on symbolic, natural language processing (NLP) and computational linguistic techniques. Besides being applied for both IR and data mining applications, MetaMap is one of the foundations of NLM's Indexing Initiative System which is being applied to both semi-automatic and fully automatic indexing of the biomedical literature at the library.

Citing Articles

Improving topic modeling performance on social media through semantic relationships within biomedical terminology.

Xin Y, Grabowska M, Gangireddy S, Krantz M, Kerchberger V, Dickson A PLoS One. 2025; 20(2):e0318702.

PMID: 39982945 PMC: 11845042. DOI: 10.1371/journal.pone.0318702.


Integrating AI-powered text mining from PubTator into the manual curation workflow at the Comparative Toxicogenomics Database.

Wiegers T, Davis A, Wiegers J, Sciaky D, Barkalow F, Wyatt B Database (Oxford). 2025; 2025.

PMID: 39982792 PMC: 11844237. DOI: 10.1093/database/baaf013.


Development and Validation of Natural Language Processing Algorithms in the ENACT National Electronic Health Record Research Network.

Wang Y, Hilsman J, Li C, Morris M, Heider P, Fu S medRxiv. 2025; .

PMID: 39974073 PMC: 11839006. DOI: 10.1101/2025.01.24.25321096.


Research on adverse event classification algorithm of da Vinci surgical robot based on Bert-BiLSTM model.

Li T, Zhu W, Xia W, Wang L, Li W, Zhang P Front Comput Neurosci. 2024; 18:1476164.

PMID: 39737445 PMC: 11682881. DOI: 10.3389/fncom.2024.1476164.


Identifying Symptoms of Delirium from Clinical Narratives Using Natural Language Processing.

Chen A, Paredes D, Yu Z, Lou X, Brunson R, Thomas J Proc (IEEE Int Conf Healthc Inform). 2024; 2024:305-311.

PMID: 39726986 PMC: 11670120. DOI: 10.1109/ichi61247.2024.00046.


References
1.
Rindflesch T, Hunter L, Aronson A . Mining molecular binding terminology from biomedical text. Proc AMIA Symp. 1999; :127-31. PMC: 2232739. View

2.
Wilbur W, Hazard Jr G, Divita G, Mork J, Aronson A, Browne A . Analysis of biomedical text for chemical names: a comparison of three methods. Proc AMIA Symp. 1999; :176-80. PMC: 2232672. View

3.
Rindflesch T, Tanabe L, Weinstein J, Hunter L . EDGAR: extraction of drugs, genes and relations from the biomedical literature. Pac Symp Biocomput. 2000; :517-28. PMC: 2709525. DOI: 10.1142/9789814447331_0049. View

4.
Nadkarni P, Chen R, Brandt C . UMLS concept indexing for production databases: a feasibility study. J Am Med Inform Assoc. 2001; 8(1):80-91. PMC: 134593. DOI: 10.1136/jamia.2001.0080080. View

5.
Aronson A, Bodenreider O, Chang H, Humphrey S, Mork J, Nelson S . The NLM Indexing Initiative. Proc AMIA Symp. 2000; :17-21. PMC: 2243970. View