
A Probabilistic Similarity Metric for Medline Records: a Model for Author Name Disambiguation

Overview
Date 2004 Jan 20
PMID 14728536
Citations 24
Abstract

We present a model for automatically generating training sets and estimating the probability that two Medline records sharing the same last name and first initial were authored by the same individual. The estimate is based on shared title words, journal name, co-authors, medical subject headings, language, and affiliation, as well as distinctive features of the name itself (i.e., presence of a middle initial, suffix, and prevalence in Medline).
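
To make the idea of a pairwise probability concrete, here is a minimal Python sketch. It is not the authors' model. It assumes a naive-Bayes-style combination in which each compared feature contributes a likelihood ratio to the prior odds that two records share an author. The `Record` fields, the function names, and every number in `ILLUSTRATIVE_RATIOS` are hypothetical and chosen only for illustration; nothing here is estimated from Medline.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Record:
    """Hypothetical, simplified Medline record (field names are assumptions)."""
    title_words: frozenset
    journal: str
    coauthors: frozenset
    mesh: frozenset
    language: str
    affiliation_words: frozenset
    middle_initial: str = ""


def similarity_profile(a: Record, b: Record) -> dict:
    """Compare two records feature by feature, yielding one boolean per feature."""
    return {
        "shares_title_word": bool(a.title_words & b.title_words),
        "same_journal": a.journal == b.journal,
        "shares_coauthor": bool(a.coauthors & b.coauthors),
        "shares_mesh": bool(a.mesh & b.mesh),
        "same_language": a.language == b.language,
        "shares_affiliation_word": bool(a.affiliation_words & b.affiliation_words),
        "middle_initials_agree": bool(a.middle_initial)
        and a.middle_initial == b.middle_initial,
    }


# Invented likelihood ratios P(value | same author) / P(value | different authors),
# given as (ratio if the feature is True, ratio if it is False).
ILLUSTRATIVE_RATIOS = {
    "shares_title_word": (3.0, 0.6),
    "same_journal": (8.0, 0.7),
    "shares_coauthor": (50.0, 0.5),
    "shares_mesh": (2.0, 0.5),
    "same_language": (1.1, 0.3),
    "shares_affiliation_word": (10.0, 0.8),
    "middle_initials_agree": (5.0, 0.4),
}


def match_probability(profile: dict, prior: float) -> float:
    """Turn a similarity profile into a posterior probability of a shared author.

    `prior` is the assumed probability of a match before looking at any
    feature; it must lie strictly between 0 and 1.
    """
    if not 0.0 < prior < 1.0:
        raise ValueError("prior must be strictly between 0 and 1")
    odds = prior / (1.0 - prior)
    for name, value in profile.items():
        if_true, if_false = ILLUSTRATIVE_RATIOS[name]
        odds *= if_true if value else if_false
    return odds / (1.0 + odds)
```

In this sketch the prior is supplied by the caller. One plausible design choice, stated here as an assumption, is to set it from how common the name is, so that a rare name starts with a higher prior than a very common one.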

Citing Articles

Slow convergence: Career impediments to interdisciplinary biomedical research.

Berkes E, Marion M, Milojevic S, Weinberg B. Proc Natl Acad Sci U S A. 2024; 121(32):e2402646121.

PMID: 39074264 PMC: 11317606. DOI: 10.1073/pnas.2402646121.


Bridging the gap in author names: building an enhanced author name dataset for biomedical literature system.

Zhang L, Song N, Gui S, Wu K, Lu W. J Am Med Inform Assoc. 2024; 31(8):1648-1656.

PMID: 38916911 PMC: 11258411. DOI: 10.1093/jamia/ocae127.


Notes on the data quality of bibliographic records from the MEDLINE database.

Bramley R, Howe S, Marmanis H. Database (Oxford). 2023; 2023.

PMID: 37935584 PMC: 10630407. DOI: 10.1093/database/baad070.


Exploring high scientific productivity in international co-authorship of a small developing country based on collaboration patterns.

Mitrovic I, Misic M, Protic J. J Big Data. 2023; 10(1):64.

PMID: 37215244 PMC: 10184642. DOI: 10.1186/s40537-023-00744-1.


Scientific rewards for biomedical specialization are large and persistent.

de Rassenfosse G, Higham K, Penner O. BMC Biol. 2022; 20(1):211.

PMID: 36175953 PMC: 9524129. DOI: 10.1186/s12915-022-01400-5.