
Identifying Possible False Matches in Anonymized Hospital Administrative Data Without Patient Identifiers

Overview
Journal: Health Serv Res
Specialty: Health Services
Date: 2014 Dec 20
PMID: 25523215
Citations: 18
Abstract

Objective: To identify data linkage errors in the form of possible false matches, where two patients appear to share the same unique identification number.

Data Source: Hospital Episode Statistics (HES) in England, United Kingdom.

Study Design: Data on births and re-admissions for infants (April 1, 2011 to March 31, 2012; age 0-1 year) and adolescents (April 1, 2004 to March 31, 2011; age 10-19 years).

Data Collection/Extraction Methods: Hospital records were pseudo-anonymized using an algorithm designed to link multiple records belonging to the same person. Six implausible clinical scenarios were considered possible false matches: multiple births sharing the same HESID, re-admission after death, two birth episodes sharing the same HESID, simultaneous admission at different hospitals, infant episodes coded as deliveries, and adolescent episodes coded as births.
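As an illustration only, two of the six scenarios (simultaneous admission at different hospitals and re-admission after death) could be flagged at the data cleaning stage along the following lines. This is a minimal sketch and not the authors' implementation: the `Episode` record, its field names, and the function `flag_possible_false_matches` are hypothetical, and treating a shared boundary day as a transfer rather than an overlap is an assumption of this sketch, not a rule stated in the abstract.

```python
from collections import defaultdict
from dataclasses import dataclass
from datetime import date
from itertools import combinations
from typing import Optional


@dataclass(frozen=True)
class Episode:
    """One hospital episode. All field names are hypothetical."""
    hesid: str                       # pseudo-anonymized patient identifier
    hospital: str                    # provider code
    admitted: date
    discharged: date
    died_on: Optional[date] = None   # date of death, if recorded on this episode


def overlapping(a: Episode, b: Episode) -> bool:
    # Strict overlap: discharge and admission on the same day is assumed
    # to be a transfer between hospitals, so it is not counted.
    return a.admitted < b.discharged and b.admitted < a.discharged


def flag_possible_false_matches(episodes):
    """Return {hesid: set of scenario names} for identifiers with implausible histories."""
    by_id = defaultdict(list)
    for ep in episodes:
        by_id[ep.hesid].append(ep)

    flags = defaultdict(set)
    for hesid, eps in by_id.items():
        # Scenario: simultaneous admission at different hospitals.
        for a, b in combinations(eps, 2):
            if a.hospital != b.hospital and overlapping(a, b):
                flags[hesid].add("simultaneous_admission")

        # Scenario: re-admission after a recorded death.
        deaths = [ep.died_on for ep in eps if ep.died_on is not None]
        if deaths and any(ep.admitted > min(deaths) for ep in eps):
            flags[hesid].add("readmission_after_death")
    return dict(flags)


# Made-up example records, for illustration only.
episodes = [
    Episode("A", "H1", date(2011, 5, 1), date(2011, 5, 10)),
    Episode("A", "H2", date(2011, 5, 3), date(2011, 5, 6)),   # overlaps the H1 stay
    Episode("B", "H1", date(2011, 6, 1), date(2011, 6, 2), died_on=date(2011, 6, 2)),
    Episode("B", "H3", date(2011, 7, 1), date(2011, 7, 4)),   # admitted after death
    Episode("C", "H1", date(2011, 8, 1), date(2011, 8, 3)),
    Episode("C", "H2", date(2011, 8, 3), date(2011, 8, 5)),   # same-day transfer
]
flags = flag_possible_false_matches(episodes)
```

In this made-up data, identifier "A" is flagged for simultaneous admission and "B" for re-admission after death, while "C" is left unflagged because its two stays only meet on a boundary day. A flagged identifier is a candidate for review, not proof of a false match.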

Principal Findings: Among 507,778 infants, possible false matches were relatively rare (n = 433, 0.1 percent). The most common scenario (simultaneous admission at two hospitals, n = 324) was more likely for infants with missing data, those born preterm, and for Asian infants. Among adolescents, this scenario (n = 320) was more common for males, younger patients, the Mixed ethnic group, and those re-admitted more frequently.

Conclusions: Researchers can identify clinically implausible scenarios and patients affected, at the data cleaning stage, to mitigate the impact of possible linkage errors.

Citing Articles

Virtual patient identifier (vPID): Improving patient traceability using anonymized identifiers in Japanese healthcare insurance claims database.

Sato J, Mitsutake N, Yamada H, Kitsuregawa M, Goda K Heliyon. 2023; 9(5):e16209.

PMID: 37234615 PMC: 10205637. DOI: 10.1016/j.heliyon.2023.e16209.


Linking electronic mental healthcare and benefits records in South London: design, procedure and descriptive outcomes.

Stevelink S, Phillips A, Broadbent M, Boyd A, Dorrington S, Jewell A BMJ Open. 2023; 13(2):e067136.

PMID: 36792321 PMC: 9950921. DOI: 10.1136/bmjopen-2022-067136.


Biases arising from linked administrative data for epidemiological research: a conceptual framework from registration to analyses.

Shaw R, Harron K, Pescarini J, Pinto Junior E, Allik M, Siroky A Eur J Epidemiol. 2022; 37(12):1215-1224.

PMID: 36333542 PMC: 9792414. DOI: 10.1007/s10654-022-00934-w.


Developing a national birth cohort for child health research using a hospital admissions database in England: The impact of changes to data collection practices.

Zylbersztejn A, Gilbert R, Hardelid P PLoS One. 2020; 15(12):e0243843.

PMID: 33320878 PMC: 7737962. DOI: 10.1371/journal.pone.0243843.


Linkage of maternity hospital episode statistics birth records to birth registration and notification records for births in England 2005-2006: quality assurance of linkage.

Coathup V, Macfarlane A, Quigley M BMJ Open. 2020; 10(10):e037885.

PMID: 33109650 PMC: 7592278. DOI: 10.1136/bmjopen-2020-037885.

