» Articles » PMID: 32935028

Privacy Preserving Linkage Using Multiple Match-keys

Overview
Specialty Public Health
Date 2020 Sep 16
PMID 32935028
Citations 7
Authors
Affiliations
Soon will be listed here.
Abstract

Introduction: Available and practical methods for privacy preserving linkage have shortcomings: methods utilising anonymous linkage codes provide limited accuracy while methods based on Bloom filters have proven vulnerable to frequency-based attacks.

Objectives: In this paper, we present and evaluate a novel protocol that aims to meld both the accuracy of the Bloom filter method with the privacy achievable through the anonymous linkage code methodology.

Methods: The protocol involves creating multiple match-keys for each record, with the composition of each match-key depending on attributes of the underlying datasets being compared. The protocol was evaluated through de-duplication of four administrative datasets and two synthetic datasets; the 'answers' outlining which records belonged to the same individual were known for each dataset. The results were compared against results achieved with un-encoded linkage and other privacy preserving techniques on the same datasets.

Results: The multiple match-key protocol presented here achieved high quality across all datasets, performing better than record-level Bloom filters and the SLK, but worse than field-level Bloom filters.

Conclusion: The presented method provides high linkage quality while avoiding the frequency based attacks that have been demonstrated against the Bloom filter approach. The method appears promising for real world use.

Citing Articles

Microsimulation of an educational attainment register to predict future record linkage quality.

Schnell R, Weiand S Int J Popul Data Sci. 2023; 8(1):2122.

PMID: 37649490 PMC: 10463005. DOI: 10.23889/ijpds.v8i1.2122.


Validating a novel deterministic privacy-preserving record linkage between administrative & clinical data: applications in stroke research.

Southwell A, Bronskill S, Gee T, Behan B, Gilbert Evans S, Mikkelsen T Int J Popul Data Sci. 2023; 7(4):1755.

PMID: 37152407 PMC: 10161965. DOI: 10.23889/ijpds.v7i4.1755.


Unlocking Potential within Health Systems Using Privacy-Preserving Record Linkage: Exploring Chronic Kidney Disease Outcomes through Linked Data Modelling.

Lim D, Randall S, Robinson S, Thomas E, Williamson J, Chakera A Appl Clin Inform. 2022; 13(4):901-909.

PMID: 36170880 PMC: 9519263. DOI: 10.1055/s-0042-1757174.


AUS-TBI: The Australian Health Informatics Approach to Predict Outcomes and Monitor Intervention Efficacy after Moderate-to-Severe Traumatic Brain Injury.

Fitzgerald M, Ponsford J, Lannin N, OBrien T, Cameron P, Cooper D Neurotrauma Rep. 2022; 3(1):217-223.

PMID: 35919508 PMC: 9279124. DOI: 10.1089/neur.2022.0002.


A methodological assessment of privacy preserving record linkage using survey and administrative data.

Mirel L, Resnick D, Aram J, Cox C Stat J IAOS. 2022; 38(2):413-421.

PMID: 35910693 PMC: 9335262. DOI: 10.3233/sji-210891.


References
1.
Weber S, Lowe H, Das A, Ferris T . A simple heuristic for blindfolded record linkage. J Am Med Inform Assoc. 2012; 19(e1):e157-61. PMC: 3392854. DOI: 10.1136/amiajnl-2011-000329. View

2.
Brown A, Borgs C, Randall S, Schnell R . Evaluating privacy-preserving record linkage using cryptographic long-term keys and multibit trees on large medical datasets. BMC Med Inform Decis Mak. 2017; 17(1):83. PMC: 5465525. DOI: 10.1186/s12911-017-0478-5. View

3.
Randall S, Ferrante A, Boyd J, Bauer J, Semmens J . Privacy-preserving record linkage on large real world datasets. J Biomed Inform. 2013; 50:205-12. DOI: 10.1016/j.jbi.2013.12.003. View

4.
Ferrante A, Boyd J . A transparent and transportable methodology for evaluating Data Linkage software. J Biomed Inform. 2011; 45(1):165-72. DOI: 10.1016/j.jbi.2011.10.006. View

5.
Newcombe H, Kennedy J, Axford S, James A . Automatic linkage of vital records. Science. 1959; 130(3381):954-9. DOI: 10.1126/science.130.3381.954. View