» Articles » PMID: 32134684

Systematic Review of Privacy-Preserving Distributed Machine Learning From Federated Databases in Health Care

Overview
Date 2020 Mar 6
PMID 32134684
Citations 32
Authors
Affiliations
Soon will be listed here.
Abstract

Big data for health care is one of the potential solutions to deal with the numerous challenges of health care, such as rising cost, aging population, precision medicine, universal health coverage, and the increase of noncommunicable diseases. However, data centralization for big data raises privacy and regulatory concerns.Covered topics include (1) an introduction to privacy of patient data and distributed learning as a potential solution to preserving these data, a description of the legal context for patient data research, and a definition of machine/deep learning concepts; (2) a presentation of the adopted review protocol; (3) a presentation of the search results; and (4) a discussion of the findings, limitations of the review, and future perspectives.Distributed learning from federated databases makes data centralization unnecessary. Distributed algorithms iteratively analyze separate databases, essentially sharing research questions and answers between databases instead of sharing the data. In other words, one can learn from separate and isolated datasets without patient data ever leaving the individual clinical institutes.Distributed learning promises great potential to facilitate big data for medical application, in particular for international consortiums. Our purpose is to review the major implementations of distributed learning in health care.

Citing Articles

A privacy-preserving dependable deep federated learning model for identifying new infections from genome sequences.

Mehedi S, Abdulrazak L, Ahmed K, Uddin M, Bui F, Chen L Sci Rep. 2025; 15(1):7291.

PMID: 40025035 PMC: 11873272. DOI: 10.1038/s41598-025-89612-x.


Consumer opinion on the use of machine learning in healthcare settings: A qualitative systematic review.

Stephens J, Northcott C, Poirier B, Lewis T Digit Health. 2025; 11():20552076241288631.

PMID: 39777065 PMC: 11705357. DOI: 10.1177/20552076241288631.


WebQuorumChain: A web framework for quorum-based health care model learning.

Shao X, Pham A, Kuo T Inform Med Unlocked. 2024; 50.

PMID: 39483487 PMC: 11526443. DOI: 10.1016/j.imu.2024.101590.


Advancing healthcare through data: the BETTER project's vision for distributed analytics.

Bregonzio M, Bernasconi A, Pinoli P Front Med (Lausanne). 2024; 11:1473874.

PMID: 39416867 PMC: 11480012. DOI: 10.3389/fmed.2024.1473874.


Federated Abnormal Heart Sound Detection with Weak to No Labels.

Qiu W, Quan C, Yu Y, Kara E, Qian K, Hu B Cyborg Bionic Syst. 2024; 5:0152.

PMID: 39257898 PMC: 11382922. DOI: 10.34133/cbsystems.0152.


References
1.
Polanin J, Terzian M . A data-sharing agreement helps to increase researchers' willingness to share primary data: results from a randomized controlled trial. J Clin Epidemiol. 2018; 106():60-69. DOI: 10.1016/j.jclinepi.2018.10.006. View

2.
Tagliaferri L, Gobitti C, Colloca G, Boldrini L, Farina E, Furlan C . A new standardized data collection system for interdisciplinary thyroid cancer management: Thyroid COBRA. Eur J Intern Med. 2018; 53:73-78. DOI: 10.1016/j.ejim.2018.02.012. View

3.
Ing E, Ing R . The Use of a Nomogram to Visually Interpret a Logistic Regression Prediction Model for Giant Cell Arteritis. Neuroophthalmology. 2018; 42(5):284-286. PMC: 6152514. DOI: 10.1080/01658107.2018.1425728. View

4.
Wilkinson M, Dumontier M, Aalbersberg I, Appleton G, Axton M, Baak A . The FAIR Guiding Principles for scientific data management and stewardship. Sci Data. 2016; 3:160018. PMC: 4792175. DOI: 10.1038/sdata.2016.18. View

5.
Deist T, Jochems A, Soest J, Nalbantov G, Oberije C, Walsh S . Infrastructure and distributed learning methodology for privacy-preserving multi-centric rapid learning health care: euroCAT. Clin Transl Radiat Oncol. 2018; 4:24-31. PMC: 5833935. DOI: 10.1016/j.ctro.2016.12.004. View