FHBF: Federated Hybrid Boosted Forests with Dropout Rates for Supervised Learning Tasks Across Highly Imbalanced Clinical Datasets

Overview
Journal Patterns (N Y)
Date 2024 Jan 24
PMID 38264722
Abstract

Several studies have deployed gradient boosting trees (GBT) as a robust classifier for federated learning tasks (federated GBT [FGBT]), including variants with dropout rates (federated gradient boosting trees with dropout rate [FDART]). However, none has investigated the overfitting effects of FGBT across heterogeneous and highly imbalanced datasets within federated environments, nor the effect of dropouts in the loss function. In this work, we present the federated hybrid boosted forests (FHBF) algorithm, which incorporates a hybrid weight update approach to overcome the ill-posed problems that arise from overfitting during training across highly imbalanced datasets in the cloud. Eight case studies were conducted to stress-test the performance of FHBF against existing algorithms toward the development of robust AI models for lymphoma development across 18 European federated databases. Our results highlight the robustness of FHBF, which yielded an average loss of 0.527, lower than that of FGBT (0.611) and FDART (0.584), along with increased classification performance (0.938 sensitivity, 0.732 specificity).
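The abstract reports sensitivity and specificity rather than a single accuracy figure. The minimal sketch below illustrates why that matters on imbalanced data. It is not the FHBF implementation: the function name `sensitivity_specificity` and the toy cohort are hypothetical and chosen only for illustration.

```python
def sensitivity_specificity(y_true, y_pred):
    """Return (sensitivity, specificity) for binary labels coded as 0/1."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)  # true positives
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)  # false negatives
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)  # true negatives
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)  # false positives
    # Guard against a class that is absent from y_true.
    sensitivity = tp / (tp + fn) if (tp + fn) else float("nan")
    specificity = tn / (tn + fp) if (tn + fp) else float("nan")
    return sensitivity, specificity


# Hypothetical toy cohort (not data from the study): 2 positives in 10 patients.
y_true = [1, 1, 0, 0, 0, 0, 0, 0, 0, 0]

# A degenerate classifier that always predicts the majority (negative) class.
always_negative = [0] * len(y_true)

accuracy = sum(t == p for t, p in zip(y_true, always_negative)) / len(y_true)
sens, spec = sensitivity_specificity(y_true, always_negative)

print(f"accuracy={accuracy:.2f} sensitivity={sens:.2f} specificity={spec:.2f}")
```

In this toy case the degenerate classifier scores 0.80 accuracy while detecting none of the positive cases, which is why the two class-wise rates are more informative when the classes are highly imbalanced.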
