» Articles » PMID: 36572246

Machine Learning Does Not Outperform Traditional Statistical Modelling for Kidney Allograft Failure Prediction

Abstract

Machine learning (ML) models have recently shown potential for predicting kidney allograft outcomes. However, their ability to outperform traditional approaches remains poorly investigated. Therefore, using large cohorts of kidney transplant recipients from 14 centers worldwide, we developed ML-based prediction models for kidney allograft survival and compared their prediction performances to those achieved by a validated Cox-Based Prognostication System (CBPS). In a French derivation cohort of 4000 patients, candidate determinants of allograft failure including donor, recipient and transplant-related parameters were used as predictors to develop tree-based models (RSF, RSF-ERT, CIF), Support Vector Machine models (LK-SVM, AK-SVM) and a gradient boosting model (XGBoost). Models were externally validated with cohorts of 2214 patients from Europe, 1537 from North America, and 671 from South America. Among these 8422 kidney transplant recipients, 1081 (12.84%) lost their grafts after a median post-transplant follow-up time of 6.25 years (Inter Quartile Range 4.33-8.73). At seven years post-risk evaluation, the ML models achieved a C-index of 0.788 (95% bootstrap percentile confidence interval 0.736-0.833), 0.779 (0.724-0.825), 0.786 (0.735-0.832), 0.527 (0.456-0.602), 0.704 (0.648-0.759) and 0.767 (0.711-0.815) for RSF, RSF-ERT, CIF, LK-SVM, AK-SVM and XGBoost respectively, compared with 0.808 (0.792-0.829) for the CBPS. In validation cohorts, ML models' discrimination performances were in a similar range of those of the CBPS. Calibrations of the ML models were similar or less accurate than those of the CBPS. Thus, when using a transparent methodological pipeline in validated international cohorts, ML models, despite overall good performances, do not outperform a traditional CBPS in predicting kidney allograft failure. Hence, our current study supports the continued use of traditional statistical approaches for kidney graft prognostication.

Citing Articles

Advancing risk stratification in kidney transplantation: integrating HLA-derived T-cell epitope and B-cell epitope matching algorithms for enhanced predictive accuracy of HLA compatibility.

Niemann M, Matern B, Gupta G, Tanriover B, Halleck F, Budde K Front Immunol. 2025; 16:1548934.

PMID: 40007544 PMC: 11850546. DOI: 10.3389/fimmu.2025.1548934.


Advancements in Artificial Intelligence for Kidney Transplantology: A Comprehensive Review of Current Applications and Predictive Models.

Mizera J, Pondel M, Kepinska M, Jerzak P, Banasik M J Clin Med. 2025; 14(3).

PMID: 39941645 PMC: 11818595. DOI: 10.3390/jcm14030975.


Enhancing individual glomerular filtration rate assessment: can we trust the equation? Development and validation of machine learning models to assess the trustworthiness of estimated GFR compared to measured GFR.

Lanot A, Akesson A, Nakano F, Vens C, Bjork J, Nyman U BMC Nephrol. 2025; 26(1):47.

PMID: 39885391 PMC: 11780799. DOI: 10.1186/s12882-025-03972-0.


Predicting graft survival in paediatric kidney transplant recipients using machine learning.

Aksoy G, Akcay H, Ari C, Adar M, Koyun M, Comak E Pediatr Nephrol. 2024; 40(1):203-211.

PMID: 39150523 DOI: 10.1007/s00467-024-06484-5.


The transformative potential of artificial intelligence in solid organ transplantation.

Moussawy M, Lakkis Z, Ansari Z, Cherukuri A, Abou-Daya K Front Transplant. 2024; 3:1361491.

PMID: 38993779 PMC: 11235281. DOI: 10.3389/frtra.2024.1361491.