Predicting Acute Graft-Versus-Host Disease Using Machine Learning and Longitudinal Vital Sign Data From Electronic Health Records
Overview
Affiliations
Purpose: Acute graft-versus-host disease (aGVHD) remains a significant complication of allogeneic hematopoietic cell transplantation (HCT) and limits its broader application. The ability to predict grade II to IV aGVHD could potentially mitigate morbidity and mortality. To date, researchers have focused on using snapshots of a patient (eg, biomarkers at a single time point) to predict aGVHD onset. We hypothesized that longitudinal data collected and stored in electronic health records (EHRs) could distinguish patients at high risk of developing aGVHD from those at low risk.
Patients And Methods: The study included a cohort of 324 patients undergoing allogeneic HCT at the University of Michigan C.S. Mott Children's Hospital during 2014 to 2017. Using EHR data, specifically vital sign measurements collected within the first 10 days of transplantation, we built a predictive model using penalized logistic regression for identifying patients at risk for grade II to IV aGVHD. We compared the proposed model with a baseline model trained only on patient and donor characteristics collected at the time of transplantation and performed an analysis of the importance of different input features.
Results: The proposed model outperformed the baseline model, with an area under the receiver operating characteristic curve of 0.659 versus 0.512 ( = .019). The feature importance analysis showed that the learned model relied most on temperature and systolic blood pressure, and temporal trends (eg, increasing or decreasing) were more important than the average values.
Conclusion: Leveraging readily available clinical data from EHRs, we developed a machine-learning model for aGVHD prediction in patients undergoing HCT. Continuous monitoring of vital signs, such as temperature, could potentially help clinicians more accurately identify patients at high risk for aGVHD.
Shen Y, Yu J, Zhou J, Hu G J Med Internet Res. 2025; 27:e59024.
PMID: 39787599 PMC: 11757985. DOI: 10.2196/59024.
Xiao H, Yang G, Huang Q, Wei Z, Gan Z, Wu M Ther Adv Hematol. 2024; 15:20406207241294054.
PMID: 39664034 PMC: 11632902. DOI: 10.1177/20406207241294054.
Elaborating the potential of Artificial Intelligence in automated CAR-T cell manufacturing.
Backel N, Hort S, Kis T, Nettleton D, Egan J, Jacobs J Front Mol Med. 2024; 3:1250508.
PMID: 39086671 PMC: 11285580. DOI: 10.3389/fmmed.2023.1250508.
Chandra G, Wang J, Siirtola P, Roning J BMC Med Res Methodol. 2024; 24(1):112.
PMID: 38734644 PMC: 11088760. DOI: 10.1186/s12874-024-02237-y.
Prediction of hospital-acquired influenza using machine learning algorithms: a comparative study.
Cho Y, Lee H, Kim J, Yoo K, Choi J, Lee Y BMC Infect Dis. 2024; 24(1):466.
PMID: 38698304 PMC: 11067145. DOI: 10.1186/s12879-024-09358-1.