» Articles » PMID: 39456622

AI Survival Prediction Modeling: The Importance of Considering Treatments and Changes in Health Status over Time

Overview
Journal Cancers (Basel)
Publisher MDPI
Specialty Oncology
Date 2024 Oct 26
PMID 39456622
Authors
Affiliations
Soon will be listed here.
Abstract

Background And Objectives: Deep learning (DL)-based models for predicting the survival of patients with local stages of breast cancer only use time-fixed covariates, i.e., patient and cancer data at the time of diagnosis. These predictions are inherently error-prone because they do not consider time-varying events that occur after initial diagnosis. Our objective is to improve the predictive modeling of survival of patients with localized breast cancer to consider both time-fixed and time-varying events; thus, we take into account the progression of a patient's health status over time.

Methods: We extended four DL-based predictive survival models (DeepSurv, DeepHit, Nnet-survival, and Cox-Time) that deal with right-censored time-to-event data to consider not only a patient's time-fixed covariates (patient and cancer data at diagnosis) but also a patient's time-varying covariates (e.g., treatments, comorbidities, progressive age, frailty index, adverse events from treatment). We utilized, as our study data, the SEER-Medicare linked dataset from 1991 to 2016 to study a population of women diagnosed with stage I-III breast cancer (BC) enrolled in Medicare at 65 years or older as qualified by age. We delineated time-fixed variables recorded at the time of diagnosis, including age, race, marital status, breast cancer stage, tumor grade, laterality, estrogen receptor (ER), progesterone receptor (PR), and human epidermal receptor 2 (HER2) status, and comorbidity index. We analyzed six distinct prognostic categories, cancer stages I-III BC, and each stage's ER/PR+ or ER/PR- status. At each visit, we delineated the time-varying covariates of administered treatments, induced adverse events, comorbidity index, and age. We predicted the survival of three hypothetical patients to demonstrate the model's utility.

Main Outcomes And Measures: The primary outcomes of the modeling were the measures of the model's prediction error, as measured by the concordance index, the most commonly applied evaluation metric in survival analysis, and the integrated Brier score, a metric of the model's discrimination and calibration.

Results: The proposed extended patients' covariates that include both time-fixed and time-varying covariates significantly improved the deep learning models' prediction error and the discrimination and calibration of a model's estimates. The prediction of the four DL models using time-fixed covariates in six different prognostic categories all resulted in approximately a 30% error in all six categories. When applying the proposed extension to include time-varying covariates, the accuracy of all four predictive models improved significantly, with the error decreasing to approximately 10%. The models' predictive accuracy was independent of the differing published survival predictions from time-fixed covariates in the six prognostic categories. We demonstrate the utility of the model in three hypothetical patients with unique patient, cancer, and treatment variables. The model predicted survival based on the patient's individual time-fixed and time-varying features, which varied considerably from Social Security age-based, and stage and race-based breast cancer survival predictions.

Conclusions: The predictive modeling of the survival of patients with early-stage breast cancer using DL models has a prediction error of around 30% when considering only time-fixed covariates at the time of diagnosis and decreases to values under 10% when time-varying covariates are added as input to the models, regardless of the prognostic category of the patient groups. These models can be used to predict individual patients' survival probabilities based on their unique repertoire of time-fixed and time-varying features. They will provide guidance for patients and their caregivers to assist in decision making.

Citing Articles

Predictive Modeling of Long-Term Survivors with Stage IV Breast Cancer Using the SEER-Medicare Dataset.

Adam N, Wieder R Cancers (Basel). 2024; 16(23).

PMID: 39682219 PMC: 11640017. DOI: 10.3390/cancers16234033.

References
1.
Freid V, Bernstein A, Bush M . Multiple chronic conditions among adults aged 45 and over: trends over the past 10 years. NCHS Data Brief. 2012; (100):1-8. View

2.
Schoenborn N, Blackford A, Joshu C, Boyd C, Varadhan R . Life expectancy estimates based on comorbidities and frailty to inform preventive care. J Am Geriatr Soc. 2021; 70(1):99-109. PMC: 8742754. DOI: 10.1111/jgs.17468. View

3.
Kattan M . Comparison of Cox regression with other methods for determining prediction models and nomograms. J Urol. 2003; 170(6 Pt 2):S6-9. DOI: 10.1097/01.ju.0000094764.56269.2d. View

4.
Gerds T, Schumacher M . Consistent estimation of the expected Brier score in general survival models with right-censored event times. Biom J. 2007; 48(6):1029-40. DOI: 10.1002/bimj.200610301. View

5.
Gagliato D, Gonzalez-Angulo A, Lei X, Theriault R, Giordano S, Valero V . Clinical impact of delaying initiation of adjuvant chemotherapy in patients with breast cancer. J Clin Oncol. 2014; 32(8):735-44. PMC: 3940536. DOI: 10.1200/JCO.2013.49.7693. View