» Articles » PMID: 38816560

Automated Machine Learning for Predicting Liver Metastasis in Patients with Gastrointestinal Stromal Tumor: a SEER-based Analysis

Overview
Journal Sci Rep
Specialty Science
Date 2024 May 30
PMID 38816560
Authors
Affiliations
Soon will be listed here.
Abstract

Gastrointestinal stromal tumors (GISTs) are a rare type of tumor that can develop liver metastasis (LIM), significantly impacting the patient's prognosis. This study aimed to predict LIM in GIST patients by constructing machine learning (ML) algorithms to assist clinicians in the decision-making process for treatment. Retrospective analysis was performed using the Surveillance, Epidemiology, and End Results (SEER) database, and cases from 2010 to 2015 were assigned to the developing sets, while cases from 2016 to 2017 were assigned to the testing set. Missing values were addressed using the multiple imputation technique. Four algorithms were utilized to construct the models, comprising traditional logistic regression (LR) and automated machine learning (AutoML) analysis such as gradient boost machine (GBM), deep neural net (DL), and generalized linear model (GLM). We evaluated the models' performance using LR-based metrics, including the area under the receiver operating characteristic curve (AUC), calibration curve, and decision curve analysis (DCA), as well as AutoML-based metrics, such as feature importance, SHapley Additive exPlanation (SHAP) Plots, and Local Interpretable Model Agnostic Explanation (LIME). A total of 6207 patients were included in this study, with 2683, 1780, and 1744 patients allocated to the training, validation, and test sets, respectively. Among the different models evaluated, the GBM model demonstrated the highest performance in the training, validation, and test cohorts, with respective AUC values of 0.805, 0.780, and 0.795. Furthermore, the GBM model outperformed other AutoML models in terms of accuracy, achieving 0.747, 0.700, and 0.706 in the training, validation, and test cohorts, respectively. Additionally, the study revealed that tumor size and tumor location were the most significant predictors influencing the AutoML model's ability to accurately predict LIM. The AutoML model utilizing the GBM algorithm for GIST patients can effectively predict the risk of LIM and provide clinicians with a reference for developing individualized treatment plans.

Citing Articles

Invasion and metastasis in cancer: molecular insights and therapeutic targets.

Li Y, Liu F, Cai Q, Deng L, OuYang Q, Zhang X Signal Transduct Target Ther. 2025; 10(1):57.

PMID: 39979279 PMC: 11842613. DOI: 10.1038/s41392-025-02148-4.


Automated machine learning models for nonalcoholic fatty liver disease assessed by controlled attenuation parameter from the NHANES 2017-2020.

Liu L, Lin J, Liu L, Gao J, Xu G, Yin M Digit Health. 2024; 10:20552076241272535.

PMID: 39119551 PMC: 11307367. DOI: 10.1177/20552076241272535.

References
1.
Candia J, Tsang J . eNetXplorer: an R package for the quantitative exploration of elastic net families for generalized linear models. BMC Bioinformatics. 2019; 20(1):189. PMC: 6469092. DOI: 10.1186/s12859-019-2778-5. View

2.
von Mehren M, Joensuu H . Gastrointestinal Stromal Tumors. J Clin Oncol. 2017; 36(2):136-143. PMC: 6553810. DOI: 10.1200/JCO.2017.74.9705. View

3.
Neves I, Folgado D, Santos S, Barandas M, Campagner A, Ronzio L . Interpretable heartbeat classification using local model-agnostic explanations on ECGs. Comput Biol Med. 2021; 133:104393. DOI: 10.1016/j.compbiomed.2021.104393. View

4.
Nohara Y, Matsumoto K, Soejima H, Nakashima N . Explanation of machine learning models using shapley additive explanation and application for real data in hospital. Comput Methods Programs Biomed. 2021; 214:106584. DOI: 10.1016/j.cmpb.2021.106584. View

5.
Kukar M, Kapil A, Papenfuss W, Groman A, Grobmyer S, Hochwald S . Gastrointestinal stromal tumors (GISTs) at uncommon locations: a large population based analysis. J Surg Oncol. 2015; 111(6):696-701. DOI: 10.1002/jso.23873. View