» Articles » PMID: 35321110

Development and Validation of a Risk Prediction Model for Venous Thromboembolism in Lung Cancer Patients Using Machine Learning

Overview
Authors
Affiliations
Soon will be listed here.
Abstract

Background: There is currently a lack of model for predicting the occurrence of venous thromboembolism (VTE) in patients with lung cancer. Machine learning (ML) techniques are being increasingly adapted for use in the medical field because of their capabilities of intelligent analysis and scalability. This study aimed to develop and validate ML models to predict the incidence of VTE among lung cancer patients.

Methods: Data of lung cancer patients from a Grade 3A cancer hospital in China with and without VTE were included. Patient characteristics and clinical predictors related to VTE were collected. The primary endpoint was the diagnosis of VTE during index hospitalization. We calculated and compared the area under the receiver operating characteristic curve (AUROC) using the selected best-performed model (Random Forest model) through multiple model comparison, as well as investigated feature contributions during the training process with both permutation importance scores and the impurity-based feature importance scores in random forest model.

Results: In total, 3,398 patients were included in our study, 125 of whom experienced VTE during their hospital stay. The ROC curve and precision-recall curve (PRC) for Random Forest Model showed an AUROC of 0.91 (95% : 0.893-0.926) and an AUPRC of 0.43 (95% : 0.363-0.500). For the simplified model, five most relevant features were selected: Karnofsky Performance Status (KPS), a history of VTE, recombinant human endostatin, EGFR-TKI, and platelet count. We re-trained a random forest classifier with results of the AUROC of 0.87 (95% : 0.802-0.917) and AUPRC of 0.30 (95% : 0.265-0.358), respectively.

Conclusion: According to the study results, there was no conspicuous decrease in the model's performance when use fewer features to predict, we concluded that our simplified model would be more applicable in real-life clinical settings. The developed model using ML algorithms in our study has the potential to improve the early detection and prediction of the incidence of VTE in patients with lung cancer.

Citing Articles

Incidence and Risk Factors of Lower Limb Deep Vein Thrombosis in Psychiatric Inpatients by Applying Machine Learning to Electronic Health Records: A Retrospective Cohort Study.

Xu L, Da M Clin Epidemiol. 2025; 17:197-209.

PMID: 40027401 PMC: 11871911. DOI: 10.2147/CLEP.S501062.


Predicting survival benefits of immune checkpoint inhibitor therapy in lung cancer patients: a machine learning approach using real-world data.

Pan L, Mu L, Lei H, Miao S, Hu X, Tang Z Int J Clin Pharm. 2024; .

PMID: 39470981 DOI: 10.1007/s11096-024-01818-7.


Machine learning in cancer-associated thrombosis: hype or hope in untangling the clot.

Patell R, Zwicker J, Singh R, Mantha S Bleeding Thromb Vasc Biol. 2024; 3(Suppl 1).

PMID: 39323613 PMC: 11423546. DOI: 10.4081/btvb.2024.123.


A nomogram predicting venous thromboembolism risk in primary liver cancer patients.

Lei H, Li X, Hu Z, Xu Q, Li Q, Zhou R J Thromb Thrombolysis. 2024; 58(1):145-156.

PMID: 39306653 PMC: 11762598. DOI: 10.1007/s11239-024-03041-7.


Incidence and risk factors of VTE in lung cancer: a meta-analysis.

Yang R, Wang H, Liu D, Li W Ann Med. 2024; 56(1):2390200.

PMID: 39183726 PMC: 11348814. DOI: 10.1080/07853890.2024.2390200.


References
1.
Yhim H, Jang M, Bang S, Kim K, Kim Y, Nam S . Incidence of venous thromboembolism following major surgery in Korea: from the Health Insurance Review and Assessment Service database. J Thromb Haemost. 2014; 12(7):1035-43. DOI: 10.1111/jth.12611. View

2.
Gutierrez M, Giaccone G . Antiangiogenic therapy in nonsmall cell lung cancer. Curr Opin Oncol. 2008; 20(2):176-82. DOI: 10.1097/CCO.0b013e3282f4e55e. View

3.
Yan A, Samarawickrema I, Naunton M, Peterson G, Yip D, De Rosa S . Risk Factors and Prediction Models for Venous Thromboembolism in Ambulatory Patients with Lung Cancer. Healthcare (Basel). 2021; 9(6). PMC: 8233898. DOI: 10.3390/healthcare9060778. View

4.
Lababede O, Meziane M . The Eighth Edition of TNM Staging of Lung Cancer: Reference Chart and Diagrams. Oncologist. 2018; 23(7):844-848. PMC: 6058324. DOI: 10.1634/theoncologist.2017-0659. View

5.
Kawaler E, Cobian A, Peissig P, Cross D, Yale S, Craven M . Learning to predict post-hospitalization VTE risk from EHR data. AMIA Annu Symp Proc. 2013; 2012:436-45. PMC: 3540493. View