» Articles » PMID: 30891794

Combining Handcrafted Features with Latent Variables in Machine Learning for Prediction of Radiation-induced Lung Damage

Overview
Journal Med Phys
Specialty Biophysics
Date 2019 Mar 21
PMID 30891794
Citations 21
Authors
Affiliations
Soon will be listed here.
Abstract

Purpose: There has been burgeoning interest in applying machine learning methods for predicting radiotherapy outcomes. However, the imbalanced ratio of a large number of variables to a limited sample size in radiation oncology constitutes a major challenge. Therefore, dimensionality reduction methods can be a key to success. The study investigates and contrasts the application of traditional machine learning methods and deep learning approaches for outcome modeling in radiotherapy. In particular, new joint architectures based on variational autoencoder (VAE) for dimensionality reduction are presented and their application is demonstrated for the prediction of lung radiation pneumonitis (RP) from a large-scale heterogeneous dataset.

Methods: A large-scale heterogeneous dataset containing a pool of 230 variables including clinical factors (e.g., dose, KPS, stage) and biomarkers (e.g., single nucleotide polymorphisms (SNPs), cytokines, and micro-RNAs) in a population of 106 nonsmall cell lung cancer (NSCLC) patients who received radiotherapy was used for modeling RP. Twenty-two patients had grade 2 or higher RP. Four methods were investigated, including feature selection (case A) and feature extraction (case B) with traditional machine learning methods, a VAE-MLP joint architecture (case C) with deep learning and lastly, the combination of feature selection and joint architecture (case D). For feature selection, Random forest (RF), Support Vector Machine (SVM), and multilayer perceptron (MLP) were implemented to select relevant features. Specifically, each method was run for multiple times to rank features within several cross-validated (CV) resampled sets. A collection of ranking lists were then aggregated by top 5% and Kemeny graph methods to identify the final ranking for prediction. A synthetic minority oversampling technique was applied to correct for class imbalance during this process. For deep learning, a VAE-MLP joint architecture where a VAE aimed for dimensionality reduction and an MLP aimed for classification was developed. In this architecture, reconstruction loss and prediction loss were combined into a single loss function to realize simultaneous training and weights were assigned to different classes to mitigate class imbalance. To evaluate the prediction performance and conduct comparisons, the area under receiver operating characteristic curves (AUCs) were performed for nested CVs for both handcrafted feature selections and the deep learning approach. The significance of differences in AUCs was assessed using the DeLong test of U-statistics.

Results: An MLP-based method using weight pruning (WP) feature selection yielded the best performance among the different hand-crafted feature selection methods (case A), reaching an AUC of 0.804 (95% CI: 0.761-0.823) with 29 top features. A VAE-MLP joint architecture (case C) achieved a comparable but slightly lower AUC of 0.781 (95% CI: 0.737-0.808) with the size of latent dimension being 2. The combination of handcrafted features (case A) and latent representation (case D) achieved a significant AUC improvement of 0.831 (95% CI: 0.805-0.863) with 22 features (P-value = 0.000642 compared with handcrafted features only (Case A) and P-value = 0.000453 compared to VAE alone (Case C)) with an MLP classifier.

Conclusion: The potential for combination of traditional machine learning methods and deep learning VAE techniques has been demonstrated for dealing with limited datasets in modeling radiotherapy toxicities. Specifically, latent variables from a VAE-MLP joint architecture are able to complement handcrafted features for the prediction of RP and improve prediction over either method alone.

Citing Articles

Deep learning combining imaging, dose and clinical data for predicting bowel toxicity after pelvic radiotherapy.

Elhaminia B, Gilbert A, Scarsbrook A, Lilley J, Appelt A, Gooya A Phys Imaging Radiat Oncol. 2025; 33:100710.

PMID: 40046574 PMC: 11880715. DOI: 10.1016/j.phro.2025.100710.


Machine learning approaches to predict the need for intensive care unit admission among Iranian COVID-19 patients based on ICD-10: A cross-sectional study.

Karimi Z, Malak J, Aghakhani A, Najafi M, Ariannejad H, Zeraati H Health Sci Rep. 2024; 7(9):e70041.

PMID: 39229475 PMC: 11369020. DOI: 10.1002/hsr2.70041.


Fostering Transformation: Unleashing the Power of Artifical Intelligence and Machine Learning in the Field of Radiation Oncology.

Das J, Nath J, Bhattacharyya M, Kalita A Indian J Otolaryngol Head Neck Surg. 2024; 76(4):3750-3754.

PMID: 39130229 PMC: 11306808. DOI: 10.1007/s12070-024-04658-z.


Deep-Learning Model Prediction of Radiation Pneumonitis Using Pretreatment Chest Computed Tomography and Clinical Factors.

Lee J, Kang M, Park J, Lee S, Kim J, Park S Technol Cancer Res Treat. 2024; 23:15330338241254060.

PMID: 38752262 PMC: 11102700. DOI: 10.1177/15330338241254060.


SH3GL2 and MMP17 as lung adenocarcinoma biomarkers: a machine-learning based approach.

Tian Z, Yu S, Cai R, Zhang Y, Liu Q, Zhu Y Biochem Biophys Rep. 2024; 38:101693.

PMID: 38571554 PMC: 10987888. DOI: 10.1016/j.bbrep.2024.101693.


References
1.
Bentzen S, Dische S . Morbidity related to axillary irradiation in the treatment of breast cancer. Acta Oncol. 2000; 39(3):337-47. DOI: 10.1080/028418600750013113. View

2.
Su M, Miften M, Whiddon C, Sun X, Light K, Marks L . An artificial neural network for predicting the incidence of radiation pneumonitis. Med Phys. 2005; 32(2):318-25. DOI: 10.1118/1.1835611. View

3.
Stavrev P, Stavreva N, Sharplin J, Fallone B, Franko A . Critical volume model analysis of lung complication data from different strains of mice. Int J Radiat Biol. 2005; 81(1):77-88. DOI: 10.1080/09553000400027910. View

4.
El Naqa I, Bradley J, Blanco A, Lindsay P, Vicic M, Hope A . Multivariable modeling of radiotherapy outcomes, including dose-volume and clinical factors. Int J Radiat Oncol Biol Phys. 2006; 64(4):1275-86. DOI: 10.1016/j.ijrobp.2005.11.022. View

5.
Damaraju S, Murray D, Dufour J, Carandang D, Myrehaug S, Fallone G . Association of DNA repair and steroid metabolism gene polymorphisms with clinical late toxicity in patients treated with conformal radiotherapy for prostate cancer. Clin Cancer Res. 2006; 12(8):2545-54. DOI: 10.1158/1078-0432.CCR-05-2703. View