
Improving Lung Cancer Prognosis Assessment by Incorporating Synthetic Minority Oversampling Technique and Score Fusion Method

Overview
Journal: Med Phys
Specialty: Biophysics
Date: 2016 Jun 10
PMID: 27277016
Citations: 7
Abstract

Purpose: This study investigates whether lung cancer recurrence risk prediction for stage I NSCLC patients can be improved by integrating oversampling, feature selection, and score fusion techniques, and aims to develop an optimal prediction model.

Methods: A dataset of 94 early-stage lung cancer patients was retrospectively assembled, including CT images, nine clinical and biological (CB) markers, and the outcome of 3-yr disease-free survival (DFS) after surgery. Of the 94 patients, 74 remained disease-free and 20 had cancer recurrence. Using a computer-aided detection scheme, tumors were segmented from the CT images and 35 quantitative image (QI) features were initially computed. Two normalized Gaussian radial basis function network (RBFN) classifiers were built, one on the QI features and one on the CB markers. To improve prediction performance, the authors applied a synthetic minority oversampling technique (SMOTE) and a BestFirst-based feature selection method to optimize the classifiers, and also tested fusion methods to combine the QI-based and CB-based prediction results.
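The oversampling step described above can be illustrated with a minimal sketch. It is a simplified SMOTE variant written for illustration, not the authors' implementation: the neighbor count `k`, the number of synthetic cases, the random choice of base samples, and the two made-up features are all assumptions, since the abstract does not state them. Only the class counts (74 disease-free, 20 recurrence) come from the abstract.

```python
import math
import random

def smote(minority, n_synthetic, k=5, seed=0):
    """Create n_synthetic points, each interpolated between a randomly chosen
    minority sample and one of its k nearest minority-class neighbours."""
    rng = random.Random(seed)
    synthetic = []
    for _ in range(n_synthetic):
        base = rng.choice(minority)
        # k nearest neighbours of `base` among the other minority samples
        others = [p for p in minority if p is not base]
        neighbours = sorted(others, key=lambda p: math.dist(base, p))[:k]
        neighbour = rng.choice(neighbours)
        gap = rng.random()  # position along the segment base -> neighbour
        synthetic.append([b + gap * (n - b) for b, n in zip(base, neighbour)])
    return synthetic

# Toy data only (hypothetical features): 20 minority "recurrence" cases.
rng = random.Random(1)
recurrence = [[rng.gauss(1.0, 0.5), rng.gauss(2.0, 0.5)] for _ in range(20)]

# Assumption: balance against the 74 majority cases, so 74 - 20 = 54 new cases.
new_cases = smote(recurrence, n_synthetic=74 - 20)
print(len(new_cases))  # 54
```

Because every synthetic case lies on a segment between two real minority cases, the new points stay inside the region already occupied by the minority class rather than duplicating existing cases.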

Results: Using a leave-one-case-out cross-validation method (K-fold cross-validation with K equal to the number of cases), the computed areas under the receiver operating characteristic curve (AUCs) were 0.716 ± 0.071 and 0.642 ± 0.061 for the QI-based and CB-based classifiers, respectively. Fusing the scores generated by the two classifiers significantly increased the AUC to 0.859 ± 0.052 (p < 0.05), with an overall prediction accuracy of 89.4%.
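The evaluation reported above (leave-one-case-out scoring, AUC, and score fusion) can be sketched as follows. This is a hedged illustration on synthetic toy data and does not reproduce the reported values. Several pieces are assumptions: `centroid_score` is a simple stand-in scorer, not an RBFN; `fuse` uses a weighted average with a hypothetical weight `w`, whereas the abstract does not say which fusion rule performed best; and the feature values are random.

```python
import random

def auc(scores, labels):
    """AUC as the Mann-Whitney statistic: the fraction of (positive, negative)
    pairs in which the positive case scores higher; ties count as 0.5."""
    pos = [s for s, y in zip(scores, labels) if y == 1]
    neg = [s for s, y in zip(scores, labels) if y == 0]
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0 for p in pos for n in neg)
    return wins / (len(pos) * len(neg))

def centroid_score(train_x, train_y, x):
    """Stand-in scorer (NOT an RBFN): relative closeness of x to the
    positive-class centroid, scaled to [0, 1]."""
    def centroid(rows):
        return [sum(col) / len(rows) for col in zip(*rows)]
    def sqdist(a, b):
        return sum((u - v) ** 2 for u, v in zip(a, b))
    c_pos = centroid([r for r, y in zip(train_x, train_y) if y == 1])
    c_neg = centroid([r for r, y in zip(train_x, train_y) if y == 0])
    d_pos, d_neg = sqdist(x, c_pos), sqdist(x, c_neg)
    total = d_pos + d_neg
    return 0.5 if total == 0 else d_neg / total

def leave_one_out_scores(xs, ys):
    """Score each case with a model fitted on all the other cases."""
    scores = []
    for i in range(len(xs)):
        train_x = xs[:i] + xs[i + 1:]
        train_y = ys[:i] + ys[i + 1:]
        scores.append(centroid_score(train_x, train_y, xs[i]))
    return scores

def fuse(scores_a, scores_b, w=0.5):
    """Weighted-average fusion of two score lists (w is a hypothetical weight)."""
    return [w * a + (1 - w) * b for a, b in zip(scores_a, scores_b)]

# Toy data only: 74 disease-free (0) and 20 recurrence (1) cases.
rng = random.Random(0)
labels = [0] * 74 + [1] * 20
qi = [[rng.gauss(0.8 * y, 1.0), rng.gauss(0.5 * y, 1.0)] for y in labels]
cb = [[rng.gauss(0.6 * y, 1.0)] for y in labels]

qi_scores = leave_one_out_scores(qi, labels)
cb_scores = leave_one_out_scores(cb, labels)
fused_scores = fuse(qi_scores, cb_scores)
for name, s in [("QI", qi_scores), ("CB", cb_scores), ("fused", fused_scores)]:
    print(name, round(auc(s, labels), 3))
```

In a fuller pipeline, oversampling and feature selection would be applied inside each training fold only, so that the held-out case never influences the synthetic samples or the selected features.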

Conclusions: This study demonstrated the feasibility of improving prediction performance by integrating SMOTE, feature selection, and score fusion techniques. Combining QI features and CB markers, and performing SMOTE prior to feature selection during classifier training, enabled the RBFN-based classifier to yield improved prediction accuracy.

Citing Articles

A multi-stage fusion framework to classify breast lesions using deep learning and radiomics features computed from four-view mammograms.

Jones M, Sadeghipour N, Chen X, Islam W, Zheng B. Med Phys. 2023; 50(12):7670-7683.

PMID: 37083190 PMC: 10589387. DOI: 10.1002/mp.16419.


Applying Quantitative Radiographic Image Markers to Predict Clinical Complications After Aneurysmal Subarachnoid Hemorrhage: A Pilot Study.

Danala G, Desai M, Ray B, Heidari M, Maryada S, Prodan C. Ann Biomed Eng. 2022; 50(4):413-425.

PMID: 35112157 PMC: 8918043. DOI: 10.1007/s10439-022-02926-z.


Learning from imbalanced fetal outcomes of systemic lupus erythematosus in artificial neural networks.

Ma J, Feng Z, Wu J, Zhang Y, Di W. BMC Med Inform Decis Mak. 2021; 21(1):127.

PMID: 33845834 PMC: 8042715. DOI: 10.1186/s12911-021-01486-x.


Applying a random projection algorithm to optimize machine learning model for predicting peritoneal metastasis in gastric cancer patients using CT images.

Mirniaharikandehei S, Heidari M, Danala G, Lakshmivarahan S, Zheng B. Comput Methods Programs Biomed. 2021; 200:105937.

PMID: 33486339 PMC: 7920928. DOI: 10.1016/j.cmpb.2021.105937.


Developing global image feature analysis models to predict cancer risk and prognosis.

Zheng B, Qiu Y, Aghaei F, Mirniaharikandehei S, Heidari M, Danala G. Vis Comput Ind Biomed Art. 2020; 2(1):17.

PMID: 32190407 PMC: 7055572. DOI: 10.1186/s42492-019-0026-5.