Early Symptoms and Sensations As Predictors of Lung Cancer: a Machine Learning Multivariate Model
Authors
Affiliations
The aim of this study was to identify a combination of early predictive symptoms/sensations attributable to primary lung cancer (LC). An interactive e-questionnaire comprised of pre-diagnostic descriptors of first symptoms/sensations was administered to patients referred for suspected LC. Respondents were included in the present analysis only if they later received a primary LC diagnosis or had no cancer; and inclusion of each descriptor required ≥4 observations. Fully-completed data from 506/670 individuals later diagnosed with primary LC (n = 311) or no cancer (n = 195) were modelled with orthogonal projections to latent structures (OPLS). After analysing 145/285 descriptors, meeting inclusion criteria, through randomised seven-fold cross-validation (six-fold training set: n = 433; test set: n = 73), 63 provided best LC prediction. The most-significant LC-positive descriptors included a cough that varied over the day, back pain/aches/discomfort, early satiety, appetite loss, and having less strength. Upon combining the descriptors with the background variables current smoking, a cold/flu or pneumonia within the past two years, female sex, older age, a history of COPD (positive LC-association); antibiotics within the past two years, and a history of pneumonia (negative LC-association); the resulting 70-variable model had accurate cross-validated test set performance: area under the ROC curve = 0.767 (descriptors only: 0.736/background predictors only: 0.652), sensitivity = 84.8% (73.9/76.1%, respectively), specificity = 55.6% (66.7/51.9%, respectively). In conclusion, accurate prediction of LC was found through 63 early symptoms/sensations and seven background factors. Further research and precision in this model may lead to a tool for referral and LC diagnostic decision-making.
LncRNA MANCR is downregulated in non-small cell lung cancer and predicts poor survival.
Tao Y, Liu J, Qiu W, Li Y Discov Oncol. 2025; 16(1):40.
PMID: 39806185 PMC: 11730023. DOI: 10.1007/s12672-025-01739-5.
Research Priorities of the Oncology Nursing Society: 2024-2027.
Rosenzweig M, Belcher S, Braithwaite L, Cuaron L, Fischer-Cartlidge E, Lally R Oncol Nurs Forum. 2024; 51(6):502-515.
PMID: 39431689 PMC: 11697697. DOI: 10.1188/24.ONF.502-515.
Multiplex plasma protein assays as a diagnostic tool for lung cancer.
Ahamed M, Forshed J, Levitsky A, Lehtio J, Bajalan A, Pernemalm M Cancer Sci. 2024; 115(10):3439-3454.
PMID: 39080998 PMC: 11447887. DOI: 10.1111/cas.16300.
Galadima H, Anson-Dwamena R, Johnson A, Bello G, Adunlin G, Blando J Cancers (Basel). 2024; 16(3).
PMID: 38339293 PMC: 10854986. DOI: 10.3390/cancers16030540.
Lorenc A, Romaszko-Wojtowicz A, Jaskiewicz L, Doboszynska A, Bucinski A Transl Lung Cancer Res. 2023; 12(10):2083-2097.
PMID: 38025814 PMC: 10654430. DOI: 10.21037/tlcr-23-350.