» Articles » PMID: 38455408

Development and Validation of Machine Learning Models for Diagnosis and Prognosis of Cancer by Urinary Proteomics, Based on the FLEMENGHO Cohort

Overview
Journal Am J Cancer Res
Specialty Oncology
Date 2024 Mar 8
PMID 38455408
Authors
Affiliations
Soon will be listed here.
Abstract

The current study aims to develop and validate machine learning (ML) models for the prediction of cancer status by the non-invasive urinary proteomic in a population-based cohort. In this retrospective study, urinary proteome profiles in 804 cases from the FLEMENGHO cohort were measured by mass spectrometry. After feature selection by LASSO on both clinical variables and urinary proteome profile, benchmark models by clinical variables were built with six different ML algorithms. Proteome-based models and combined models were built and compared with the benchmark models. The models' performance, i.e. area under the curve (AUC) was compared by Delong method. The 95% confidence interval was estimated by the bootstrapping method. The best-performing model was explained by Shapley Additive Explanations (SHAP) method. The predictive role of proteome biomarkers in longitudinal cancer diagnosis was also explored. A clinical model, based on age, blood sugar and blood lipid profile, yielded the best AUC of 0.75 (0.68-0.82), with 0.80 (0.72-0.91) for the proteome model based on 13 selected biomarkers and 0.83 (0.77-0.90) for the combined model (=0.01 for comparison with clinical model). SHAP on the support vector machine in the combined setting showed that except for age, proteome biomarkers contribute to the final prediction of the model. After adjusting with clinical factors, three proteome biomarkers are independent risk factors for longitudinal cancer development. Urinary proteome profiling, together with fine-tuned machine learning algorithms, demonstrates the predictive potential for cancer diagnosis transparently.

Citing Articles

The 2024 Report on the Human Proteome from the HUPO Human Proteome Project.

Omenn G, Orchard S, Lane L, Lindskog C, Pineau C, Overall C J Proteome Res. 2024; 23(12):5296-5311.

PMID: 39514846 PMC: 11781352. DOI: 10.1021/acs.jproteome.4c00776.

References
1.
Wei D, Melgarejo J, Thijs L, Temmerman X, Vanassche T, Van Aelst L . Urinary Proteomic Profile of Arterial Stiffness Is Associated With Mortality and Cardiovascular Outcomes. J Am Heart Assoc. 2022; 11(8):e024769. PMC: 9238473. DOI: 10.1161/JAHA.121.024769. View

2.
Fan G, Gong T, Lin Y, Wang J, Sun L, Wei H . Urine proteomics identifies biomarkers for diabetic kidney disease at different stages. Clin Proteomics. 2021; 18(1):32. PMC: 8903606. DOI: 10.1186/s12014-021-09338-6. View

3.
Theodorescu D, Wittke S, Ross M, Walden M, Conaway M, Just I . Discovery and validation of new protein biomarkers for urothelial cancer: a prospective analysis. Lancet Oncol. 2006; 7(3):230-40. DOI: 10.1016/S1470-2045(06)70584-8. View

4.
Jantos-Siwy J, Schiffer E, Brand K, Schumann G, Rossing K, Delles C . Quantitative urinary proteome analysis for biomarker evaluation in chronic kidney disease. J Proteome Res. 2008; 8(1):268-81. DOI: 10.1021/pr800401m. View

5.
Wang Y, Wang Y, Han X, Sun J, Li C, Adhikari B . Cardio-Oncology: A Myriad of Relationships Between Cardiovascular Disease and Cancer. Front Cardiovasc Med. 2022; 9:727487. PMC: 8968416. DOI: 10.3389/fcvm.2022.727487. View