Predicting HPV Association Using Deep Learning and Regular H&E Stains Allows Granular Stratification of Oropharyngeal Cancer Patients
Overview
Authors
Affiliations
Human Papilloma Virus (HPV)-associated oropharyngeal squamous cell cancer (OPSCC) represents an OPSCC subgroup with an overall good prognosis with a rising incidence in Western countries. Multiple lines of evidence suggest that HPV-associated tumors are not a homogeneous tumor entity, underlining the need for accurate prognostic biomarkers. In this retrospective, multi-institutional study involving 906 patients from four centers and one database, we developed a deep learning algorithm (OPSCCnet), to analyze standard H&E stains for the calculation of a patient-level score associated with prognosis, comparing it to combined HPV-DNA and p16-status. When comparing OPSCCnet to HPV-status, the algorithm showed a good overall performance with a mean area under the receiver operator curve (AUROC) = 0.83 (95% CI = 0.77-0.9) for the test cohort (n = 639), which could be increased to AUROC = 0.88 by filtering cases using a fixed threshold on the variance of the probability of the HPV-positive class - a potential surrogate marker of HPV-heterogeneity. OPSCCnet could be used as a screening tool, outperforming gold standard HPV testing (OPSCCnet: five-year survival rate: 96% [95% CI = 90-100%]; HPV testing: five-year survival rate: 80% [95% CI = 71-90%]). This could be confirmed using a multivariate analysis of a three-tier threshold (OPSCCnet: high HR = 0.15 [95% CI = 0.05-0.44], intermediate HR = 0.58 [95% CI = 0.34-0.98] p = 0.043, Cox proportional hazards model, n = 211; HPV testing: HR = 0.29 [95% CI = 0.15-0.54] p < 0.001, Cox proportional hazards model, n = 211). Collectively, our findings indicate that by analyzing standard gigapixel hematoxylin and eosin (H&E) histological whole-slide images, OPSCCnet demonstrated superior performance over p16/HPV-DNA testing in various clinical scenarios, particularly in accurately stratifying these patients.
Feng B, Zhao D, Zhang Z, Jia R, Schuler P, Hess J NPJ Precis Oncol. 2025; 9(1):57.
PMID: 40021759 PMC: 11871237. DOI: 10.1038/s41698-025-00844-6.
Muksimova S, Umirzakova S, Baltayev J, Cho Y Diagnostics (Basel). 2025; 15(3).
PMID: 39941293 PMC: 11816595. DOI: 10.3390/diagnostics15030364.
Wang R, Gunesli G, Skingen V, Valen K, Lyng H, Young L NPJ Precis Oncol. 2025; 9(1):11.
PMID: 39799271 PMC: 11724963. DOI: 10.1038/s41698-024-00778-5.
Genome composition-based deep learning predicts oncogenic potential of HPVs.
Hao L, Jiang Y, Zhang C, Han P Front Cell Infect Microbiol. 2024; 14:1430424.
PMID: 39104853 PMC: 11298479. DOI: 10.3389/fcimb.2024.1430424.