Automatic Recognition of Laryngoscopic Images Using a Deep-Learning Technique
Overview
Authors
Affiliations
Objectives/hypothesis: To develop a deep-learning-based computer-aided diagnosis system for distinguishing laryngeal neoplasms (benign, precancerous lesions, and cancer) and improve the clinician-based accuracy of diagnostic assessments of laryngoscopy findings.
Study Design: Retrospective study.
Methods: A total of 24,667 laryngoscopy images (normal, vocal nodule, polyps, leukoplakia and malignancy) were collected to develop and test a convolutional neural network (CNN)-based classifier. A comparison between the proposed CNN-based classifier and the clinical visual assessments (CVAs) by 12 otolaryngologists was conducted.
Results: In the independent testing dataset, an overall accuracy of 96.24% was achieved; for leukoplakia, benign, malignancy, normal, and vocal nodule, the sensitivity and specificity were 92.8% vs. 98.9%, 97% vs. 99.7%, 89% vs. 99.3%, 99.0% vs. 99.4%, and 97.2% vs. 99.1%, respectively. Furthermore, when compared with CVAs on the randomly selected test dataset, the CNN-based classifier outperformed physicians for most laryngeal conditions, with striking improvements in the ability to distinguish nodules (98% vs. 45%, P < .001), polyps (91% vs. 86%, P < .001), leukoplakia (91% vs. 65%, P < .001), and malignancy (90% vs. 54%, P < .001).
Conclusions: The CNN-based classifier can provide a valuable reference for the diagnosis of laryngeal neoplasms during laryngoscopy, especially for distinguishing benign, precancerous, and cancer lesions.
Level Of Evidence: NA Laryngoscope, 130:E686-E693, 2020.
Xu X, Yun B, Zhao Y, Jin L, Zong Y, Yu G Bioengineering (Basel). 2025; 12(1).
PMID: 39851283 PMC: 11762390. DOI: 10.3390/bioengineering12010010.
Intelligent imaging technology applications in multidisciplinary hospitals.
Fan K, Yang L, Ren F, Zhang X, Liu B, Zhao Z Chin Med J (Engl). 2024; 137(24):3083-3092.
PMID: 39690448 PMC: 11706584. DOI: 10.1097/CM9.0000000000003436.
SCC-NET: segmentation of clinical cancer image for head and neck squamous cell carcinoma.
Huang C, Tsai C, Hwang L, Kang B, Lin Y, Su H J Med Imaging (Bellingham). 2024; 11(6):065501.
PMID: 39583005 PMC: 11579920. DOI: 10.1117/1.JMI.11.6.065501.
Application of artificial intelligence in laryngeal lesions: a systematic review and meta-analysis.
Marrero-Gonzalez A, Diemer T, Nguyen S, Camilon T, Meenan K, ORourke A Eur Arch Otorhinolaryngol. 2024; 282(3):1543-1555.
PMID: 39576322 PMC: 11890366. DOI: 10.1007/s00405-024-09075-0.
Enhanced WGAN Model for Diagnosing Laryngeal Carcinoma.
Kim S, Chang Y, An S, Kim D, Cho J, Oh K Cancers (Basel). 2024; 16(20).
PMID: 39456576 PMC: 11506071. DOI: 10.3390/cancers16203482.