» Articles » PMID: 37746081

Explainability Agreement Between Dermatologists and Five Visual Explanations Techniques in Deep Neural Networks for Melanoma AI Classification

Overview
Specialty General Medicine
Date 2023 Sep 25
PMID 37746081
Authors
Affiliations
Soon will be listed here.
Abstract

Introduction: The use of deep convolutional neural networks for analyzing skin lesion images has shown promising results. The identification of skin cancer by faster and less expensive means can lead to an early diagnosis, saving lives and avoiding treatment costs. However, to implement this technology in a clinical context, it is important for specialists to understand why a certain model makes a prediction; it must be explainable. Explainability techniques can be used to highlight the patterns of interest for a prediction.

Methods: Our goal was to test five different techniques: Grad-CAM, Grad-CAM++, Score-CAM, Eigen-CAM, and LIME, to analyze the agreement rate between features highlighted by the visual explanation maps to 3 important clinical criteria for melanoma classification: asymmetry, border irregularity, and color heterogeneity (ABC rule) in 100 melanoma images. Two dermatologists scored the visual maps and the clinical images using a semi-quantitative scale, and the results were compared. They also ranked their preferable techniques.

Results: We found that the techniques had different agreement rates and acceptance. In the overall analysis, Grad-CAM showed the best total+partial agreement rate (93.6%), followed by LIME (89.8%), Grad-CAM++ (88.0%), Eigen-CAM (86.4%), and Score-CAM (84.6%). Dermatologists ranked their favorite options: Grad-CAM and Grad-CAM++, followed by Score-CAM, LIME, and Eigen-CAM.

Discussion: Saliency maps are one of the few methods that can be used for visual explanations. The evaluation of explainability with humans is ideal to assess the understanding and applicability of these methods. Our results demonstrated that there is a significant agreement between clinical features used by dermatologists to diagnose melanomas and visual explanation techniques, especially Grad-Cam.

Citing Articles

Deep learning-assisted diagnosis of acute mesenteric ischemia based on CT angiography images.

Song L, Zhang X, Zhang J, Wu J, Wang J, Wang F Front Med (Lausanne). 2025; 12:1510357.

PMID: 39926426 PMC: 11802816. DOI: 10.3389/fmed.2025.1510357.

References
1.
Lau A, Staccini P . Artificial Intelligence in Health: New Opportunities, Challenges, and Practical Implications. Yearb Med Inform. 2019; 28(1):174-178. PMC: 6697520. DOI: 10.1055/s-0039-1677935. View

2.
Rudin C . Stop Explaining Black Box Machine Learning Models for High Stakes Decisions and Use Interpretable Models Instead. Nat Mach Intell. 2022; 1(5):206-215. PMC: 9122117. DOI: 10.1038/s42256-019-0048-x. View

3.
Alexandrescu D . Melanoma costs: a dynamic model comparing estimated overall costs of various clinical stages. Dermatol Online J. 2009; 15(11):1. View

4.
Buja A, Sartor G, Scioni M, Vecchiato A, Bolzan M, Rebba V . Estimation of Direct Melanoma-related Costs by Disease Stage and by Phase of Diagnosis and Treatment According to Clinical Guidelines. Acta Derm Venereol. 2017; 98(2):218-224. DOI: 10.2340/00015555-2830. View

5.
Esteva A, Kuprel B, Novoa R, Ko J, Swetter S, Blau H . Dermatologist-level classification of skin cancer with deep neural networks. Nature. 2017; 542(7639):115-118. PMC: 8382232. DOI: 10.1038/nature21056. View