» Articles » PMID: 39593823

A Comprehensive Review of Performance Metrics for Computer-Aided Detection Systems

Overview
Date 2024 Nov 27
PMID 39593823
Authors
Affiliations
Soon will be listed here.
Abstract

This paper aims to provide a structured analysis of the performance metrics used in computer-aided detection (CAD) systems, specifically focusing on lung nodule detection in computed tomography (CT) images. By examining key metrics along with their respective strengths and limitations, this study offers guidelines to assist in selecting appropriate metrics. Evaluation methods for CAD systems for lung nodule detection are primarily categorized into per-scan and per-nodule approaches. For per-scan analysis, a key metric is the area under the receiver operating characteristic (ROC) curve (AUROC), which evaluates the ability of the system to distinguish between scans with and without nodules. For per-nodule analysis, the nodule-level sensitivity at fixed false positives per scan is often used, supplemented by the free-response receiver operating characteristic (FROC) curve and the competition performance metric (CPM). However, the CPM does not provide normalized scores because it theoretically ranges from zero to infinity and largely varies depending on the characteristics of the data. To address the advantages and limitations of ROC and FROC curves, an alternative FROC (AFROC) was introduced to combine the strengths of both per-scan and per-nodule analyses. This paper discusses the principles of each metric and their relative strengths, providing insights into their clinical implications and practical utility.

References
1.
Li L, Liu Z, Huang H, Lin M, Luo D . Evaluating the performance of a deep learning-based computer-aided diagnosis (DL-CAD) system for detecting and characterizing lung nodules: Comparison with the performance of double reading by radiologists. Thorac Cancer. 2018; 10(2):183-192. PMC: 6360226. DOI: 10.1111/1759-7714.12931. View

2.
Hsu H, Ko K, Chou Y, Wu Y, Chiu S, Chang C . Performance and reading time of lung nodule identification on multidetector CT with or without an artificial intelligence-powered computer-aided detection system. Clin Radiol. 2021; 76(8):626.e23-626.e32. DOI: 10.1016/j.crad.2021.04.006. View

3.
Jin W, Li X, Fatehi M, Hamarneh G . Guidelines and evaluation of clinical explainable AI in medical image analysis. Med Image Anal. 2022; 84:102684. DOI: 10.1016/j.media.2022.102684. View

4.
Park S, Lee S, Kim W, Park H, Jung K, Do K . Computer-aided Detection of Subsolid Nodules at Chest CT: Improved Performance with Deep Learning-based CT Section Thickness Reduction. Radiology. 2021; 299(1):211-219. DOI: 10.1148/radiol.2021203387. View

5.
Katase S, Ichinose A, Hayashi M, Watanabe M, Chin K, Takeshita Y . Development and performance evaluation of a deep learning lung nodule detection system. BMC Med Imaging. 2022; 22(1):203. PMC: 9682774. DOI: 10.1186/s12880-022-00938-8. View