Comparison of Diagnostic Accuracy and Utility of Artificial Intelligence-optimized ACR TI-RADS and Original ACR TI-RADS: a Multi-center Validation Study Based on 2061 Thyroid Nodules
Overview
Authors
Affiliations
Objective: To determine if artificial intelligence-based modification of the Thyroid Imaging Reporting Data System (TI-RADS) would be better than the current American College of Radiology (ACR) TI-RADS for risk stratification of thyroid nodules.
Methods: A total of 2061 thyroid nodules (in 1859 patients) sampled with fine-needle aspiration or operation were retrospectively analyzed between January 2017 and July 2020. Two radiologists blinded to the pathologic diagnosis evaluated nodule features in five ultrasound categories and assigned TI-RADS scores by both ACR TI-RADS and AI TI-RADS. Inter-rater agreement was assessed by asking another two radiologists to score a set of 100 nodules independently. The reference standard was postoperative pathological or cytopathological diagnosis according to the Bethesda system. Inter-rater agreement was determined using intraclass correlation coefficient (ICC).
Results: AI TI-RADS assigned lower TI-RADS risk levels than ACR TI-RADS (p < 0.001) and had larger area under receiver operating characteristic curve (0.762 vs. 0.679, p < 0.001). The sensitivities of ACR TI-RADS and AI TI-RADS were similar (86.7% vs. 82.2%, p = 0.052), but specificity was higher with AI TI-RADS (70.2% vs. 49.2%, p < 0.001). AI TI-RADS downgraded 743 (48.63%) benign nodules, indicating that 328 (42.3% of 776 biopsied nodules) unnecessary fine-needle aspirations (FNA) could have been avoided. Inter-rater agreement was better with AI TI-RADS than with ACR TI-RADS (ICC, 0.808 vs. 0.861, p < 0.001).
Conclusion: AI TI-RADS can achieve meaningful reduction in the number of benign thyroid nodules recommended for biopsy and significantly improve specificity despite a slight decrease in sensitivity.
Key Points: • AI TI-RADS assigned lower TI-RADS risk levels than ACR TI-RADS, showing similar sensitivity but higher specificity. • Half of the benign nodules can be downgraded of which 42.3% of biopsy nodules avoided unnecessary fine-needle aspiration (FNA). • AI TI-RADS had a better overall inter-rater agreement.
Savardi M, Signoroni A, Benini S, Vaccher F, Alberti M, Ciolli P Insights Imaging. 2025; 16(1):23.
PMID: 39881013 PMC: 11780016. DOI: 10.1186/s13244-024-01893-4.
Hellmann A, Wisniewski P, Sledzinski M, Raffaelli M, Kobiela J, Barczynski M Cancers (Basel). 2024; 16(12).
PMID: 38927942 PMC: 11202303. DOI: 10.3390/cancers16122237.
Ma X, Yu J, Huang Y, Cui Y, Cui K Front Oncol. 2023; 13:1265973.
PMID: 38033487 PMC: 10684914. DOI: 10.3389/fonc.2023.1265973.
Yang L, Li C, Chen Z, He S, Wang Z, Liu J Front Endocrinol (Lausanne). 2023; 14:1227339.
PMID: 37720531 PMC: 10501732. DOI: 10.3389/fendo.2023.1227339.
Toro-Tobon D, Loor-Torres R, Duran M, Fan J, Ospina N, Wu Y Thyroid. 2023; 33(8):903-917.
PMID: 37279303 PMC: 10440669. DOI: 10.1089/thy.2023.0132.