» Articles » PMID: 38843483

Automated Lugano Metabolic Response Assessment in F-Fluorodeoxyglucose-Avid Non-Hodgkin Lymphoma With Deep Learning on F-Fluorodeoxyglucose-Positron Emission Tomography

Abstract

Purpose: Artificial intelligence can reduce the time used by physicians on radiological assessments. For F-fluorodeoxyglucose-avid lymphomas, obtaining complete metabolic response (CMR) by end of treatment is prognostic.

Methods: Here, we present a deep learning-based algorithm for fully automated treatment response assessments according to the Lugano 2014 classification. The proposed four-stage method, trained on a multicountry clinical trial (ClinicalTrials.gov identifier: NCT01287741) and tested in three independent multicenter and multicountry test sets on different non-Hodgkin lymphoma subtypes and different lines of treatment (ClinicalTrials.gov identifiers NCT02257567, NCT02500407; 20% holdout in ClinicalTrials.gov identifier NCT01287741), outputs the detected lesions at baseline and follow-up to enable focused radiologist review.

Results: The method's response assessment achieved high agreement with the adjudicated radiologic responses (eg, agreement for overall response assessment of 93%, 87%, and 85% in ClinicalTrials.gov identifiers NCT01287741, NCT02500407, and NCT02257567, respectively) similar to inter-radiologist agreement and was strongly prognostic of outcomes with a trend toward higher accuracy for death risk than adjudicated radiologic responses (hazard ratio for end of treatment by-model CMR of 0.123, 0.054, and 0.205 in ClinicalTrials.gov identifiers NCT01287741, NCT02500407, and NCT02257567, compared with, respectively, 0.226, 0.292, and 0.272 for CMR by the adjudicated responses). Furthermore, a radiologist review of the algorithm's assessments was conducted. The radiologist median review time was 1.38 minutes/assessment, and no statistically significant differences were observed in the level of agreement of the radiologist with the model's response compared with the level of agreement of the radiologist with the adjudicated responses.

Conclusion: These results suggest that the proposed method can be incorporated into radiologic response assessment workflows in cancer imaging for significant time savings and with performance similar to trained medical experts.

Citing Articles

Assessing large language models for Lugano classification of malignant lymphoma in Japanese FDG-PET reports.

Ito R, Kato K, Nanataki K, Abe Y, Ogawa H, Minamimoto R EJNMMI Rep. 2025; 9(1):8.

PMID: 40059276 PMC: 11891112. DOI: 10.1186/s41824-025-00246-8.


Comparative analysis of intestinal tumor segmentation in PET CT scans using organ based and whole body deep learning.

Torkaman M, Jemaa S, Fredrickson J, Fernandez Coimbra A, De Crespigny A, Carano R BMC Med Imaging. 2025; 25(1):52.

PMID: 39962481 PMC: 11834234. DOI: 10.1186/s12880-025-01587-3.


A Systematic Review of the Applications of Deep Learning for the Interpretation of Positron Emission Tomography Images of Patients with Lymphoma.

Kanavos T, Birbas E, Zanos T Cancers (Basel). 2025; 17(1.

PMID: 39796698 PMC: 11719749. DOI: 10.3390/cancers17010069.


Clinical scoring systems, molecular subtypes and baseline [F]FDG PET/CT image analysis for prognosis of diffuse large B-cell lymphoma.

Sun Z, Yang T, Ding C, Shi Y, Cheng L, Jia Q Cancer Imaging. 2024; 24(1):168.

PMID: 39696503 PMC: 11656546. DOI: 10.1186/s40644-024-00810-8.

References
1.
de Jesus F, Yin Y, Mantzorou-Kyriaki E, Kahle X, de Haas R, Yakar D . Machine learning in the differentiation of follicular lymphoma from diffuse large B-cell lymphoma with radiomic [F]FDG PET/CT features. Eur J Nucl Med Mol Imaging. 2021; 49(5):1535-1543. DOI: 10.1007/s00259-021-05626-3. View

2.
Eisenhauer E, Therasse P, Bogaerts J, Schwartz L, Sargent D, Ford R . New response evaluation criteria in solid tumours: revised RECIST guideline (version 1.1). Eur J Cancer. 2008; 45(2):228-47. DOI: 10.1016/j.ejca.2008.10.026. View

3.
Young H, Baum R, Cremerius U, Herholz K, Hoekstra O, Lammertsma A . Measurement of clinical and subclinical tumour response using [18F]-fluorodeoxyglucose and positron emission tomography: review and 1999 EORTC recommendations. European Organization for Research and Treatment of Cancer (EORTC) PET Study Group. Eur J Cancer. 2000; 35(13):1773-82. DOI: 10.1016/s0959-8049(99)00229-4. View

4.
Martensson G, Ferreira D, Granberg T, Cavallin L, Oppedal K, Padovani A . The reliability of a deep learning model in clinical out-of-distribution MRI data: A multicohort study. Med Image Anal. 2020; 66:101714. DOI: 10.1016/j.media.2020.101714. View

5.
Cha K, Hadjiiski L, Chan H, Weizer A, Alva A, Cohan R . Bladder Cancer Treatment Response Assessment in CT using Radiomics with Deep-Learning. Sci Rep. 2017; 7(1):8738. PMC: 5562694. DOI: 10.1038/s41598-017-09315-w. View