» Articles » PMID: 38870441

Associations Between Radiation Oncologist Demographic Factors and Segmentation Similarity Benchmarks: Insights From a Crowd-Sourced Challenge Using Bayesian Estimation

Abstract

Purpose: The quality of radiotherapy auto-segmentation training data, primarily derived from clinician observers, is of utmost importance. However, the factors influencing the quality of clinician-derived segmentations are poorly understood; our study aims to quantify these factors.

Methods: Organ at risk (OAR) and tumor-related segmentations provided by radiation oncologists from the Contouring Collaborative for Consensus in Radiation Oncology data set were used. Segmentations were derived from five disease sites: breast, sarcoma, head and neck (H&N), gynecologic (GYN), and GI. Segmentation quality was determined on a structure-by-structure basis by comparing the observer segmentations with an expert-derived consensus, which served as a reference standard benchmark. The Dice similarity coefficient (DSC) was primarily used as a metric for the comparisons. DSC was stratified into binary groups on the basis of structure-specific expert-derived interobserver variability (IOV) cutoffs. Generalized linear mixed-effects models using Bayesian estimation were used to investigate the association between demographic variables and the binarized DSC for each disease site. Variables with a highest density interval excluding zero were considered to substantially affect the outcome measure.

Results: Five hundred seventy-four, 110, 452, 112, and 48 segmentations were used for the breast, sarcoma, H&N, GYN, and GI cases, respectively. The median percentage of segmentations that crossed the expert DSC IOV cutoff when stratified by structure type was 55% and 31% for OARs and tumors, respectively. Regression analysis revealed that the structure being tumor-related had a substantial negative impact on binarized DSC for the breast, sarcoma, H&N, and GI cases. There were no recurring relationships between segmentation quality and demographic variables across the cases, with most variables demonstrating large standard deviations.

Conclusion: Our study highlights substantial uncertainty surrounding conventionally presumed factors influencing segmentation quality relative to benchmarks.

References
1.
Nikolov S, Blackwell S, Zverovitch A, Mendes R, Livne M, De Fauw J . Clinically Applicable Segmentation of Head and Neck Anatomy for Radiotherapy: Deep Learning Algorithm Development and Validation Study. J Med Internet Res. 2021; 23(7):e26151. PMC: 8314151. DOI: 10.2196/26151. View

2.
Zhang Y, Cha E, Lynch K, Gennarelli R, Brower J, Sherer M . Attitudes and access to resources and strategies to improve quality of radiotherapy among US radiation oncologists: A mixed methods study. J Med Imaging Radiat Oncol. 2022; 66(7):993-1002. PMC: 9532345. DOI: 10.1111/1754-9485.13423. View

3.
Nissen C, Ying J, Kalantari F, Patel M, Prabhu A, Kesaria A . A Prospective Study Measuring Resident and Faculty Contour Concordance: A Potential Tool for Quantitative Assessment of Residents' Performance in Contouring and Target Delineation in Radiation Oncology Residency. J Am Coll Radiol. 2023; 21(3):464-472. DOI: 10.1016/j.jacr.2023.08.049. View

4.
Sherer M, Lin D, Elguindi S, Duke S, Tan L, Cacicedo J . Metrics to evaluate the performance of auto-segmentation for radiation treatment planning: A critical review. Radiother Oncol. 2021; 160:185-191. PMC: 9444281. DOI: 10.1016/j.radonc.2021.05.003. View

5.
Berry S, Boczkowski A, Ma R, Mechalakos J, Hunt M . Interobserver variability in radiation therapy plan output: Results of a single-institution study. Pract Radiat Oncol. 2016; 6(6):442-449. PMC: 5099085. DOI: 10.1016/j.prro.2016.04.005. View