» Articles » PMID: 39696039

Enhanced Cross-stage-attention U-Net for Esophageal Target Volume Segmentation

Overview
Journal BMC Med Imaging
Publisher Biomed Central
Specialty Radiology
Date 2024 Dec 19
PMID 39696039
Authors
Affiliations
Soon will be listed here.
Abstract

Purpose: The segmentation of target volume and organs at risk (OAR) was a significant part of radiotherapy. Specifically, determining the location and scale of the esophagus in simulated computed tomography images was difficult and time-consuming primarily due to its complex structure and low contrast with the surrounding tissues. In this study, an Enhanced Cross-stage-attention U-Net was proposed to solve the segmentation problem for the esophageal gross tumor volume (GTV) and clinical tumor volume (CTV) in CT images.

Methods: First, a module based on principal component analysis theory was constructed to pre-extract the features of the input image. Then, a cross-stage based feature fusion model was designed to replace the skip concatenation of original UNet, which was composed of Wide Range Attention unit, Small-kernel Local Attention unit, and Inverted Bottleneck unit. WRA was employed to capture global attention, whose large convolution kernel was further decomposed to simplify the calculation. SLA was used to complement the local attention to WRA. IBN was structed to fuse the extracted features, where a global frequency response layer was built to redistribute the frequency response of the fused feature maps.

Results: The proposed method was compared with relevant published esophageal segmentation methods. The prediction of the proposed network was MSD = 2.83(1.62, 4.76)mm, HD = 11.79 ± 6.02 mm, DC = 72.45 ± 19.18% in GTV; MSD = 5.26(2.18, 8.82)mm, HD = 16.22 ± 10.01 mm, DC = 71.06 ± 17.72% in CTV.

Conclusion: The reconstruction of the skip concatenation in UNet showed an improvement of performance for esophageal segmentation. The results showed the proposed network had better effect on esophageal GTV and CTV segmentation.

References
1.
You C, Zhao R, Staib L, Duncan J . Momentum Contrastive Voxel-wise Representation Learning for Semi-supervised Volumetric Medical Image Segmentation. Med Image Comput Comput Assist Interv. 2023; 13434:639-652. PMC: 10352821. DOI: 10.1007/978-3-031-16440-8_61. View

2.
Jian M, Tao C, Wu R, Zhang H, Li X, Wang R . HRU-Net: A high-resolution convolutional neural network for esophageal cancer radiotherapy target segmentation. Comput Methods Programs Biomed. 2024; 250:108177. DOI: 10.1016/j.cmpb.2024.108177. View

3.
Yang J, Haas B, Fang R, Beadle B, Garden A, Liao Z . Atlas ranking and selection for automatic segmentation of the esophagus from CT scans. Phys Med Biol. 2017; 62(23):9140-9158. PMC: 6167015. DOI: 10.1088/1361-6560/aa94ba. View

4.
Diniz J, Ferreira J, Diniz P, Silva A, Cardoso de Paiva A . Esophagus segmentation from planning CT images using an atlas-based deep learning approach. Comput Methods Programs Biomed. 2020; 197:105685. DOI: 10.1016/j.cmpb.2020.105685. View

5.
You C, Dai W, Min Y, Staib L, Duncan J . Implicit Anatomical Rendering for Medical Image Segmentation with Stochastic Experts. Med Image Comput Comput Assist Interv. 2024; 14222:561-571. PMC: 11151725. DOI: 10.1007/978-3-031-43898-1_54. View