Enhancing Medical Image Segmentation with a Multi-transformer U-Net

Overview

Journal PeerJ

Specialties Biology, Environmental Health, General Medicine

Date 2024 Mar 4

PMID 38435997

Authors

Yongping Dan

Weishou Jin

Xuebin Yue

Zhida Wang


Abstract

Segmentation networks based on the Swin Transformer have shown promise in medical image segmentation, but they still suffer from limited accuracy and slow training convergence. To address these issues, we propose an approach that combines the Swin Transformer with the Deformable Transformer. The Swin Transformer's window attention mechanism captures local feature information, while the Deformable Transformer adjusts sampling positions dynamically, which accelerates convergence and aligns the model more closely with object shapes and sizes. By combining both Transformer modules and adding extra skip connections to reduce information loss, the proposed model segments CT and X-ray lung images quickly and accurately. Experimental results show that it outperforms Swin Unet, which relies on the Swin Transformer alone, and converges faster under identical conditions, with accuracy improvements of 0.7% (to 88.18%) and 2.7% (to 98.01%) on the COVID-19 CT scan lesion segmentation dataset and the Chest X-ray Masks and Labels dataset, respectively. This advance could help medical practitioners with early diagnosis and treatment decision-making.
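The two mechanisms named in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function names (window_partition, bilinear_sample, deformable_sample), the single-channel list-of-rows feature map, and the border clamping are all assumptions made for illustration. In a trained model the offsets and weights would be predicted from the query features, whereas here they are passed in by hand.

```python
import math


def window_partition(feature_map, window_size):
    """Split an H x W grid (list of rows) into non-overlapping square windows.

    Window attention restricts each token to attend within its own window;
    this helper only shows the partitioning step, in row-major window order.
    """
    height = len(feature_map)
    width = len(feature_map[0])
    if height % window_size or width % window_size:
        raise ValueError("H and W must be divisible by window_size")
    windows = []
    for top in range(0, height, window_size):
        for left in range(0, width, window_size):
            windows.append([row[left:left + window_size]
                            for row in feature_map[top:top + window_size]])
    return windows


def bilinear_sample(feature_map, y, x):
    """Read a value at a fractional (y, x) location by bilinear interpolation.

    Locations outside the grid are clamped to the border (an assumption).
    """
    height = len(feature_map)
    width = len(feature_map[0])
    y = min(max(y, 0.0), height - 1.0)
    x = min(max(x, 0.0), width - 1.0)
    y0, x0 = int(math.floor(y)), int(math.floor(x))
    y1, x1 = min(y0 + 1, height - 1), min(x0 + 1, width - 1)
    wy, wx = y - y0, x - x0
    top = (1 - wx) * feature_map[y0][x0] + wx * feature_map[y0][x1]
    bottom = (1 - wx) * feature_map[y1][x0] + wx * feature_map[y1][x1]
    return (1 - wy) * top + wy * bottom


def deformable_sample(feature_map, reference, offsets, weights):
    """Weighted sum of values sampled at reference + offset for each offset.

    Hypothetical stand-in for one deformable attention head: a few sampling
    points around a reference location replace attention over every position.
    """
    ref_y, ref_x = reference
    return sum(w * bilinear_sample(feature_map, ref_y + dy, ref_x + dx)
               for (dy, dx), w in zip(offsets, weights))


if __name__ == "__main__":
    grid = [[r * 4 + c for c in range(4)] for r in range(4)]
    print(window_partition(grid, 2)[0])
    print(deformable_sample(grid, (1, 1), [(0, 0), (0.5, 0.5)], [0.5, 0.5]))
```

Partitioning keeps attention cost tied to the window size rather than the full image, while sampling at a small set of movable points lets the receptive field follow the shape of the object being segmented.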

Citing Articles

IDCC-SAM: A Zero-Shot Approach for Cell Counting in Immunocytochemistry Dataset Using the Segment Anything Model.

Fanijo S, Jannesari A, Dickerson J Bioengineering (Basel). 2025; 12(2).

PMID: 40001703 PMC: 11851800. DOI: 10.3390/bioengineering12020184.

Flood change detection model based on an improved U-net network and multi-head attention mechanism.

Wang F, Feng X Sci Rep. 2025; 15(1):3295.

PMID: 39865097 PMC: 11770074. DOI: 10.1038/s41598-025-87851-6.

Joint segmentation of sternocleidomastoid and skeletal muscles in computed tomography images using a multiclass learning approach.

Ashino K, Kamiya N, Zhou X, Kato H, Hara T, Fujita H Radiol Phys Technol. 2024; 17(4):854-861.

PMID: 39242477 PMC: 11579203. DOI: 10.1007/s12194-024-00839-1.
