» Articles » PMID: 39698626

Not Another Dual Attention UNet Transformer (NNDA-UNETR): a Plug-and-play Parallel Dual Attention Block in U-Net with Enhanced Residual Blocks for Medical Image Segmentation

Overview
Specialty Radiology
Date 2024 Dec 19
PMID 39698626
Authors
Affiliations
Soon will be listed here.
Abstract

Background: Medical image segmentation is crucial for clinical diagnostics and treatment planning. Recently, hybrid models often neglect the local modeling capabilities of Transformers for medical image segmentation, despite the complementary nature of local information from both convolutional neural networks (CNNs) and transformers. This limitation is particularly problematic in multi-organ segmentation, where organs are closely adhered, and accurate delineation is essential. This study aims to develop a novel method that leverages the strengths of both CNNs and transformers to address the challenges of segmenting multiple closely adhered organs, improving accuracy and robustness in multi-organ segmentation.

Methods: In this study, we present the Not Another Dual Attention block (NNDA-block), a versatile, plug-and-play module that seamlessly integrates channel and spatial attention mechanisms. This block can be incorporated at any stage within the Not Another Dual Attention UNet Transformer (NNDA-UNETR) framework. Our novel approach to spatial attention uniquely combines local modeling, significantly reducing detail and texture loss during the up- and down-sampling processes typical in U-shaped architectures.

Results: Evaluations on the Beyond the Cranial Vault (BTCV) and Automatic Cardiac Diagnosis Challenge benchmark datasets demonstrate our method's effectiveness in balancing model complexity with accuracy. On the BTCV dataset, our model sets new state-of-the-art benchmarks, achieving a Dice similarity coefficient of 84.13%, normalized surface dice of 86.96%, mean average surface distance of 3.46 mm, and an Hausdorff distance at 95% of 17.76 mm. Moreover, it reduces the parameter count by 34.2% compared to leading contemporary methods, all while maintaining low computational costs [measured in floating point operations per second (FLOPs)].

Conclusions: NNDA-UNETR offers a robust solution for accurate segmentation in multi-organ tasks, particularly where organ adhesion poses challenges. Its lightweight design also makes it well-suited for deployment in real-world medical environments with limited computational resources.

References
1.
Ciecholewski M, Kassjanski M . Computational Methods for Liver Vessel Segmentation in Medical Imaging: A Review. Sensors (Basel). 2021; 21(6). PMC: 7999381. DOI: 10.3390/s21062027. View

2.
Verma R, Kumar N, Patil A, Kurian N, Rane S, Graham S . MoNuSAC2020: A Multi-Organ Nuclei Segmentation and Classification Challenge. IEEE Trans Med Imaging. 2021; 40(12):3413-3423. DOI: 10.1109/TMI.2021.3085712. View

3.
Huang X, Deng Z, Li D, Yuan X, Fu Y . MISSFormer: An Effective Transformer for 2D Medical Image Segmentation. IEEE Trans Med Imaging. 2023; 42(5):1484-1494. DOI: 10.1109/TMI.2022.3230943. View

4.
Zunair H, Ben Hamza A . Sharp U-Net: Depthwise convolutional network for biomedical image segmentation. Comput Biol Med. 2021; 136:104699. DOI: 10.1016/j.compbiomed.2021.104699. View

5.
Conze P, Kavur A, Cornec-Le Gall E, Gezer N, Le Meur Y, Selver M . Abdominal multi-organ segmentation with cascaded convolutional and adversarial deep networks. Artif Intell Med. 2021; 117:102109. DOI: 10.1016/j.artmed.2021.102109. View