» Articles » PMID: 35436184

ResViT: Residual Vision Transformers for Multimodal Medical Image Synthesis

Overview
Date 2022 Apr 18
PMID 35436184
Authors
Affiliations
Soon will be listed here.
Abstract

Generative adversarial models with convolutional neural network (CNN) backbones have recently been established as state-of-the-art in numerous medical image synthesis tasks. However, CNNs are designed to perform local processing with compact filters, and this inductive bias compromises learning of contextual features. Here, we propose a novel generative adversarial approach for medical image synthesis, ResViT, that leverages the contextual sensitivity of vision transformers along with the precision of convolution operators and realism of adversarial learning. ResViT's generator employs a central bottleneck comprising novel aggregated residual transformer (ART) blocks that synergistically combine residual convolutional and transformer modules. Residual connections in ART blocks promote diversity in captured representations, while a channel compression module distills task-relevant information. A weight sharing strategy is introduced among ART blocks to mitigate computational burden. A unified implementation is introduced to avoid the need to rebuild separate synthesis models for varying source-target modality configurations. Comprehensive demonstrations are performed for synthesizing missing sequences in multi-contrast MRI, and CT images from MRI. Our results indicate superiority of ResViT against competing CNN- and transformer-based methods in terms of qualitative observations and quantitative metrics.

Citing Articles

Semantic structure preservation for accurate multi-modal glioma diagnosis.

Shi C, Zhang X, Zhao R, Zhang W, Chen F Sci Rep. 2025; 15(1):7185.

PMID: 40021688 PMC: 11871068. DOI: 10.1038/s41598-025-88458-7.


GBCHV an advanced deep learning anatomy aware model for accurate classification of gallbladder cancer utilizing ultrasound images.

Hasan M, Rony M, Chowa S, Bhuiyan M, Moustafa A Sci Rep. 2025; 15(1):7120.

PMID: 40016258 PMC: 11868569. DOI: 10.1038/s41598-025-89232-5.


InspirationOnly: synthesizing expiratory CT from inspiratory CT to estimate parametric response map.

Zhang T, Pang H, Wu Y, Xu J, Liang Z, Xia S Med Biol Eng Comput. 2025; .

PMID: 39961910 DOI: 10.1007/s11517-025-03322-0.


PortNet: Achieving lightweight architecture and high accuracy in lung cancer cell classification.

Zhao K, Si Y, Sun L, Meng X Heliyon. 2025; 11(3):e41850.

PMID: 39931476 PMC: 11808607. DOI: 10.1016/j.heliyon.2025.e41850.


Patch-based dual-domain photon-counting CT data correction with residual-based WGAN-ViT.

Morovati B, Li M, Han S, Zhou L, Wang D, Wang G Phys Med Biol. 2025; 70(4).

PMID: 39874670 PMC: 11800073. DOI: 10.1088/1361-6560/adaf71.