
Transforming Medical Imaging with Transformers? A Comparative Review of Key Properties, Current Progresses, and Future Perspectives

Overview
Journal: Med Image Anal
Publisher: Elsevier
Specialty: Radiology
Date: 2023 Feb 4
PMID: 36738650
Abstract

Transformers, one of the latest technological advances in deep learning, have gained prevalence in natural language processing and computer vision. Since medical imaging bears some resemblance to computer vision, it is natural to inquire about the status quo of Transformers in medical imaging and to ask: can Transformer models transform medical imaging? In this paper, we attempt to answer this question. After briefly introducing the fundamentals of Transformers, especially in comparison with convolutional neural networks (CNNs), and highlighting the key defining properties that characterize Transformers, we offer a comprehensive review of state-of-the-art Transformer-based approaches for medical imaging and present current research progress in medical image segmentation, recognition, detection, registration, reconstruction, enhancement, and other areas. In particular, our review is distinguished by its organization, which is based on the Transformer's key defining properties, mostly derived from comparing the Transformer with the CNN, and on the type of architecture, which specifies how the Transformer and CNN are combined. This organization helps readers understand the rationale behind the reviewed approaches. We conclude with a discussion of future perspectives.

Citing Articles

Early Enhancement in Contrast-Enhanced Computed Tomography Is an Index of , , , and Expression in Canine Hepatocellular Carcinoma: A Preliminary Study.

Tanaka T, Motegi T, Sumikawa N, Mori M, Kurokawa S, Akiyoshi H Vet Sci. 2025; 12(2).

PMID: 40005897 PMC: 11860268. DOI: 10.3390/vetsci12020137.


Overcoming Neuroanatomical Mapping and Computational Barriers in Human Brain Synaptic Architecture.

Kumar R, Waisberg E, Ong J, Paladugu P, Amiri D, Jagadeesan R Neuroinformatics. 2025; 23(2):22.

PMID: 39998695 DOI: 10.1007/s12021-025-09715-8.


Applications of Artificial Intelligence, Deep Learning, and Machine Learning to Support the Analysis of Microscopic Images of Cells and Tissues.

Ali M, Benfante V, Basirinia G, Alongi P, Sperandeo A, Quattrocchi A J Imaging. 2025; 11(2).

PMID: 39997561 PMC: 11856378. DOI: 10.3390/jimaging11020059.


Feasibility of generating sagittal radiographs from coronal views using GAN-based deep learning framework in adolescent idiopathic scoliosis.

Bassani T, Cina A, Galbusera F, Cazzato A, Pellegrino M, Albano D Eur Radiol Exp. 2025; 9(1):11.

PMID: 39881022 PMC: 11780070. DOI: 10.1186/s41747-025-00553-6.


Systematic Review of Hybrid Vision Transformer Architectures for Radiological Image Analysis.

Kim J, Khan A, Banerjee I J Imaging Inform Med. 2025; .

PMID: 39871042 DOI: 10.1007/s10278-024-01322-4.

