» Articles » PMID: 36723384

Predicting Microsatellite Instability and Key Biomarkers in Colorectal Cancer from H&E-stained Images: Achieving State-of-the-art Predictive Performance with Fewer Data Using Swin Transformer

Overview
Specialty Pathology
Date 2023 Feb 1
PMID 36723384
Authors
Affiliations
Soon will be listed here.
Abstract

Many artificial intelligence models have been developed to predict clinically relevant biomarkers for colorectal cancer (CRC), including microsatellite instability (MSI). However, existing deep learning networks require large training datasets, which are often hard to obtain. In this study, based on the latest Hierarchical Vision Transformer using Shifted Windows (Swin Transformer [Swin-T]), we developed an efficient workflow to predict biomarkers in CRC (MSI, hypermutation, chromosomal instability, CpG island methylator phenotype, and BRAF and TP53 mutation) that required relatively small datasets. Our Swin-T workflow substantially achieved the state-of-the-art (SOTA) predictive performance in an intra-study cross-validation experiment on the Cancer Genome Atlas colon and rectal cancer dataset (TCGA-CRC-DX). It also demonstrated excellent generalizability in cross-study external validation and delivered a SOTA area under the receiver operating characteristic curve (AUROC) of 0.90 for MSI, using the Molecular and Cellular Oncology dataset for training (N = 1,065) and the TCGA-CRC-DX (N = 462) for testing. A similar performance (AUROC = 0.91) was reported in a recent study, using ~8,000 training samples (ResNet18) on the same testing dataset. Swin-T was extremely efficient when using small training datasets and exhibited robust predictive performance with 200-500 training samples. Our findings indicate that Swin-T could be 5-10 times more efficient than existing algorithms for MSI prediction based on ResNet18 and ShuffleNet. Furthermore, the Swin-T models demonstrated their capability in accurately predicting MSI and BRAF mutation status, which could exclude and therefore reduce samples before subsequent standard testing in a cascading diagnostic workflow, in turn reducing turnaround time and costs.

Citing Articles

Assessing Genotype-Phenotype Correlations with Deep Learning in Colorectal Cancer: A Multi-Centric Study.

Gustav M, van Treeck M, Reitsam N, Carrero Z, Loeffler C, Loeffler C medRxiv. 2025; .

PMID: 39973981 PMC: 11838662. DOI: 10.1101/2025.02.04.25321660.


Accurate colorectal cancer detection using a random hinge exponential distribution coupled attention network on pathological images.

Bharath E, Raja R, Kalaivanan K, Deshpande V Abdom Radiol (NY). 2025; .

PMID: 39779530 DOI: 10.1007/s00261-024-04770-2.


An interpretable deep learning model for detecting pathogenic variants of breast cancer from hematoxylin and eosin-stained pathological images.

Li Y, Xiong X, Liu X, Wu Y, Li X, Liu B PeerJ. 2024; 12:e18098.

PMID: 39484212 PMC: 11526788. DOI: 10.7717/peerj.18098.


Deep learning application in prediction of cancer molecular alterations based on pathological images: a bibliographic analysis via CiteSpace.

Xiaojian Y, Zhanbo Q, Jian C, Zefeng W, Jian L, Jin L J Cancer Res Clin Oncol. 2024; 150(10):467.

PMID: 39422817 PMC: 11489169. DOI: 10.1007/s00432-024-05992-z.


Development and deployment of a histopathology-based deep learning algorithm for patient prescreening in a clinical trial.

Juan Ramon A, Parmar C, Carrasco-Zevallos O, Csiszer C, Yip S, Raciti P Nat Commun. 2024; 15(1):4690.

PMID: 38824132 PMC: 11144215. DOI: 10.1038/s41467-024-49153-9.


References
1.
Yamashita R, Long J, Longacre T, Peng L, Berry G, Martin B . Deep learning model for the prediction of microsatellite instability in colorectal cancer: a diagnostic study. Lancet Oncol. 2021; 22(1):132-141. DOI: 10.1016/S1470-2045(20)30535-0. View

2.
Sirinukunwattana K, Domingo E, Richman S, Redmond K, Blake A, Verrill C . Image-based consensus molecular subtype (imCMS) classification of colorectal cancer using deep learning. Gut. 2020; 70(3):544-554. PMC: 7873419. DOI: 10.1136/gutjnl-2019-319866. View

3.
Kather J, Heij L, Grabsch H, Loeffler C, Echle A, Muti H . Pan-cancer image-based detection of clinically actionable genetic alterations. Nat Cancer. 2021; 1(8):789-799. PMC: 7610412. DOI: 10.1038/s43018-020-0087-6. View

4.
Andre T, Shiu K, Kim T, Jensen B, Jensen L, Punt C . Pembrolizumab in Microsatellite-Instability-High Advanced Colorectal Cancer. N Engl J Med. 2020; 383(23):2207-2218. DOI: 10.1056/NEJMoa2017699. View

5.
Laleh N, Muti H, Loeffler C, Echle A, Saldanha O, Mahmood F . Benchmarking weakly-supervised deep learning pipelines for whole slide classification in computational pathology. Med Image Anal. 2022; 79:102474. DOI: 10.1016/j.media.2022.102474. View