» Articles » PMID: 39006998

Masked Pre-training of Transformers for Histology Image Analysis

Overview
Journal J Pathol Inform
Date 2024 Jul 15
PMID 39006998
Authors
Affiliations
Soon will be listed here.
Abstract

In digital pathology, whole-slide images (WSIs) are widely used for applications such as cancer diagnosis and prognosis prediction. Vision transformer (ViT) models have recently emerged as a promising method for encoding large regions of WSIs while preserving spatial relationships among patches. However, due to the large number of model parameters and limited labeled data, applying transformer models to WSIs remains challenging. In this study, we propose a pretext task to train the transformer model in a self-supervised manner. Our model, MaskHIT, uses the transformer output to reconstruct masked patches, measured by contrastive loss. We pre-trained MaskHIT model using over 7000 WSIs from TCGA and extensively evaluated its performance in multiple experiments, covering survival prediction, cancer subtype classification, and grade prediction tasks. Our experiments demonstrate that the pre-training procedure enables context-aware understanding of WSIs, facilitates the learning of representative histological features based on patch positions and visual patterns, and is essential for the ViT model to achieve optimal results on WSI-level tasks. The pre-trained MaskHIT surpasses various multiple instance learning approaches by 3% and 2% on survival prediction and cancer subtype classification tasks, and also outperforms recent state-of-the-art transformer-based methods. Finally, a comparison between the attention maps generated by the MaskHIT model with pathologist's annotations indicates that the model can accurately identify clinically relevant histological structures on the whole slide for each task.

Citing Articles

A novel framework for the automated characterization of Gram-stained blood culture slides using a large-scale vision transformer.

McMahon J, Tomita N, Tatishev E, Workman A, Costales C, Banaei N J Clin Microbiol. 2025; 63(3):e0151424.

PMID: 39992156 PMC: 11898657. DOI: 10.1128/jcm.01514-24.


A multi-model approach integrating whole-slide imaging and clinicopathologic features to predict breast cancer recurrence risk.

Goyal M, Marotti J, Workman A, Tooker G, Ramin S, Kuhn E NPJ Breast Cancer. 2024; 10(1):93.

PMID: 39426965 PMC: 11490577. DOI: 10.1038/s41523-024-00700-z.


Deep Learning for Grading Endometrial Cancer.

Goyal M, Tafe L, Feng J, Muller K, Hondelink L, Bentz J Am J Pathol. 2024; 194(9):1701-1711.

PMID: 38879079 PMC: 11373039. DOI: 10.1016/j.ajpath.2024.05.003.


Improving Representation Learning for Histopathologic Images with Cluster Constraints.

Wu W, Gao C, DiPalma J, Vosoughi S, Hassanpour S Proc IEEE Int Conf Comput Vis. 2024; 2023:21347-21357.

PMID: 38694561 PMC: 11062482. DOI: 10.1109/iccv51070.2023.01957.


A survey of Transformer applications for histopathological image analysis: New developments and future directions.

Atabansi C, Nie J, Liu H, Song Q, Yan L, Zhou X Biomed Eng Online. 2023; 22(1):96.

PMID: 37749595 PMC: 10518923. DOI: 10.1186/s12938-023-01157-0.

References
1.
Veta M, van Diest P, Jiwa M, Al-Janabi S, Pluim J . Mitosis Counting in Breast Cancer: Object-Level Interobserver Agreement and Comparison to an Automatic Method. PLoS One. 2016; 11(8):e0161286. PMC: 4987048. DOI: 10.1371/journal.pone.0161286. View

2.
Vu Q, Rajpoot K, Raza S, Rajpoot N . Handcrafted Histological Transformer (H2T): Unsupervised representation of whole slide images. Med Image Anal. 2023; 85:102743. DOI: 10.1016/j.media.2023.102743. View

3.
Dimitriou N, Arandjelovic O, Caie P . Deep Learning for Whole Slide Image Analysis: An Overview. Front Med (Lausanne). 2019; 6:264. PMC: 6882930. DOI: 10.3389/fmed.2019.00264. View

4.
Jiang S, Suriawinata A, Hassanpour S . MHAttnSurv: Multi-head attention for survival prediction using whole-slide pathology images. Comput Biol Med. 2023; 158:106883. PMC: 10148238. DOI: 10.1016/j.compbiomed.2023.106883. View

5.
Abels E, Pantanowitz L, Aeffner F, Zarella M, van der Laak J, Bui M . Computational pathology definitions, best practices, and recommendations for regulatory guidance: a white paper from the Digital Pathology Association. J Pathol. 2019; 249(3):286-294. PMC: 6852275. DOI: 10.1002/path.5331. View