Improving Feature Extraction from Histopathological Images Through a Fine-tuning ImageNet Model
Overview
Authors
Affiliations
Background: Due to lack of annotated pathological images, transfer learning has been the predominant approach in the field of digital pathology. Pre-trained neural networks based on ImageNet database are often used to extract "off-the-shelf" features, achieving great success in predicting tissue types, molecular features, and clinical outcomes, etc. We hypothesize that fine-tuning the pre-trained models using histopathological images could further improve feature extraction, and downstream prediction performance.
Methods: We used 100 000 annotated H&E image patches for colorectal cancer (CRC) to fine-tune a pre-trained Xception model via a 2-step approach. The features extracted from fine-tuned Xception (FTX-2048) model and Image-pretrained (IMGNET-2048) model were compared through: (1) tissue classification for H&E images from CRC, same image type that was used for fine-tuning; (2) prediction of immune-related gene expression, and (3) gene mutations for lung adenocarcinoma (LUAD). Five-fold cross validation was used for model performance evaluation. Each experiment was repeated 50 times.
Findings: The extracted features from the fine-tuned FTX-2048 exhibited significantly higher accuracy (98.4%) for predicting tissue types of CRC compared to the "off-the-shelf" features directly from Xception based on ImageNet database (96.4%) (P value = 2.2 × 10). Particularly, FTX-2048 markedly improved the accuracy for stroma from 87% to 94%. Similarly, features from FTX-2048 boosted the prediction of transcriptomic expression of immune-related genes in LUAD. For the genes that had significant relationships with image features (P < 0.05, n = 171), the features from the fine-tuned model improved the prediction for the majority of the genes (139; 81%). In addition, features from FTX-2048 improved prediction of mutation for 5 out of 9 most frequently mutated genes (STK11, TP53, LRP1B, NF1, and FAT1) in LUAD.
Conclusions: We proved the concept that fine-tuning the pretrained ImageNet neural networks with histopathology images can produce higher quality features and better prediction performance for not only the same-cancer tissue classification where similar images from the same cancer are used for fine-tuning, but also cross-cancer prediction for gene expression and mutation at patient level.
Nakamoto I, Chen H, Wang R, Guo Y, Chen W, Feng J Biomed Eng Online. 2025; 24(1):11.
PMID: 39915867 PMC: 11800529. DOI: 10.1186/s12938-025-01341-4.
Precise grading of non-muscle invasive bladder cancer with multi-scale pyramidal CNN.
Shalata A, Alksas A, Shehata M, Khater S, Ezzat O, Ali K Sci Rep. 2024; 14(1):25131.
PMID: 39448755 PMC: 11502747. DOI: 10.1038/s41598-024-77101-6.
Deep Learning in Hematology: From Molecules to Patients.
Wang J Clin Hematol Int. 2024; 6(4):19-42.
PMID: 39417017 PMC: 11477942. DOI: 10.46989/001c.124131.
Ashayeri H, Sobhi N, Plawiak P, Pedrammehr S, Alizadehsani R, Jafarizadeh A Cancers (Basel). 2024; 16(11).
PMID: 38893257 PMC: 11171544. DOI: 10.3390/cancers16112138.
Chlorogiannis D, Verras G, Tzelepi V, Chlorogiannis A, Apostolos A, Kotis K Prz Gastroenterol. 2024; 18(4):353-367.
PMID: 38572457 PMC: 10985751. DOI: 10.5114/pg.2023.130337.