» Articles » PMID: 36873900

Restaining-based Annotation for Cancer Histology Segmentation to Overcome Annotation-related Limitations Among Pathologists

Abstract

Numerous cancer histopathology specimens have been collected and digitized over the past few decades. A comprehensive evaluation of the distribution of various cells in tumor tissue sections can provide valuable information for understanding cancer. Deep learning is suitable for achieving these goals; however, the collection of extensive, unbiased training data is hindered, thus limiting the production of accurate segmentation models. This study presents SegPath-the largest annotation dataset (>10 times larger than publicly available annotations)-for the segmentation of hematoxylin and eosin (H&E)-stained sections for eight major cell types in cancer tissue. The SegPath generating pipeline used H&E-stained sections that were destained and subsequently immunofluorescence-stained with carefully selected antibodies. We found that SegPath is comparable with, or outperforms, pathologist annotations. Moreover, annotations by pathologists are biased toward typical morphologies. However, the model trained on SegPath can overcome this limitation. Our results provide foundational datasets for machine-learning research in histopathology.

Citing Articles

Colorectal cancer tumor grade segmentation: A new dataset and baseline results.

Arslan D, Sehlaver S, Guder E, Temena M, Bahcekapili A, Ozdemir U Heliyon. 2025; 11(4):e42467.

PMID: 40061913 PMC: 11889546. DOI: 10.1016/j.heliyon.2025.e42467.


A novel dataset for nuclei and tissue segmentation in melanoma with baseline nuclei segmentation and tissue segmentation benchmarks.

Schuiveling M, Liu H, Eek D, Breimer G, Suijkerbuijk K, Blokx W Gigascience. 2025; 14.

PMID: 39970004 PMC: 11837757. DOI: 10.1093/gigascience/giaf011.


Pathology Foundation Models.

Ochi M, Komura D, Ishikawa S JMA J. 2025; 8(1):121-130.

PMID: 39926091 PMC: 11799676. DOI: 10.31662/jmaj.2024-0206.


Machine learning methods for histopathological image analysis: Updates in 2024.

Komura D, Ochi M, Ishikawa S Comput Struct Biotechnol J. 2025; 27:383-400.

PMID: 39897057 PMC: 11786909. DOI: 10.1016/j.csbj.2024.12.033.


A hierarchically annotated dataset drives tangled filament recognition in digital neuron reconstruction.

Chen W, Liao M, Bao S, An S, Li W, Liu X Patterns (N Y). 2024; 5(8):101007.

PMID: 39233689 PMC: 11368685. DOI: 10.1016/j.patter.2024.101007.


References
1.
Giraldo N, Sanchez-Salas R, Peske J, Vano Y, Becht E, Petitprez F . The clinical role of the TME in solid cancer. Br J Cancer. 2018; 120(1):45-53. PMC: 6325164. DOI: 10.1038/s41416-018-0327-z. View

2.
Uhlen M, Fagerberg L, Hallstrom B, Lindskog C, Oksvold P, Mardinoglu A . Proteomics. Tissue-based map of the human proteome. Science. 2015; 347(6220):1260419. DOI: 10.1126/science.1260419. View

3.
Bulten W, Bandi P, Hoven J, van de Loo R, Lotz J, Weiss N . Epithelium segmentation using deep learning in H&E-stained prostate specimens with immunohistochemistry as reference standard. Sci Rep. 2019; 9(1):864. PMC: 6351532. DOI: 10.1038/s41598-018-37257-4. View

4.
Diao J, Wang J, Chui W, Mountain V, Gullapally S, Srinivasan R . Human-interpretable image features derived from densely mapped cancer pathology slides predict diverse molecular phenotypes. Nat Commun. 2021; 12(1):1613. PMC: 7955068. DOI: 10.1038/s41467-021-21896-9. View

5.
Cifci D, Foersch S, Kather J . Artificial intelligence to identify genetic alterations in conventional histopathology. J Pathol. 2022; 257(4):430-444. DOI: 10.1002/path.5898. View