» Articles » PMID: 38796533

REAL-Colon: A Dataset for Developing Real-world AI Applications in Colonoscopy

Overview
Journal Sci Data
Specialty Science
Date 2024 May 25
PMID 38796533
Authors
Affiliations
Soon will be listed here.
Abstract

Detection and diagnosis of colon polyps are key to preventing colorectal cancer. Recent evidence suggests that AI-based computer-aided detection (CADe) and computer-aided diagnosis (CADx) systems can enhance endoscopists' performance and boost colonoscopy effectiveness. However, most available public datasets primarily consist of still images or video clips, often at a down-sampled resolution, and do not accurately represent real-world colonoscopy procedures. We introduce the REAL-Colon (Real-world multi-center Endoscopy Annotated video Library) dataset: a compilation of 2.7 M native video frames from sixty full-resolution, real-world colonoscopy recordings across multiple centers. The dataset contains 350k bounding-box annotations, each created under the supervision of expert gastroenterologists. Comprehensive patient clinical data, colonoscopy acquisition information, and polyp histopathological information are also included in each video. With its unprecedented size, quality, and heterogeneity, the REAL-Colon dataset is a unique resource for researchers and developers aiming to advance AI research in colonoscopy. Its openness and transparency facilitate rigorous and reproducible research, fostering the development and benchmarking of more accurate and reliable colonoscopy-related algorithms and models.

Citing Articles

Assessing clinical efficacy of polyp detection models using open-access datasets.

Marchese Aizenman G, Salvagnini P, Cherubini A, Biffi C Front Oncol. 2024; 14:1422942.

PMID: 39148908 PMC: 11324571. DOI: 10.3389/fonc.2024.1422942.


REAL-Colon: A dataset for developing real-world AI applications in colonoscopy.

Biffi C, Antonelli G, Bernhofer S, Hassan C, Hirata D, Iwatate M Sci Data. 2024; 11(1):539.

PMID: 38796533 PMC: 11127922. DOI: 10.1038/s41597-024-03359-0.

References
1.
Silva J, Histace A, Romain O, Dray X, Granado B . Toward embedded detection of polyps in WCE images for early diagnosis of colorectal cancer. Int J Comput Assist Radiol Surg. 2013; 9(2):283-93. DOI: 10.1007/s11548-013-0926-3. View

2.
. The Paris endoscopic classification of superficial neoplastic lesions: esophagus, stomach, and colon: November 30 to December 1, 2002. Gastrointest Endosc. 2003; 58(6 Suppl):S3-43. DOI: 10.1016/s0016-5107(03)02159-x. View

3.
Ali S, Jha D, Ghatwary N, Realdon S, Cannizzaro R, Salem O . A multi-centre polyp detection and segmentation dataset for generalisability assessment. Sci Data. 2023; 10(1):75. PMC: 9902556. DOI: 10.1038/s41597-023-01981-y. View

4.
Mesejo P, Pizarro D, Abergel A, Rouquette O, Beorchia S, Poincloux L . Computer-Aided Classification of Gastrointestinal Lesions in Regular Colonoscopy. IEEE Trans Med Imaging. 2016; 35(9):2051-2063. DOI: 10.1109/TMI.2016.2547947. View

5.
Wallace M, Sharma P, Bhandari P, East J, Antonelli G, Lorenzetti R . Impact of Artificial Intelligence on Miss Rate of Colorectal Neoplasia. Gastroenterology. 2022; 163(1):295-304.e5. DOI: 10.1053/j.gastro.2022.03.007. View