Deep Learning-based Recognition of Key Anatomical Structures During Robot-assisted Minimally Invasive Esophagectomy
Overview
General Surgery
Radiology
Authors
Affiliations
Objective: To develop a deep learning algorithm for anatomy recognition in thoracoscopic video frames from robot-assisted minimally invasive esophagectomy (RAMIE) procedures using deep learning.
Background: RAMIE is a complex operation with substantial perioperative morbidity and a considerable learning curve. Automatic anatomy recognition may improve surgical orientation and recognition of anatomical structures and might contribute to reducing morbidity or learning curves. Studies regarding anatomy recognition in complex surgical procedures are currently lacking.
Methods: Eighty-three videos of consecutive RAMIE procedures between 2018 and 2022 were retrospectively collected at University Medical Center Utrecht. A surgical PhD candidate and an expert surgeon annotated the azygos vein and vena cava, aorta, and right lung on 1050 thoracoscopic frames. 850 frames were used for training of a convolutional neural network (CNN) to segment the anatomical structures. The remaining 200 frames of the dataset were used for testing the CNN. The Dice and 95% Hausdorff distance (95HD) were calculated to assess algorithm accuracy.
Results: The median Dice of the algorithm was 0.79 (IQR = 0.20) for segmentation of the azygos vein and/or vena cava. A median Dice coefficient of 0.74 (IQR = 0.86) and 0.89 (IQR = 0.30) were obtained for segmentation of the aorta and lung, respectively. Inference time was 0.026 s (39 Hz). The prediction of the deep learning algorithm was compared with the expert surgeon annotations, showing an accuracy measured in median Dice of 0.70 (IQR = 0.19), 0.88 (IQR = 0.07), and 0.90 (0.10) for the vena cava and/or azygos vein, aorta, and lung, respectively.
Conclusion: This study shows that deep learning-based semantic segmentation has potential for anatomy recognition in RAMIE video frames. The inference time of the algorithm facilitated real-time anatomy recognition. Clinical applicability should be assessed in prospective clinical studies.
Chen H, Gou L, Fang Z, Dou Q, Chen H, Chen C NPJ Digit Med. 2025; 8(1):9.
PMID: 39757250 PMC: 11701130. DOI: 10.1038/s41746-024-01372-6.
Nishiya Y, Matsuura K, Ogane T, Hayashi K, Kinebuchi Y, Tanaka H Laryngoscope Investig Otolaryngol. 2024; 9(6):e70049.
PMID: 39640517 PMC: 11618636. DOI: 10.1002/lio2.70049.
Artificial intelligence assisted operative anatomy recognition in endoscopic pituitary surgery.
Khan D, Valetopoulou A, Das A, Hanrahan J, Williams S, Bano S NPJ Digit Med. 2024; 7(1):314.
PMID: 39521895 PMC: 11550325. DOI: 10.1038/s41746-024-01273-8.
Deep Learning for Image Analysis in the Diagnosis and Management of Esophageal Cancer.
Theocharopoulos C, Davakis S, Ziogas D, Theocharopoulos A, Foteinou D, Mylonakis A Cancers (Basel). 2024; 16(19).
PMID: 39409906 PMC: 11475041. DOI: 10.3390/cancers16193285.
Sadeghi A, Mank Q, Tuzcu A, Hofman J, Siregar S, Maat A JTCVS Tech. 2024; 26:121-125.
PMID: 39156519 PMC: 11329169. DOI: 10.1016/j.xjtc.2024.04.011.