
SfMDiffusion: Self-supervised Monocular Depth Estimation in Endoscopy Based on Diffusion Models

Overview
Publisher Springer
Date 2025 Feb 24
PMID 39994137
Abstract

Purpose: In laparoscopic surgery, accurate 3D reconstruction from endoscopic video is crucial for effective image-guided techniques. Current methods for monocular depth estimation (MDE) face challenges in complex surgical scenes, including limited training data, specular reflections, and varying illumination conditions.

Methods: We propose SfMDiffusion, a novel diffusion-based self-supervised framework for MDE. Our approach combines: (1) a denoising diffusion process guided by pseudo-ground-truth depth maps, (2) knowledge distillation from a pre-trained teacher model, and (3) discriminative priors to enhance estimation robustness. Our design enables accurate depth estimation without requiring ground-truth depth data during training.
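As a rough illustration of the kind of denoising diffusion process the Methods refer to, the sketch below shows the standard DDPM forward (noising) step applied to a depth map. It is not the authors' implementation: the linear beta schedule, the 1000-step horizon, the function names `linear_beta_schedule` and `q_sample`, and the random 4x4 array standing in for a pseudo-ground-truth depth map are all assumptions made for illustration.

```python
import numpy as np

def linear_beta_schedule(num_steps, beta_start=1e-4, beta_end=0.02):
    """Linearly spaced noise variances (an assumed schedule, not from the paper)."""
    return np.linspace(beta_start, beta_end, num_steps)

def q_sample(x0, t, alphas_cumprod, rng):
    """Standard DDPM forward step: x_t = sqrt(a_t) * x0 + sqrt(1 - a_t) * noise."""
    noise = rng.standard_normal(x0.shape)
    a_t = alphas_cumprod[t]
    x_t = np.sqrt(a_t) * x0 + np.sqrt(1.0 - a_t) * noise
    return x_t, noise

betas = linear_beta_schedule(1000)
alphas_cumprod = np.cumprod(1.0 - betas)

rng = np.random.default_rng(0)
# Hypothetical stand-in for a pseudo-ground-truth depth map (normalized to [0, 1]).
pseudo_depth = rng.uniform(0.0, 1.0, size=(4, 4))

# Noise the depth map at an intermediate timestep; a denoiser would be trained
# to predict `noise` (or the clean depth) from `noisy_depth` and the timestep.
noisy_depth, noise = q_sample(pseudo_depth, 500, alphas_cumprod, rng)
```

In a setup like this, the training target for the denoising network would be derived from the pseudo-ground-truth map rather than from measured depth, which is consistent with the abstract's statement that no ground-truth depth is needed during training.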

Results: Experiments on the SCARED and Hamlyn datasets demonstrate that SfMDiffusion achieves superior performance, with an absolute relative error (Abs Rel) of 0.049, a squared relative error (Sq Rel) of 0.366, and a root mean square error (RMSE) of 4.305 on the SCARED dataset, and an Abs Rel of 0.067, a Sq Rel of 0.800, and an RMSE of 7.465 on the Hamlyn dataset.
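For readers unfamiliar with these metrics, the following minimal sketch computes Abs Rel, Sq Rel, and RMSE using their commonly used definitions in the depth-estimation literature. The abstract does not state the exact evaluation protocol (for example, depth capping or median scaling of predictions), so none is applied here; the function name `depth_metrics` and the toy arrays are hypothetical.

```python
import numpy as np

def depth_metrics(pred, gt):
    """Abs Rel, Sq Rel, and RMSE over pixels with valid (positive) ground truth."""
    pred = np.asarray(pred, dtype=np.float64)
    gt = np.asarray(gt, dtype=np.float64)
    valid = gt > 0                       # ignore pixels without ground truth
    pred, gt = pred[valid], gt[valid]
    diff = pred - gt
    abs_rel = np.mean(np.abs(diff) / gt)   # mean(|d - d*| / d*)
    sq_rel = np.mean(diff ** 2 / gt)       # mean((d - d*)^2 / d*)
    rmse = np.sqrt(np.mean(diff ** 2))     # sqrt(mean((d - d*)^2))
    return {"abs_rel": float(abs_rel), "sq_rel": float(sq_rel), "rmse": float(rmse)}

# Toy example: one pixel has no ground truth (0.0) and is excluded.
gt = np.array([[10.0, 20.0], [0.0, 40.0]])
pred = np.array([[11.0, 18.0], [5.0, 40.0]])
metrics = depth_metrics(pred, gt)
```

Lower values are better for all three metrics. Abs Rel is dimensionless, while Sq Rel and RMSE carry the units of the depth maps.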

Conclusion: SfMDiffusion provides an innovative approach for 3D reconstruction in image-guided surgical techniques. Future work will focus on computational optimization and validation across diverse surgical scenarios. Our code is available at https://github.com/Skylanding/SfM-Diffusion.
