Capsule Robot Pose and Mechanism State Detection in Ultrasound Using Attention-based Hierarchical Deep Learning
Authors
Affiliations
Ingestible robotic capsules with locomotion capabilities and on-board sampling mechanism have great potential for non-invasive diagnostic and interventional use in the gastrointestinal tract. Real-time tracking of capsule location and operational state is necessary for clinical application, yet remains a significant challenge. To this end, we propose an approach that can simultaneously determine the mechanism state and in-plane 2D pose of millimeter capsule robots in an anatomically representative environment using ultrasound imaging. Our work proposes an attention-based hierarchical deep learning approach and adapts the success of transfer learning towards solving the multi-task tracking problem with limited dataset. To train the neural networks, we generate a representative dataset of a robotic capsule within ex-vivo porcine stomachs. Experimental results show that the accuracy of capsule state classification is 97%, and the mean estimation errors for orientation and centroid position are 2.0 degrees and 0.24 mm (1.7% of the capsule's body length) on the hold-out test set. Accurate detection of the capsule while manipulated by an external magnet in a porcine stomach and colon is also demonstrated. The results suggest our proposed method has the potential for advancing the wireless capsule-based technologies by providing accurate detection of capsule robots in clinical scenarios.
Zheng Y, Cai Y, Yan Y, Chen S, Gong K JMIR Hum Factors. 2024; 11:e57670.
PMID: 39146009 PMC: 11362707. DOI: 10.2196/57670.
Sun Y, Zhang W, Gu J, Xia L, Cao Y, Zhu X Nat Commun. 2024; 15(1):1839.
PMID: 38424039 PMC: 10904804. DOI: 10.1038/s41467-024-46046-9.