Vision-based segmentation of the robotic tool during robot-assisted surgery enables downstream applications, such as augmented reality feedback, while allowing for inaccuracies in robot kinematics. With the introduction of deep learning, many methods were presented to solve instrument segmentation directly and solely from images. While these approaches made remarkable progress on benchmark datasets, fundamental challenges pertaining to their robustness remain. We present CaRTS, a causality-driven robot tool segmentation algorithm, that is designed based on a complementary causal model of the robot tool segmentation task. Rather than directly inferring segmentation masks from observed images, CaRTS iteratively aligns tool models with image observations by updating the initially incorrect robot kinematic parameters through forward kinematics and differentiable rendering to optimize image feature similarity end-to-end. We benchmark CaRTS with competing techniques on both synthetic as well as real data from the dVRK, generated in precisely controlled scenarios to allow for counterfactual synthesis. On training-domain test data, CaRTS achieves a Dice score of 93.4 that is preserved well (Dice score of 91.8) when tested on counterfactually altered test data, exhibiting low brightness, smoke, blood, and altered background patterns. This compares favorably to Dice scores of 95.0 and 86.7, respectively, of the SOTA image-based method. Future work will involve accelerating CaRTS to achieve video framerate and estimating the impact occlusion has in practice. Despite these limitations, our results are promising: In addition to achieving high segmentation accuracy, CaRTS provides estimates of the true robot kinematics, which may benefit applications such as force estimation. Code is available at: https://github.com/hding2455/CaRTS
翻译:在机器人辅助外科手术期间,机器人工具基于视觉的分割使得下游应用能够进行,例如增加现实反馈,同时允许机器人运动学中的不准确性。随着深层学习的引入,提出了许多方法,直接和完全从图像中解决仪器分解问题。虽然这些方法在基准数据集方面取得了显著进展,但与其稳健性相关的基本挑战依然存在。我们介绍了基于机器人工具分解任务互补因因果关系驱动的机器人工具分解算法(CaRTS ),而不是直接从所观察到的图像中推断分解面,而使CaRTS 与图像观察的代谢性模型一致起来。CaRTS 通过前方运动和可变的图像分解法更新最初不正确的机器人运动参数,直接解决仪器分解,直接解决仪器分解问题。我们以合成和真实性数据基调和真实性数据为基准,这在精确的假设的基础上产生反真实性合成。在培训多功能测试数据中,CRTS 将获得93.4分级分数(D级分数为91.8分数 ) 。在测试时,在快速的缩缩缩算中将获得快速的缩略性数据,在测试中可以实现历史分算。