为立体立体立体立体立体立体立体立体立体立体立体立体立体改造机器人外科 (Neural Rendering for Stereo 3D Reconstruction of Deformable Tissues in Robotic Surgery)

Reconstruction of the soft tissues in robotic surgery from endoscopic stereo videos is important for many applications such as intra-operative navigation and image-guided robotic surgery automation. Previous works on this task mainly rely on SLAM-based approaches, which struggle to handle complex surgical scenes. Inspired by recent progress in neural rendering, we present a novel framework for deformable tissue reconstruction from binocular captures in robotic surgery under the single-viewpoint setting. Our framework adopts dynamic neural radiance fields to represent deformable surgical scenes in MLPs and optimize shapes and deformations in a learning-based manner. In addition to non-rigid deformations, tool occlusion and poor 3D clues from a single viewpoint are also particular challenges in soft tissue reconstruction. To overcome these difficulties, we present a series of strategies of tool mask-guided ray casting, stereo depth-cueing ray marching and stereo depth-supervised optimization. With experiments on DaVinci robotic surgery videos, our method significantly outperforms the current state-of-the-art reconstruction method for handling various complex non-rigid deformations. To our best knowledge, this is the first work leveraging neural rendering for surgical scene 3D reconstruction with remarkable potential demonstrated. Code is available at: https://github.com/med-air/EndoNeRF.

翻译：从内镜立体视频对机器人手术中的软组织进行重建,对于许多应用,例如操作内导航和图像制导机器人手术自动化等,非常重要。这项任务以前的工作主要依靠以SLAM为基础的方法,这些方法难以处理复杂的外科手术场景。在神经转化方面最近的进展的启发下,我们提出了一个新的框架,用于在单视点设置下对机器人手术中的双筒镜捕获进行可变组织重建。我们的框架采用了动态神经光亮场,以显示MLPs中可变形的外科手术场,并以基于学习的方式优化形状和变形。除了非硬化变形、工具隔离和单一观点差的三维线索之外,软组织重建中也存在特殊的挑战。为了克服这些困难,我们提出了一系列工具面具制导射线铸造、立体深度射线和立体深度超强优化的战略。通过DVinci机器人外科手术录像的实验,我们的方法大大超越了目前最先进的状态和最先进的重建方法。处理各种复杂、非硬体外科手术的内置技术改革工具,展示了RE-FMRFM 。展示了这一模型。

相关内容

三维重建

关注 1174

在计算机视觉中, 三维重建是指根据单视图或者多视图的图像重建三维信息的过程. 由于单视频的信息不完全,因此三维重建需要利用经验知识. 而多视图的三维重建(类似人的双目定位)相对比较容易, 其方法是先对摄像机进行标定, 即计算出摄像机的图象坐标系与世界坐标系的关系.然后利用多个二维图象中的信息重建出三维信息。物体三维重建是计算机辅助几何设计(CAGD)、计算机图形学(CG)、计算机动画、计算机视觉、医学图像处理、科学计算和虚拟现实、数字媒体创作等领域的共性科学问题和核心技术。在计算机内生成物体三维表示主要有两类方法。一类是使用几何建模软件通过人机交互生成人为控制下的物体三维几何模型,另一类是通过一定的手段获取真实物体的几何形状。前者实现技术已经十分成熟,现有若干软件支持,比如:3DMAX、Maya、AutoCAD、UG等等,它们一般使用具有数学表达式的曲线曲面表示几何形状。后者一般称为三维重建过程,三维重建是指利用二维投影恢复物体三维信息(形状等)的数学过程和计算机技术,包括数据获取、预处理、点云拼接和特征分析等步骤。

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

96+阅读 · 2020年3月12日