Reconstructing the surgical scene from stereo endoscopic video is an important and promising topic in surgical data science, with potential applications in surgical visual perception, robotic surgery education, and intra-operative context awareness. However, current methods are mostly restricted to reconstructing static anatomy, assuming no tissue deformation, no tool occlusion or de-occlusion, and no camera movement; these assumptions do not always hold in minimally invasive robotic surgery. In this work, we present an efficient reconstruction pipeline for highly dynamic surgical scenes that runs at 28 fps. Specifically, we design a transformer-based stereoscopic depth perception module for efficient depth estimation and a lightweight tool segmentor to handle tool occlusion. We then propose a dynamic reconstruction algorithm that estimates tissue deformation and camera movement and aggregates information over time for surgical scene reconstruction. We evaluate the proposed pipeline on two datasets: the public Hamlyn Centre Endoscopic Video Dataset and our in-house DaVinci robotic surgery dataset. The results demonstrate that our method can recover regions of the scene obstructed by the surgical tool and handle camera movement in realistic surgical scenarios effectively at real-time speed.
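The pipeline described above has three stages: stereo depth estimation, tool segmentation, and occlusion-aware temporal fusion. The skeleton below is a minimal sketch of that control flow only; the function names and the trivial stand-in computations (intensity-difference disparity, brightness-threshold tool mask) are hypothetical placeholders, not the authors' transformer network or segmentor.

```python
import numpy as np

def estimate_depth(left, right):
    # Placeholder for the transformer-based stereo depth module:
    # a trivial disparity proxy from the intensity difference.
    disparity = np.abs(left.astype(np.float32) - right.astype(np.float32)) + 1.0
    return 1.0 / disparity  # inverse disparity as a stand-in for depth

def segment_tool(frame, threshold=200):
    # Placeholder for the lightweight tool segmentor: bright pixels
    # stand in for the metallic instrument occluding the tissue.
    return frame >= threshold  # True where the tool occludes tissue

def fuse(canonical, depth, tool_mask, alpha=0.5):
    # Temporal aggregation: update the reconstruction only where the
    # tissue is visible, keeping previous estimates under the tool so
    # that occluded regions can be recovered from earlier frames.
    fused = canonical.copy()
    visible = ~tool_mask
    fused[visible] = (1 - alpha) * canonical[visible] + alpha * depth[visible]
    return fused

# Run the three stages over a short synthetic stereo sequence.
h, w = 4, 4
canonical = np.zeros((h, w), dtype=np.float32)
for _ in range(3):
    left = np.random.randint(0, 255, (h, w)).astype(np.uint8)
    right = np.random.randint(0, 255, (h, w)).astype(np.uint8)
    depth = estimate_depth(left, right)
    mask = segment_tool(left)
    canonical = fuse(canonical, depth, mask)
```

The key design point mirrored here is that fusion is masked by the tool segmentation, so frames where an instrument covers part of the tissue do not overwrite the reconstruction beneath it.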