Image view synthesis has seen great success in reconstructing photorealistic visuals, thanks to deep learning and various novel representations. The next key step for immersive virtual experiences is view synthesis of dynamic scenes. However, several challenges remain due to the lack of high-quality training datasets and the additional time dimension of dynamic-scene videos. To address these issues, we introduce a multi-view video dataset captured with a custom 10-camera rig at 120 FPS. The dataset contains 96 high-quality scenes showing various visual effects and human interactions in outdoor settings. We also develop a new algorithm, Deep 3D Mask Volume, which enables temporally stable view extrapolation from binocular videos of dynamic scenes captured by static cameras. Our algorithm addresses the temporal inconsistency of disocclusions by identifying error-prone areas with a 3D mask volume and replacing them with the static background observed throughout the video. This enables manipulation in 3D space rather than with simple 2D masks, and we demonstrate better temporal stability than frame-by-frame static view synthesis methods or those that use 2D masks. The resulting view synthesis videos show minimal flickering artifacts and allow for larger translational movements.
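To illustrate the core idea of masking in 3D rather than 2D, the sketch below (our own simplification, not the authors' implementation) blends a per-frame layered RGBA volume with a time-invariant static background volume using a 3D mask volume, then composites the result into a novel view. The layered RGBA representation, the tensor shapes, and all function names are assumptions made for illustration only.

```python
# Minimal sketch (assumed representation, not the paper's code): a 3D mask volume
# selects trusted regions from the per-frame dynamic volume and falls back to a
# static background volume in disocclusion-prone regions.
import numpy as np

def blend_with_mask_volume(dynamic_vol, static_vol, mask_vol):
    """Blend dynamic and static layered volumes with a 3D mask.

    dynamic_vol: (D, H, W, 4) RGBA plane volume predicted from the current
                 stereo frame (hypothetical layered representation).
    static_vol:  (D, H, W, 4) RGBA volume aggregated over the whole video,
                 assumed to capture the static background.
    mask_vol:    (D, H, W, 1) values in [0, 1]; 1 keeps the dynamic content,
                 0 replaces it with the static background.
    """
    return mask_vol * dynamic_vol + (1.0 - mask_vol) * static_vol

def composite_back_to_front(rgba_vol):
    """Standard alpha compositing of RGBA planes, back (index 0) to front."""
    out = np.zeros(rgba_vol.shape[1:3] + (3,), dtype=rgba_vol.dtype)
    for d in range(rgba_vol.shape[0]):
        rgb, alpha = rgba_vol[d, ..., :3], rgba_vol[d, ..., 3:4]
        out = alpha * rgb + (1.0 - alpha) * out
    return out

# Toy usage with random volumes (D depth planes at H x W resolution).
D, H, W = 8, 64, 64
dynamic = np.random.rand(D, H, W, 4).astype(np.float32)
static = np.random.rand(D, H, W, 4).astype(np.float32)
mask = np.random.rand(D, H, W, 1).astype(np.float32)
blended = blend_with_mask_volume(dynamic, static, mask)
novel_view = composite_back_to_front(blended)  # (H, W, 3) rendered proxy image
```

Because the mask lives in the 3D volume rather than in image space, background layers can show through behind moving foreground objects consistently across frames, which is the property the 2D-mask baselines lack.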