Novel view synthesis is required in many robotic applications, such as VR teleoperation and scene reconstruction. Existing methods are often too slow for these contexts, cannot handle dynamic scenes, and are limited by their explicit depth estimation stage, where incorrect depth predictions can lead to large projection errors. Our proposed method runs in real time on live streaming data and avoids explicit depth estimation by efficiently warping the input images into the target frame for a range of assumed depth planes. The resulting plane sweep volume (PSV) is fed directly into our network, which first estimates soft PSV masks in a self-supervised manner and then directly produces the novel output view. This improves both efficiency and performance on transparent, reflective, thin, and featureless scene parts. FaDIV-Syn performs both interpolation and extrapolation at 540p in real time and outperforms state-of-the-art extrapolation methods on the large-scale RealEstate10k dataset. We thoroughly evaluate ablations, such as removing the Soft-Masking network, training from fewer examples, and generalization to higher resolutions and stronger depth discretization. Our implementation is available.
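To make the plane-sweep idea concrete, the following is a minimal sketch of how a PSV can be built by warping a source image into the target view once per assumed fronto-parallel depth plane. It assumes a pinhole camera model and PyTorch; all names (`build_psv`, `K_src`, `K_tgt`, etc.) are hypothetical and do not reflect the actual FaDIV-Syn implementation.

```python
# Illustrative PSV construction via plane-induced homography warping.
# Hypothetical sketch, not the authors' code.
import torch
import torch.nn.functional as F

def build_psv(src_img, K_src, K_tgt, R, t, depths):
    """Warp src_img into the target view once per assumed depth plane.

    src_img: (1, 3, H, W) source image
    K_src, K_tgt: (3, 3) camera intrinsics
    R, t: rotation (3, 3) and translation (3,) from target to source frame
    depths: iterable of assumed plane depths in the target frame
    Returns a PSV of shape (1, 3 * len(depths), H, W).
    """
    _, _, H, W = src_img.shape
    # Homogeneous pixel grid of the target view, shape (3, H*W).
    ys, xs = torch.meshgrid(torch.arange(H, dtype=torch.float32),
                            torch.arange(W, dtype=torch.float32),
                            indexing="ij")
    pix = torch.stack([xs, ys, torch.ones_like(xs)], dim=0).reshape(3, -1)
    rays = K_tgt.inverse() @ pix  # back-projected rays at depth 1
    planes = []
    for d in depths:
        # Lift target pixels onto the plane at depth d, then project
        # them into the source camera.
        pts = R @ (rays * d) + t.unsqueeze(1)     # (3, H*W) in source frame
        uv = K_src @ pts
        uv = uv[:2] / uv[2:].clamp(min=1e-6)      # perspective divide
        # Normalize pixel coordinates to [-1, 1] for grid_sample.
        grid = torch.stack([uv[0] / (W - 1) * 2 - 1,
                            uv[1] / (H - 1) * 2 - 1],
                           dim=-1).reshape(1, H, W, 2)
        planes.append(F.grid_sample(src_img, grid, align_corners=True))
    return torch.cat(planes, dim=1)
```

In practice, plane-sweep methods commonly space the candidate planes uniformly in inverse depth rather than in depth, so that nearby geometry, where parallax is largest, receives a finer discretization.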