利用概念重建进行未经监督的单光深度估计 (Unsupervised Single-shot Depth Estimation using Perceptual Reconstruction) - 专知论文

会员服务 ·

0

估计/估计量 · Learning · Single-Shot · 无监督 · Extensibility ·

2022 年 6 月 8 日

Unsupervised Single-shot Depth Estimation using Perceptual Reconstruction

翻译：利用概念重建进行未经监督的单光深度估计

Christoph Angermann,Matthias Schwab,Markus Haltmeier,Christian Laubichler,Steinbjörn Jónsson

from arxiv, arXiv admin note: text overlap with arXiv:2103.16938

Real-time estimation of actual object depth is an essential module for various autonomous system tasks such as 3D reconstruction, scene understanding and condition assessment. During the last decade of machine learning, extensive deployment of deep learning methods to computer vision tasks has yielded approaches that succeed in achieving realistic depth synthesis out of a simple RGB modality. Most of these models are based on paired RGB-depth data and/or the availability of video sequences and stereo images. The lack of sequences, stereo data and RGB-depth pairs makes depth estimation a fully unsupervised single-image transfer problem that has barely been explored so far. This study builds on recent advances in the field of generative neural networks in order to establish fully unsupervised single-shot depth estimation. Two generators for RGB-to-depth and depth-to-RGB transfer are implemented and simultaneously optimized using the Wasserstein-1 distance, a novel perceptual reconstruction term and hand-crafted image filters. We comprehensively evaluate the models using industrial surface depth data as well as the Texas 3D Face Recognition Database, the CelebAMask-HQ database of human portraits and the SURREAL dataset that records body depth. For each evaluation dataset the proposed method shows a significant increase in depth accuracy compared to state-of-the-art single-image transfer methods.

翻译：对实际物体深度的实时估计是各种自主系统任务的基本模块,例如3D重建、现场理解和状况评估。在过去十年的机器学习期间,广泛运用深学习方法进行计算机愿景任务,产生了一些方法,成功地从简单的 RGB 模式中实现现实的深度合成。这些模型大多以配对的 RGB 深度数据和(或)视频序列和立体图像的提供为基础。由于缺乏序列、立体数据和RGB深度对配对,因此对完全不受监督的单一图像传输问题进行了深度估计,而迄今为止,这个问题还很少得到探讨。本研究以基因神经神经网络领域的最新进展为基础,以便建立完全不受监督的单发深度估计。RGB 深度和深度对RGB传输的两种生成器是使用瓦塞尔斯坦-1距离、新颖的视觉重建术语和手制图像过滤器进行实施和同时优化的。我们用工业表面深度数据以及德克萨斯 3D脸辨识数据库、CelebAMsk-HQ 进行全面评估,目的是建立基因神经网络领域的最新进展,以便建立完全不受监督的单一光线路透测测测测测的深度数据库,以显示每一项的每项数据系统,从而对比地测测测测测测测测测的每个数据方法。

0

相关内容

估计/估计量

估计/估计量

“CVPR 2021 接受论文列表 1663篇论文都在这了

专知会员服务

32+阅读 · 2021年6月12日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

166+阅读 · 2020年3月18日

CVPR 2020 论文开源项目合集

专知会员服务

110+阅读 · 2020年3月12日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

95+阅读 · 2020年3月12日

【新书】数字图像(影像)处理手第二版，2176pdf，Mathematical Methods in Imaging

【新书】数字图像(影像)处理手第二版，2176pdf，Mathematical Methods in Imaging

专知会员服务

93+阅读 · 2020年2月12日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

IEEE ICKG 2022: Call for Papers

IEEE ICKG 2022: Call for Papers

机器学习与推荐算法

3+阅读 · 2022年3月30日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium8

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium8

中国图象图形学学会CSIG

0+阅读 · 2021年11月16日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium6

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium6

中国图象图形学学会CSIG

2+阅读 · 2021年11月12日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium4

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium4

中国图象图形学学会CSIG

0+阅读 · 2021年11月10日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

Schr？dinger-Poisson方程守恒DDG方法研究

国家自然科学基金

2+阅读 · 2015年12月31日

基于氢键自组装的模块化荧光传感

国家自然科学基金

1+阅读 · 2014年12月31日

磁性稀土金属-有机骨架分子材料的动态调控及多功能化研究

国家自然科学基金

0+阅读 · 2014年12月31日

吡唑类配体笼状化合物的设计合成及其分子识别研究

国家自然科学基金

0+阅读 · 2013年12月31日

基于天然产物Drimenal的新型杀菌剂分子设计、合成及构效关系研究

国家自然科学基金

0+阅读 · 2013年12月31日

新型抗生素Bagremycins生物合成基因簇的鉴定与解析

国家自然科学基金

0+阅读 · 2012年12月31日

基于稠环噻吩羧酸配体的金属-有机骨架化合物的设计合成、结构、性质、功能界面组装和化学传感器的研究

国家自然科学基金

0+阅读 · 2012年12月31日

基于酪氨酸激酶和Wnt信号通路的多靶点抗癌药物发现

国家自然科学基金

0+阅读 · 2012年12月31日

稀土配合物修饰的多维篮子型多金属氧酸盐的设计合成及抗癌活性的研究

国家自然科学基金

0+阅读 · 2012年12月31日

Toll 样受体介导的巨噬细胞对prion清除的分子机制

国家自然科学基金

0+阅读 · 2009年12月31日

DeepFusion: Real-Time Dense 3D Reconstruction for Monocular SLAM using Single-View Depth and Gradient Predictions

Arxiv

0+阅读 · 2022年7月25日

Sparse-based Domain Adaptation Network for OCTA Image Super-Resolution Reconstruction

Arxiv

0+阅读 · 2022年7月25日

A Visual Navigation Perspective for Category-Level Object Pose Estimation

Arxiv

0+阅读 · 2022年7月23日

CHORE: Contact, Human and Object REconstruction from a single RGB image

CHORE: Contact, Human and Object REconstruction from a single RGB image

Arxiv

0+阅读 · 2022年7月22日

Visual Speech-Aware Perceptual 3D Facial Expression Reconstruction from Videos

Arxiv

0+阅读 · 2022年7月22日

NeurAR: Neural Uncertainty for Autonomous 3D Reconstruction

Arxiv

0+阅读 · 2022年7月22日

Unsupervised Knowledge-Transfer for Learned Image Reconstruction

Arxiv

0+阅读 · 2022年7月21日

Multi-Event-Camera Depth Estimation and Outlier Rejection by Refocused Events Fusion

Arxiv

0+阅读 · 2022年7月21日

IEOPF: An Active Contour Model for Image Segmentation with Inhomogeneities Estimated by Orthogonal Primary Functions

Arxiv

10+阅读 · 2018年1月20日

Pose-Normalized Image Generation for Person Re-identification

Arxiv

11+阅读 · 2018年1月18日

VIP会员

文章信息

相关主题

估计/估计量

相关VIP内容

“CVPR 2021 接受论文列表 1663篇论文都在这了

专知会员服务

32+阅读 · 2021年6月12日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

166+阅读 · 2020年3月18日

CVPR 2020 论文开源项目合集

专知会员服务

110+阅读 · 2020年3月12日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

95+阅读 · 2020年3月12日

【新书】数字图像(影像)处理手第二版，2176pdf，Mathematical Methods in Imaging

【新书】数字图像(影像)处理手第二版，2176pdf，Mathematical Methods in Imaging

专知会员服务

93+阅读 · 2020年2月12日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

《乌克兰无人机产业：志愿者与政策在构建新兴无人机产业中的协同作用》最新报告

《人工智能辅助决策中的数据可视化：系统性综述》

人工智能驱动弹药制造现代化：美国陆军转型之路

《敏捷作战部署中枢纽-辐条基地选址优化研究》80页

相关资讯

IEEE ICKG 2022: Call for Papers

IEEE ICKG 2022: Call for Papers

机器学习与推荐算法

3+阅读 · 2022年3月30日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium8

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium8

中国图象图形学学会CSIG

0+阅读 · 2021年11月16日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium6

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium6

中国图象图形学学会CSIG

2+阅读 · 2021年11月12日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium4

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium4

中国图象图形学学会CSIG

0+阅读 · 2021年11月10日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

相关论文

DeepFusion: Real-Time Dense 3D Reconstruction for Monocular SLAM using Single-View Depth and Gradient Predictions

Arxiv

0+阅读 · 2022年7月25日

Sparse-based Domain Adaptation Network for OCTA Image Super-Resolution Reconstruction

Arxiv

0+阅读 · 2022年7月25日

A Visual Navigation Perspective for Category-Level Object Pose Estimation

Arxiv

0+阅读 · 2022年7月23日

CHORE: Contact, Human and Object REconstruction from a single RGB image

CHORE: Contact, Human and Object REconstruction from a single RGB image

Arxiv

0+阅读 · 2022年7月22日

Visual Speech-Aware Perceptual 3D Facial Expression Reconstruction from Videos

Arxiv

0+阅读 · 2022年7月22日

NeurAR: Neural Uncertainty for Autonomous 3D Reconstruction

Arxiv

0+阅读 · 2022年7月22日

Unsupervised Knowledge-Transfer for Learned Image Reconstruction

Arxiv

0+阅读 · 2022年7月21日

Multi-Event-Camera Depth Estimation and Outlier Rejection by Refocused Events Fusion

Arxiv

0+阅读 · 2022年7月21日

IEOPF: An Active Contour Model for Image Segmentation with Inhomogeneities Estimated by Orthogonal Primary Functions

Arxiv

10+阅读 · 2018年1月20日

Pose-Normalized Image Generation for Person Re-identification

Arxiv

11+阅读 · 2018年1月18日

相关基金

Schr？dinger-Poisson方程守恒DDG方法研究

国家自然科学基金

2+阅读 · 2015年12月31日

基于氢键自组装的模块化荧光传感

国家自然科学基金

1+阅读 · 2014年12月31日

磁性稀土金属-有机骨架分子材料的动态调控及多功能化研究

国家自然科学基金

0+阅读 · 2014年12月31日

吡唑类配体笼状化合物的设计合成及其分子识别研究

国家自然科学基金

0+阅读 · 2013年12月31日

基于天然产物Drimenal的新型杀菌剂分子设计、合成及构效关系研究

国家自然科学基金

0+阅读 · 2013年12月31日

新型抗生素Bagremycins生物合成基因簇的鉴定与解析

国家自然科学基金

0+阅读 · 2012年12月31日

基于稠环噻吩羧酸配体的金属-有机骨架化合物的设计合成、结构、性质、功能界面组装和化学传感器的研究

国家自然科学基金

0+阅读 · 2012年12月31日

基于酪氨酸激酶和Wnt信号通路的多靶点抗癌药物发现

国家自然科学基金

0+阅读 · 2012年12月31日

稀土配合物修饰的多维篮子型多金属氧酸盐的设计合成及抗癌活性的研究

国家自然科学基金

0+阅读 · 2012年12月31日

Toll 样受体介导的巨噬细胞对prion清除的分子机制

国家自然科学基金

0+阅读 · 2009年12月31日

微信扫码咨询专知VIP会员