自我监督的正交平面深度估计：PlaneDepth (PlaneDepth: Self-supervised Depth Estimation via Orthogonal Planes) - 专知论文

会员服务 ·

0

正交 · 估计/估计量 · 表示 · Extensibility · HTTPS ·

2023 年 3 月 23 日

PlaneDepth: Self-supervised Depth Estimation via Orthogonal Planes

翻译：自我监督的正交平面深度估计：PlaneDepth

Ruoyu Wang,Zehao Yu,Shenghua Gao

from arxiv, Accepted by CVPR 2023. Code and models are available at: https://github.com/svip-lab/PlaneDepth

Multiple near frontal-parallel planes based depth representation demonstrated impressive results in self-supervised monocular depth estimation (MDE). Whereas, such a representation would cause the discontinuity of the ground as it is perpendicular to the frontal-parallel planes, which is detrimental to the identification of drivable space in autonomous driving. In this paper, we propose the PlaneDepth, a novel orthogonal planes based presentation, including vertical planes and ground planes. PlaneDepth estimates the depth distribution using a Laplacian Mixture Model based on orthogonal planes for an input image. These planes are used to synthesize a reference view to provide the self-supervision signal. Further, we find that the widely used resizing and cropping data augmentation breaks the orthogonality assumptions, leading to inferior plane predictions. We address this problem by explicitly constructing the resizing cropping transformation to rectify the predefined planes and predicted camera pose. Moreover, we propose an augmented self-distillation loss supervised with a bilateral occlusion mask to boost the robustness of orthogonal planes representation for occlusions. Thanks to our orthogonal planes representation, we can extract the ground plane in an unsupervised manner, which is important for autonomous driving. Extensive experiments on the KITTI dataset demonstrate the effectiveness and efficiency of our method. The code is available at https://github.com/svip-lab/PlaneDepth.

翻译：多面非正交的深度表示方式在自我监督的单目深度估计（MDE）中都取得了令人印象深刻的结果。然而，这种表示方式会导致地面的不连续性，因为它垂直于前视平面，这对于自主驾驶的行驶空间识别是有害的。在本文中，我们提出了PlaneDepth，一种基于正交平面的新型表示方法，包括垂直平面和地面平面。PlaneDepth使用拉普拉斯混合模型估计正交平面上的深度分布，从而为输入图像提供自我监督信号。此外，我们发现，广泛使用的缩放和裁剪数据增强会破坏正交性假设，导致较差的平面预测结果。我们通过明确构造缩放裁剪变换，以校正预定义的平面和预测的相机姿态，来解决这个问题。此外，我们提出了增强型自蒸馏损失，用双边遮挡掩蔽进行监督，以提高正交平面表示的抗干扰性。由于我们的正交平面表示方法，我们可以以无监督的方式提取地面平面，这对于自主驾驶非常重要。在KITTI数据集上进行的大量实验表明了我们方法的有效性和高效性。代码可在https://github.com/svip-lab/PlaneDepth获取。

0

相关内容

【CVPR 2022】基于实例深度估计的统一深度感知全景分割 PanopticDepth: Per-Instance Depth Estimation for Unified Depth-Aware Panoptic Segmentation

【CVPR 2022】基于实例深度估计的统一深度感知全景分割 PanopticDepth: Per-Instance Depth Estimation for Unified Depth-Aware Panoptic Segmentation

专知会员服务

18+阅读 · 2022年3月19日

【CVPR 2022】基于元内存传输的跨域少镜头语义分割，Remember the Difference: Cross-Domain Few-Shot Semantic Segmentation via Meta-Memory Transfer

【CVPR 2022】基于元内存传输的跨域少镜头语义分割，Remember the Difference: Cross-Domain Few-Shot Semantic Segmentation via Meta-Memory Transfer

专知会员服务

13+阅读 · 2022年3月12日

【CVPR 2022】一种无需使用负样本的自监督学习方法，Self-Supervised Predictive Learning: A Negative-Free Method for Sound Source Localization in Visual Scenes

【CVPR 2022】一种无需使用负样本的自监督学习方法，Self-Supervised Predictive Learning: A Negative-Free Method for Sound Source Localization in Visual Scenes

专知会员服务

15+阅读 · 2022年3月12日

【MIT】自监督几何感知，22页ppt，Self-supervised Geometric Perception

【MIT】自监督几何感知，22页ppt，Self-supervised Geometric Perception

专知会员服务

23+阅读 · 2021年6月3日

MonoGRNet：单目3D目标检测的通用框架（TPAMI2021）

MonoGRNet：单目3D目标检测的通用框架（TPAMI2021）

专知会员服务

18+阅读 · 2021年5月3日

【NeurIPS2020】通过最大编码率降低原理学习多样和有判别性的表示

【NeurIPS2020】通过最大编码率降低原理学习多样和有判别性的表示

专知会员服务

15+阅读 · 2020年9月30日

【CVPR2020-Oral】自监督单目场景流量估计，Self-Supervised Monocular SFE

【CVPR2020-Oral】自监督单目场景流量估计，Self-Supervised Monocular SFE

专知会员服务

23+阅读 · 2020年4月9日

【论文推荐】小样本视频合成，Few-shot Video-to-Video Synthesis

【论文推荐】小样本视频合成，Few-shot Video-to-Video Synthesis

专知会员服务

24+阅读 · 2019年12月15日

【深度估计| 2019最新综述】单目深度估计方法综述（Monocular Depth Estimation: A Survey）

专知会员服务

69+阅读 · 2019年11月23日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

【泡泡图灵智库】Visual SLAM: 为什么要用BA（ICRA）

【泡泡图灵智库】Visual SLAM: 为什么要用BA（ICRA）

泡泡机器人SLAM

51+阅读 · 2019年7月11日

【泡泡汇总】CVPR2019 SLAM Paperlist

【泡泡汇总】CVPR2019 SLAM Paperlist

泡泡机器人SLAM

14+阅读 · 2019年6月12日

【泡泡一分钟】FarSight：从户外图像中实现远距离深度估计

【泡泡一分钟】FarSight：从户外图像中实现远距离深度估计

泡泡机器人SLAM

11+阅读 · 2019年5月22日

【泡泡一分钟】从三维流动中学习单目视觉里程计及三维稠密建图

【泡泡一分钟】从三维流动中学习单目视觉里程计及三维稠密建图

泡泡机器人SLAM

12+阅读 · 2019年2月12日

逆强化学习-学习人先验的动机

逆强化学习-学习人先验的动机

CreateAMind

16+阅读 · 2019年1月18日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

【泡泡一分钟】RoomNet：端到端房屋布局估计

【泡泡一分钟】RoomNet：端到端房屋布局估计

泡泡机器人SLAM

18+阅读 · 2018年12月4日

【泡泡一分钟】SSD6D：基于RGB的三维检测和6自由度位姿估计(ICCV2017-159)

【泡泡一分钟】SSD6D：基于RGB的三维检测和6自由度位姿估计(ICCV2017-159)

泡泡机器人SLAM

17+阅读 · 2018年10月12日

【论文推荐】最新5篇度量学习（Metric Learning）相关论文—人脸验证、BIER、自适应图卷积、注意力机制、单次学习

【论文推荐】最新5篇度量学习（Metric Learning）相关论文—人脸验证、BIER、自适应图卷积、注意力机制、单次学习

专知

17+阅读 · 2018年2月11日

【论文推荐】最新5篇目标检测相关论文——显著目标检测、弱监督One-Shot检测、多框检测器、携带物体检测、假彩色图像检测

【论文推荐】最新5篇目标检测相关论文——显著目标检测、弱监督One-Shot检测、多框检测器、携带物体检测、假彩色图像检测

专知

74+阅读 · 2018年1月16日

GDF11/KLF4/IR通路在阿尔茨海默病发生及其进展中的作用研究

国家自然科学基金

0+阅读 · 2015年12月31日

有理映射的参数空间

国家自然科学基金

0+阅读 · 2013年12月31日

Calderon问题和边界刚性问题

国家自然科学基金

0+阅读 · 2013年12月31日

一类单位逼近卷积函数的边界渐近问题

国家自然科学基金

0+阅读 · 2013年12月31日

Net1表达上调促进肝癌侵袭转移的作用与分子机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

基于构形平面匹配的可重构机器人运动学求解方法的研究

国家自然科学基金

0+阅读 · 2012年12月31日

弱监督条件下RGB-D时序图像的语义分割模型与迁移学习算法

国家自然科学基金

0+阅读 · 2012年12月31日

基于小波和几何小波的脉冲星信号处理与天文图像去噪

国家自然科学基金

0+阅读 · 2011年12月31日

PI3K/Akt/GSK3β信号转导通路在锰致神经细胞凋亡中的作用

国家自然科学基金

0+阅读 · 2011年12月31日

高精度球面2自由度并联机构优化设计理论及控制的研究

国家自然科学基金

0+阅读 · 2008年12月31日

Fast Traversability Estimation for Wild Visual Navigation

Arxiv

0+阅读 · 2023年5月15日

Optimal and Robust Category-level Perception: Object Pose and Shape Estimation from 2D and 3D Semantic Keypoints

Arxiv

0+阅读 · 2023年5月15日

Heuristic Weakly Supervised 3D Human Pose Estimation

Arxiv

0+阅读 · 2023年5月12日

Self-Supervised Video Representation Learning via Latent Time Navigation

Arxiv

0+阅读 · 2023年5月10日

FusionDepth: Complement Self-Supervised Monocular Depth Estimation with Cost Volume

Arxiv

0+阅读 · 2023年5月10日

Bayesian Synthetic Likelihood

Arxiv

0+阅读 · 2023年5月10日

Multi-Object Self-Supervised Depth Denoising

Arxiv

0+阅读 · 2023年5月9日

Self-supervised Geometric Perception

Arxiv

24+阅读 · 2021年3月4日

Directional Graph Networks

Directional Graph Networks

Arxiv

27+阅读 · 2020年12月10日

Monocular Object and Plane SLAM in Structured Environments

Monocular Object and Plane SLAM in Structured Environments

Arxiv

12+阅读 · 2018年9月10日

VIP会员

文章信息

相关主题

估计/估计量

相关VIP内容

【CVPR 2022】基于实例深度估计的统一深度感知全景分割 PanopticDepth: Per-Instance Depth Estimation for Unified Depth-Aware Panoptic Segmentation

【CVPR 2022】基于实例深度估计的统一深度感知全景分割 PanopticDepth: Per-Instance Depth Estimation for Unified Depth-Aware Panoptic Segmentation

专知会员服务

18+阅读 · 2022年3月19日

【CVPR 2022】基于元内存传输的跨域少镜头语义分割，Remember the Difference: Cross-Domain Few-Shot Semantic Segmentation via Meta-Memory Transfer

【CVPR 2022】基于元内存传输的跨域少镜头语义分割，Remember the Difference: Cross-Domain Few-Shot Semantic Segmentation via Meta-Memory Transfer

专知会员服务

13+阅读 · 2022年3月12日

【CVPR 2022】一种无需使用负样本的自监督学习方法，Self-Supervised Predictive Learning: A Negative-Free Method for Sound Source Localization in Visual Scenes

【CVPR 2022】一种无需使用负样本的自监督学习方法，Self-Supervised Predictive Learning: A Negative-Free Method for Sound Source Localization in Visual Scenes

专知会员服务

15+阅读 · 2022年3月12日

【MIT】自监督几何感知，22页ppt，Self-supervised Geometric Perception

【MIT】自监督几何感知，22页ppt，Self-supervised Geometric Perception

专知会员服务

23+阅读 · 2021年6月3日

MonoGRNet：单目3D目标检测的通用框架（TPAMI2021）

MonoGRNet：单目3D目标检测的通用框架（TPAMI2021）

专知会员服务

18+阅读 · 2021年5月3日

【NeurIPS2020】通过最大编码率降低原理学习多样和有判别性的表示

【NeurIPS2020】通过最大编码率降低原理学习多样和有判别性的表示

专知会员服务

15+阅读 · 2020年9月30日

【CVPR2020-Oral】自监督单目场景流量估计，Self-Supervised Monocular SFE

【CVPR2020-Oral】自监督单目场景流量估计，Self-Supervised Monocular SFE

专知会员服务

23+阅读 · 2020年4月9日

【论文推荐】小样本视频合成，Few-shot Video-to-Video Synthesis

【论文推荐】小样本视频合成，Few-shot Video-to-Video Synthesis

专知会员服务

24+阅读 · 2019年12月15日

【深度估计| 2019最新综述】单目深度估计方法综述（Monocular Depth Estimation: A Survey）

专知会员服务

69+阅读 · 2019年11月23日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

热门VIP内容

开通专知VIP会员享更多权益服务

《小型无人机系统侦测追踪技术：声学、计算机视觉与深度学习融合方案》最新98页

《"牧羊人网格"拦截策略：实现无人机集群可靠拦截的新范式》

光纤无人机：反无人机系统的重大挑战

《作战建模与仿真实证研究》

相关资讯

【泡泡图灵智库】Visual SLAM: 为什么要用BA（ICRA）

【泡泡图灵智库】Visual SLAM: 为什么要用BA（ICRA）

泡泡机器人SLAM

51+阅读 · 2019年7月11日

【泡泡汇总】CVPR2019 SLAM Paperlist

【泡泡汇总】CVPR2019 SLAM Paperlist

泡泡机器人SLAM

14+阅读 · 2019年6月12日

【泡泡一分钟】FarSight：从户外图像中实现远距离深度估计

【泡泡一分钟】FarSight：从户外图像中实现远距离深度估计

泡泡机器人SLAM

11+阅读 · 2019年5月22日

【泡泡一分钟】从三维流动中学习单目视觉里程计及三维稠密建图

【泡泡一分钟】从三维流动中学习单目视觉里程计及三维稠密建图

泡泡机器人SLAM

12+阅读 · 2019年2月12日

逆强化学习-学习人先验的动机

逆强化学习-学习人先验的动机

CreateAMind

16+阅读 · 2019年1月18日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

【泡泡一分钟】RoomNet：端到端房屋布局估计

【泡泡一分钟】RoomNet：端到端房屋布局估计

泡泡机器人SLAM

18+阅读 · 2018年12月4日

【泡泡一分钟】SSD6D：基于RGB的三维检测和6自由度位姿估计(ICCV2017-159)

【泡泡一分钟】SSD6D：基于RGB的三维检测和6自由度位姿估计(ICCV2017-159)

泡泡机器人SLAM

17+阅读 · 2018年10月12日

【论文推荐】最新5篇度量学习（Metric Learning）相关论文—人脸验证、BIER、自适应图卷积、注意力机制、单次学习

【论文推荐】最新5篇度量学习（Metric Learning）相关论文—人脸验证、BIER、自适应图卷积、注意力机制、单次学习

专知

17+阅读 · 2018年2月11日

【论文推荐】最新5篇目标检测相关论文——显著目标检测、弱监督One-Shot检测、多框检测器、携带物体检测、假彩色图像检测

【论文推荐】最新5篇目标检测相关论文——显著目标检测、弱监督One-Shot检测、多框检测器、携带物体检测、假彩色图像检测

专知

74+阅读 · 2018年1月16日

相关论文

Fast Traversability Estimation for Wild Visual Navigation

Arxiv

0+阅读 · 2023年5月15日

Optimal and Robust Category-level Perception: Object Pose and Shape Estimation from 2D and 3D Semantic Keypoints

Arxiv

0+阅读 · 2023年5月15日

Heuristic Weakly Supervised 3D Human Pose Estimation

Arxiv

0+阅读 · 2023年5月12日

Self-Supervised Video Representation Learning via Latent Time Navigation

Arxiv

0+阅读 · 2023年5月10日

FusionDepth: Complement Self-Supervised Monocular Depth Estimation with Cost Volume

Arxiv

0+阅读 · 2023年5月10日

Bayesian Synthetic Likelihood

Arxiv

0+阅读 · 2023年5月10日

Multi-Object Self-Supervised Depth Denoising

Arxiv

0+阅读 · 2023年5月9日

Self-supervised Geometric Perception

Arxiv

24+阅读 · 2021年3月4日

Directional Graph Networks

Directional Graph Networks

Arxiv

27+阅读 · 2020年12月10日

Monocular Object and Plane SLAM in Structured Environments

Monocular Object and Plane SLAM in Structured Environments

Arxiv

12+阅读 · 2018年9月10日

相关基金

GDF11/KLF4/IR通路在阿尔茨海默病发生及其进展中的作用研究

国家自然科学基金

0+阅读 · 2015年12月31日

有理映射的参数空间

国家自然科学基金

0+阅读 · 2013年12月31日

Calderon问题和边界刚性问题

国家自然科学基金

0+阅读 · 2013年12月31日

一类单位逼近卷积函数的边界渐近问题

国家自然科学基金

0+阅读 · 2013年12月31日

Net1表达上调促进肝癌侵袭转移的作用与分子机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

基于构形平面匹配的可重构机器人运动学求解方法的研究

国家自然科学基金

0+阅读 · 2012年12月31日

弱监督条件下RGB-D时序图像的语义分割模型与迁移学习算法

国家自然科学基金

0+阅读 · 2012年12月31日

基于小波和几何小波的脉冲星信号处理与天文图像去噪

国家自然科学基金

0+阅读 · 2011年12月31日

PI3K/Akt/GSK3β信号转导通路在锰致神经细胞凋亡中的作用

国家自然科学基金

0+阅读 · 2011年12月31日

高精度球面2自由度并联机构优化设计理论及控制的研究

国家自然科学基金

0+阅读 · 2008年12月31日

微信扫码咨询专知VIP会员