Self-supervised monocular depth estimation has recently received much attention in computer vision. Most existing works in the literature aggregate multi-scale features for depth prediction via straightforward concatenation or element-wise addition; however, such feature aggregation operations generally neglect the contextual consistency between multi-scale features. To address this problem, we propose the Self-Distilled Feature Aggregation (SDFA) module, which simultaneously aggregates a pair of low-scale and high-scale features while maintaining their contextual consistency. The SDFA module employs three branches to learn three feature offset maps: one offset map for refining the input low-scale feature, and the other two for refining the input high-scale feature in a designed self-distillation manner. We then propose an SDFA-based network for self-supervised monocular depth estimation, and design a self-distilled training strategy to train the proposed network with the SDFA module. Experimental results on the KITTI dataset demonstrate that the proposed method outperforms the comparative state-of-the-art methods in most cases. The code is available at https://github.com/ZM-Zhou/SDFA-Net_pytorch.
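The three-branch design described above can be sketched roughly as follows. This is a minimal, hypothetical PyTorch illustration, not the authors' implementation: the layer choices, the warping via `grid_sample`, and the way the second high-scale branch serves as a self-distillation output are all assumptions made for illustration.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class SDFA(nn.Module):
    """Hypothetical sketch of a Self-Distilled Feature Aggregation module.

    Three branches each predict a 2-channel offset map; the offsets are
    used to resample (refine) the input features before aggregation.
    """

    def __init__(self, channels):
        super().__init__()
        in_ch = 2 * channels  # concatenation of low- and high-scale features
        # one offset branch for the (upsampled) low-scale feature,
        # two for the high-scale feature (self-distillation pair)
        self.offset_low = nn.Conv2d(in_ch, 2, 3, padding=1)
        self.offset_high_a = nn.Conv2d(in_ch, 2, 3, padding=1)
        self.offset_high_b = nn.Conv2d(in_ch, 2, 3, padding=1)

    @staticmethod
    def _warp(feat, offset):
        # resample `feat` at positions shifted by the learned offsets
        n, _, h, w = feat.shape
        ys, xs = torch.meshgrid(
            torch.linspace(-1, 1, h), torch.linspace(-1, 1, w), indexing="ij")
        base = torch.stack((xs, ys), dim=-1).unsqueeze(0).expand(n, -1, -1, -1)
        grid = base + offset.permute(0, 2, 3, 1)  # (N, H, W, 2) in [-1, 1]
        return F.grid_sample(feat, grid, align_corners=True)

    def forward(self, low, high):
        # bring the low-scale feature to the high-scale resolution
        low_up = F.interpolate(low, size=high.shape[2:], mode="bilinear",
                               align_corners=True)
        cat = torch.cat((low_up, high), dim=1)
        low_ref = self._warp(low_up, self.offset_low(cat))
        high_a = self._warp(high, self.offset_high_a(cat))
        high_b = self._warp(high, self.offset_high_b(cat))
        # aggregate the refined features; the second high-scale branch
        # would supply the self-distillation signal during training
        return low_ref + high_a, high_b
```

For example, `SDFA(64)` applied to a `(1, 64, 24, 80)` low-scale feature and a `(1, 64, 48, 160)` high-scale feature returns two `(1, 64, 48, 160)` tensors: the aggregated feature and the auxiliary self-distillation output.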