RDFC-GAN: RGB-Depth Fusion CycleGAN for Indoor Depth Completion

June 6, 2023

Haowen Wang, Zhengping Che, Mingyuan Wang, Zhiyuan Xu, Xiuquan Qiao, Mengshi Qi, Feifei Feng, Jian Tang

From arXiv. Haowen Wang and Zhengping Che contributed equally. Under review. An earlier version was accepted at CVPR 2022 (arXiv:2203.10856).

Raw depth images captured by indoor depth sensors often have extensive regions of missing depth values, owing to inherent limitations such as the inability to perceive transparent objects and a limited distance range. Incomplete depth maps burden many downstream vision tasks, and a growing number of depth completion methods have been proposed to alleviate this issue. While most existing methods can generate accurate dense depth maps from sparse, uniformly sampled depth maps, they are not suited to completing large contiguous regions of missing depth values, which are common and critical in images captured in indoor environments. To overcome these challenges, we design a novel two-branch end-to-end fusion network named RDFC-GAN, which takes a pair of RGB and incomplete depth images as input and predicts a dense, completed depth map. The first branch employs an encoder-decoder structure that, adhering to the Manhattan world assumption and using normal maps derived from RGB-D information as guidance, regresses local dense depth values from the raw depth map. In the other branch, we propose an RGB-depth fusion CycleGAN that transfers the RGB image to a fine-grained textured depth map. We adopt adaptive fusion modules named W-AdaIN to propagate features across the two branches, and we append a confidence fusion head to fuse the two branch outputs into the final depth map. Extensive experiments on NYU-Depth V2 and SUN RGB-D demonstrate that our proposed method clearly improves depth completion performance, especially in a more realistic setting of indoor environments, with the help of our proposed pseudo depth maps in training.
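The abstract does not spell out how the confidence fusion head combines the two branch outputs. As a rough illustration only, the sketch below assumes each branch emits a depth map plus a per-pixel confidence logit map, and blends the two depth maps with softmax weights over those logits. The function name `confidence_fusion` and its arguments are hypothetical and are not taken from the paper or its code.

```python
import numpy as np


def confidence_fusion(depth_a, conf_a, depth_b, conf_b):
    """Blend two depth maps using per-pixel softmax weights.

    Assumption (not from the paper): conf_a and conf_b are unnormalised
    confidence logits with the same shape as the depth maps.
    """
    # Stack the two logit maps along a new leading "branch" axis.
    logits = np.stack([conf_a, conf_b], axis=0)
    # Subtract the per-pixel maximum for numerical stability.
    logits = logits - logits.max(axis=0, keepdims=True)
    weights = np.exp(logits)
    weights = weights / weights.sum(axis=0, keepdims=True)
    # Per-pixel convex combination of the two branch predictions.
    return weights[0] * depth_a + weights[1] * depth_b


if __name__ == "__main__":
    # Toy 2x2 inputs: branch A predicts 1 m everywhere, branch B 3 m.
    depth_a = np.full((2, 2), 1.0)
    depth_b = np.full((2, 2), 3.0)
    equal = np.zeros((2, 2))
    # With equal confidence, the fused map is the plain average.
    print(confidence_fusion(depth_a, equal, depth_b, equal))
```

With equal logits the result reduces to a plain average, and as one branch's logit grows the fused value approaches that branch's prediction. A learned head would produce the confidence maps from network features rather than taking them as given.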
