Despite the remarkable progress of deep learning in stereo matching, there remains a gap in accuracy between real-time models and the slower state-of-the-art models that are less suitable for practical applications. This paper presents an iterative multi-scale coarse-to-fine refinement (iCFR) framework to bridge this gap: it can adopt the matching network of any stereo model to make it faster, more efficient, and scalable while maintaining comparable accuracy. To reduce the computational cost of matching, we use multi-scale warped features to estimate disparity residuals and push the disparity search range in the cost volume to a minimum. Finally, we apply a refinement network to recover the loss of precision that is inherent in multi-scale approaches. We test our iCFR framework by adopting the matching networks from the state-of-the-art GANet and AANet. The result is 49$\times$ faster inference than GANet-deep and 4$\times$ lower memory consumption, with comparable error. Our best-performing network, which we call FRSNet, is scalable even up to an input resolution of 6K on a GTX 1080Ti, with inference time still below one second and accuracy comparable to AANet+. It outperforms all real-time stereo methods and achieves competitive accuracy on the KITTI benchmark.
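The core idea described above, estimating a coarse disparity once and then only searching a small residual range over warped features at each finer scale, can be illustrated with a minimal NumPy sketch. This is a hypothetical toy version using raw image intensities and absolute-difference costs, not the paper's learned feature extractors or refinement network; the function names, the scale schedule `(4, 2, 1)`, and the residual `radius` are illustrative assumptions.

```python
import numpy as np

def shift_right_image(right, disp):
    """Warp the right image toward the left view by a per-pixel disparity."""
    h, w = right.shape
    xs = np.arange(w)[None, :] - disp          # source columns in the right image
    xs = np.clip(np.round(xs).astype(int), 0, w - 1)
    return np.take_along_axis(right, xs, axis=1)

def match_cost(left, right_warped, d):
    """Absolute-difference cost for an extra (residual) shift of d pixels."""
    shifted = shift_right_image(right_warped, np.full(right_warped.shape, float(d)))
    return np.abs(left - shifted)

def coarse_to_fine_disparity(left, right, scales=(4, 2, 1), radius=2):
    """Toy iCFR-style loop: full (tiny) search at the coarsest scale, then
    only small residual corrections at each finer scale."""
    disp = None
    prev_s = None
    for s in scales:
        l = left[::s, ::s]
        r = right[::s, ::s]
        if disp is None:
            disp = np.zeros_like(l, dtype=float)
        else:
            # Upsample the previous estimate and rescale disparity values,
            # so the fine scale only needs to correct small residuals.
            factor = prev_s // s
            disp = np.kron(disp, np.ones((factor, factor))) * factor
            disp = disp[:l.shape[0], :l.shape[1]]
        r_warped = shift_right_image(r, disp)
        # Search a minimal residual range around the warped position.
        costs = np.stack([match_cost(l, r_warped, d)
                          for d in range(-radius, radius + 1)])
        disp = disp + (costs.argmin(axis=0) - radius)
        prev_s = s
    return disp
```

In this sketch, the residual search at every scale covers only `2 * radius + 1` hypotheses instead of the full disparity range, which is what keeps the cost volume, and hence computation and memory, small; the learned refinement network in the paper would then recover the precision lost to the coarse-to-fine quantization.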