Revisiting Stereo Depth Estimation From a Sequence-to-Sequence Perspective with Transformers - 专知 (Zhuanzhi) Papers

August 25, 2021

Revisiting Stereo Depth Estimation From a Sequence-to-Sequence Perspective with Transformers

Zhaoshuo Li, Xingtong Liu, Nathan Drenkow, Andy Ding, Francis X. Creighton, Russell H. Taylor, Mathias Unberath

From arXiv. Our code is available at https://github.com/mli0603/stereo-transformer

Stereo depth estimation relies on optimal correspondence matching between pixels on epipolar lines in the left and right images to infer depth. In this work, we revisit the problem from a sequence-to-sequence correspondence perspective to replace cost volume construction with dense pixel matching using position information and attention. This approach, named STereo TRansformer (STTR), has several advantages: It 1) relaxes the limitation of a fixed disparity range, 2) identifies occluded regions and provides confidence estimates, and 3) imposes uniqueness constraints during the matching process. We report promising results on both synthetic and real-world datasets and demonstrate that STTR generalizes across different domains, even without fine-tuning.
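The matching idea in the abstract can be illustrated with a minimal sketch. It assumes rectified images, so that corresponding pixels lie on the same scanline, and it treats the left and right scanlines as two sequences of feature vectors. Every left pixel attends to all right pixels through a softmax over dot-product scores; the best-scoring position gives the disparity and its attention weight serves as a rough confidence. The names `match_scanline`, `one_hot`, and the toy features are hypothetical and chosen for illustration only. This is not the STTR implementation, which the abstract describes as also using position information and imposing uniqueness constraints; neither is modeled here.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def match_scanline(left_feats, right_feats):
    """Match every left pixel to a right pixel on the same scanline.

    Returns one (disparity, confidence) pair per left pixel, where
    disparity = x_left - x_right and confidence is the attention
    weight assigned to the chosen right pixel.
    """
    results = []
    for x_left, f_left in enumerate(left_feats):
        # Dot-product similarity between this left pixel and every right pixel.
        scores = [sum(a * b for a, b in zip(f_left, f_right))
                  for f_right in right_feats]
        weights = softmax(scores)
        # Pick the right pixel with the highest attention weight.
        x_right = max(range(len(weights)), key=weights.__getitem__)
        results.append((x_left - x_right, weights[x_right]))
    return results


def one_hot(i, n, scale=4.0):
    """Toy feature vector: a scaled one-hot code (illustrative assumption)."""
    return [scale if k == i else 0.0 for k in range(n)]


# Toy scanline: the left view is the right view shifted by 2 pixels.
# The first two left pixels have no counterpart (zero features),
# which mimics pixels that are not visible in the right image.
right = [one_hot(i, 5) for i in range(5)]
left = [[0.0] * 5, [0.0] * 5] + right[:3]

matches = match_scanline(left, right)
for x, (disparity, confidence) in enumerate(matches):
    print(f"x_left={x} disparity={disparity} confidence={confidence:.3f}")
```

In this toy setup the three matchable pixels receive disparity 2 with confidence close to 1, while the two unmatched pixels spread their attention uniformly and end up with low confidence. No disparity range is fixed in advance, since each left pixel is compared against the whole right scanline.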

Related content

Latest "Transformers Models" tutorial, 64-page slide deck

专知 member services

326+ reads · November 26, 2020

Zero-Shot Learning for Text Classification

专知 member services

97+ reads · May 31, 2020

[ACL 2020] Improving Adversarial Text Generation

专知 member services

52+ reads · May 5, 2020

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知 member services

50+ reads · October 17, 2019

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知 member services

59+ reads · October 17, 2019

95 FPS! An ultra-fast 3D object detection network is now open source! SFA3D: a real-time, accurate LiDAR-based 3D object detection model

CVer

4+ reads · November 14, 2020

CVPR 2019 | Stereo R-CNN 3D object detection

极市平台

27+ reads · March 10, 2019

Unsupervised Learning via Meta-Learning

CreateAMind

44+ reads · January 3, 2019

disentangled-representation-papers

CreateAMind

26+ reads · September 12, 2018

[泡泡一分钟] A multi-robot cooperative localization algorithm based on denoising of relative pose estimates (ICRA-25)

泡泡机器人SLAM

3+ reads · February 5, 2018

SBEVNet: End-to-End Deep Stereo Layout Estimation

arXiv

0+ reads · October 17, 2021

Multi-View Stereo Network with attention thin volume

arXiv

0+ reads · October 16, 2021

Attention meets Geometry: Geometry Guided Spatial-Temporal Attention for Consistent Self-Supervised Monocular Depth Estimation

arXiv

0+ reads · October 15, 2021

Rethinking Semantic Segmentation from a Sequence-to-Sequence Perspective with Transformers

arXiv

10+ reads · December 31, 2020

Star-Transformer

arXiv

5+ reads · February 28, 2019
