We propose an efficient multi-view stereo (MVS) network for inferring depth values from multiple RGB images. Recent studies have shown that mapping the geometric relationships of real space into the neural network is a core issue of the MVS problem. Specifically, these methods focus on how to express the correspondence between different views by constructing a well-designed cost volume. In this paper, we propose a more complete cost volume construction approach that builds on previous experience. First, we introduce a self-attention mechanism to fully aggregate the dominant information from the input images and accurately model long-range dependencies, so as to selectively aggregate reference features. Second, we introduce group-wise correlation into feature aggregation, which greatly reduces the memory and computation burden; at the same time, it enhances the information interaction between different feature channels. With this approach, a more lightweight and efficient cost volume is constructed. Finally, we follow a coarse-to-fine strategy and refine the depth sampling range scale by scale with the help of uncertainty estimation. Combining the previous steps, we obtain the attention thin volume. Quantitative and qualitative experiments are presented to demonstrate the performance of our model.
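As a minimal illustrative sketch (not the authors' code), the group-wise correlation mentioned above can be computed as follows, assuming PyTorch feature volumes of shape (B, C, D, H, W) for the reference view and a warped source view; the tensor shapes and group count are assumptions for illustration.

```python
import torch

def groupwise_correlation(ref_feat: torch.Tensor,
                          src_feat: torch.Tensor,
                          num_groups: int = 8) -> torch.Tensor:
    """Compress a C-channel feature pair into a num_groups-channel similarity
    volume by averaging per-group inner products, so the cost volume stores
    num_groups channels instead of C."""
    b, c, d, h, w = ref_feat.shape
    assert c % num_groups == 0, "channels must divide evenly into groups"
    ch_per_group = c // num_groups
    # Split channels into groups: (B, G, C/G, D, H, W)
    ref = ref_feat.view(b, num_groups, ch_per_group, d, h, w)
    src = src_feat.view(b, num_groups, ch_per_group, d, h, w)
    # Mean of element-wise products within each group -> (B, G, D, H, W)
    return (ref * src).mean(dim=2)

# Example: 32 feature channels compressed to an 8-channel similarity volume.
ref = torch.randn(1, 32, 48, 64, 80)
src = torch.randn(1, 32, 48, 64, 80)
print(groupwise_correlation(ref, src, num_groups=8).shape)  # (1, 8, 48, 64, 80)
```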
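The uncertainty-guided refinement of the depth sampling range can be sketched with a common formulation (an assumption, not necessarily the authors' exact rule): re-centre the per-pixel range on the expected depth and scale its width by the standard deviation of the predicted depth distribution.

```python
import torch

def refine_depth_range(prob_volume: torch.Tensor,
                       depth_hypotheses: torch.Tensor,
                       scale: float = 3.0):
    """prob_volume: (B, D, H, W) softmax probabilities over D depth hypotheses.
    depth_hypotheses: (B, D, H, W) sampled depth values.
    Returns per-pixel (depth_min, depth_max) for the next, finer scale."""
    # Expected depth per pixel: mu = sum_i p_i * d_i
    mu = (prob_volume * depth_hypotheses).sum(dim=1)                      # (B, H, W)
    # Variance as an uncertainty measure: sigma^2 = sum_i p_i * (d_i - mu)^2
    var = (prob_volume * (depth_hypotheses - mu.unsqueeze(1)) ** 2).sum(dim=1)
    sigma = var.clamp(min=1e-10).sqrt()
    # Narrow the sampling interval around mu as the uncertainty shrinks.
    return mu - scale * sigma, mu + scale * sigma
```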