In advanced paradigms of autonomous driving, learning Bird's Eye View (BEV) representations from surrounding views is crucial for multi-task frameworks. However, existing methods based on depth estimation or camera-driven attention are not robust to noisy camera parameters, facing two main challenges: accurate depth prediction and calibration. In this work, we present a completely calibration-free Multi-Camera Calibration Free Transformer (CFT) for robust BEV representation, which explores an implicit view-to-BEV mapping that does not rely on camera intrinsics and extrinsics. To guide better feature learning from image views to BEV, CFT mines latent 3D information in BEV via our designed position-aware enhancement (PA). Instead of camera-driven point-wise or global transformation, we propose view-aware attention, which restricts interaction to more relevant regions, lowers computation cost by reducing redundant computation, and promotes convergence. CFT achieves 49.7% NDS on the nuScenes detection task leaderboard, and is the first work to remove camera parameters while remaining comparable to other geometry-guided methods. Without temporal input or other modalities, CFT achieves the second-highest performance with a smaller image input of 1600 × 640. Thanks to the view-aware attention variant, CFT reduces memory and transformer FLOPs relative to vanilla attention by about 12% and 60%, respectively, while improving NDS by 1.0%. Moreover, its natural robustness to noisy camera parameters makes CFT more competitive.
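To make the view-aware attention idea concrete, the following is a minimal sketch (not the authors' implementation) of cross-attention from BEV queries to multi-view image features in which each query attends only to an assigned subset of camera views; the class name, tensor layout, and the precomputed query-to-view mask are assumptions for illustration.

```python
# Minimal sketch of view-aware cross-attention: each BEV query interacts
# only with features from its assigned camera views, pruning redundant
# query-key pairs compared with attending to all views globally.
import torch
import torch.nn as nn


class ViewAwareAttention(nn.Module):
    """Cross-attention from BEV queries to multi-view image features,
    masked so that a query only sees the views assigned to it."""

    def __init__(self, embed_dim: int = 256, num_heads: int = 8):
        super().__init__()
        self.attn = nn.MultiheadAttention(embed_dim, num_heads, batch_first=True)

    def forward(self, bev_queries, view_feats, query_view_mask):
        # bev_queries:     (B, Nq, C)      BEV query embeddings
        # view_feats:      (B, V, HW, C)   flattened per-view image features
        # query_view_mask: (Nq, V) bool    True where a query may attend to a view
        #                  (assumed to give every query at least one visible view)
        B, V, HW, C = view_feats.shape
        keys = view_feats.reshape(B, V * HW, C)
        # Expand the per-view mask to per-token keys; nn.MultiheadAttention
        # treats True in attn_mask as a blocked position, hence the negation.
        attn_mask = ~query_view_mask.repeat_interleave(HW, dim=1)  # (Nq, V*HW)
        out, _ = self.attn(bev_queries, keys, keys, attn_mask=attn_mask)
        return out
```

In such a sketch, the mask keeps the dense attention kernel but zeroes out query-view pairs that cannot contribute, which is one plausible way to realize the reported savings in memory and transformer FLOPs without using camera parameters.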