Multi-sensor fusion is essential for an accurate and reliable autonomous driving system. Recent approaches are based on point-level fusion: augmenting the LiDAR point cloud with camera features. However, the camera-to-LiDAR projection throws away the semantic density of camera features, hindering the effectiveness of such methods, especially for semantic-oriented tasks (such as 3D scene segmentation). In this paper, we break this deeply-rooted convention with BEVFusion, an efficient and generic multi-task multi-sensor fusion framework. It unifies multi-modal features in the shared bird's-eye view (BEV) representation space, which nicely preserves both geometric and semantic information. To achieve this, we diagnose and lift key efficiency bottlenecks in the view transformation with optimized BEV pooling, reducing latency by more than 40x. BEVFusion is fundamentally task-agnostic and seamlessly supports different 3D perception tasks with almost no architectural changes. It establishes the new state of the art on nuScenes, achieving 1.3% higher mAP and NDS on 3D object detection and 13.6% higher mIoU on BEV map segmentation, with 1.9x lower computation cost.
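To make the view-transformation bottleneck concrete, the snippet below is a minimal sketch of the BEV pooling operation the abstract refers to: camera features that have been lifted into 3D are aggregated into a dense bird's-eye-view grid by summing all features that fall into the same cell. The function name `bev_pool` and the tensor shapes are illustrative assumptions; the paper's reported >40x latency reduction comes from its optimized implementation, not from this naive PyTorch version.

```python
import torch

def bev_pool(feats, coords, grid_size):
    """Sum-pool lifted camera features into a dense BEV grid.

    feats:     (N, C) point features produced by depth-based lifting
    coords:    (N, 2) integer BEV cell indices (x, y) for each point
    grid_size: (X, Y) number of cells along each BEV axis
    returns:   (C, X, Y) dense BEV feature map
    """
    C = feats.shape[1]
    X, Y = grid_size
    # Keep only points that land inside the BEV grid.
    mask = (coords[:, 0] >= 0) & (coords[:, 0] < X) & \
           (coords[:, 1] >= 0) & (coords[:, 1] < Y)
    feats, coords = feats[mask], coords[mask]
    # Flatten 2D cell indices so all points can be scattered in one pass.
    flat_idx = coords[:, 0] * Y + coords[:, 1]            # (N,)
    bev = torch.zeros(X * Y, C, dtype=feats.dtype)
    bev.index_add_(0, flat_idx, feats)                    # sum features per cell
    return bev.view(X, Y, C).permute(2, 0, 1)             # (C, X, Y)

# Example (hypothetical sizes): 1000 lifted points with 80-channel features
# pooled onto a 180 x 180 BEV grid.
feats = torch.randn(1000, 80)
coords = torch.randint(0, 180, (1000, 2))
bev_map = bev_pool(feats, coords, (180, 180))             # shape (80, 180, 180)
```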