Rustummer: 多视图 3D 探测的适应性事件感知恢复 (FrustumFormer: Adaptive Instance-aware Resampling for Multi-view 3D Detection) - 专知论文

会员服务 ·

0

示例 · 3D · INFORMS · 变换 · 可约的 ·

2023 年 1 月 10 日

FrustumFormer: Adaptive Instance-aware Resampling for Multi-view 3D Detection

翻译：Rustummer: 多视图 3D 探测的适应性事件感知恢复

Yuqi Wang,Yuntao Chen,Zhaoxiang Zhang

from arxiv, technical report

The transformation of features from 2D perspective space to 3D space is essential to multi-view 3D object detection. Recent approaches mainly focus on the design of view transformation, either pixel-wisely lifting perspective view features into 3D space with estimated depth or grid-wisely constructing BEV features via 3D projection, treating all pixels or grids equally. However, choosing what to transform is also important but has rarely been discussed before. The pixels of a moving car are more informative than the pixels of the sky. To fully utilize the information contained in images, the view transformation should be able to adapt to different image regions according to their contents. In this paper, we propose a novel framework named FrustumFormer, which pays more attention to the features in instance regions via adaptive instance-aware resampling. Specifically, the model obtains instance frustums on the bird's eye view by leveraging image view object proposals. An adaptive occupancy mask within the instance frustum is learned to refine the instance location. Moreover, the temporal frustum intersection could further reduce the localization uncertainty of objects. Comprehensive experiments on the nuScenes dataset demonstrate the effectiveness of FrustumFormer, and we achieve a new state-of-the-art performance on the benchmark. Codes will be released soon.

翻译：从 2D 角度空间到 3D 空间的地貌转换对于多视图 3D 对象检测至关重要。最近的方法主要侧重于视图转换的设计, 要么是像素明智地提升视角视图功能, 要么是三维空间, 估计深度或以网格明智的方式通过 3D 投影构建 BEV 特征, 平等对待所有像素或网格。但是, 选择要变换的像素也很重要, 但以前很少讨论过。移动汽车的像素比天空的像素更加丰富。要充分利用图像中所含的信息, 视图转换应该能够根据图像的内容适应不同的图像区域。在本文件中, 我们提出了一个名为 Frustem Former 的新框架, 该框架通过适应性实例- 图像重新标注的方式, 更多地关注实例区域的地貌特征。具体地说, 模型在鸟的视觉视图上获取实例的结晶体图, 样中学习一个适应性的占用面罩, 来改进实例位置的位置。此外, 时间的骨质交交叉可以进一步减少物体的本地化不确定性。在新标准上的全面实验, 将很快地显示我们获得的状态。

0

相关内容

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

专知会员服务

75+阅读 · 2022年6月28日

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

专知会员服务

78+阅读 · 2022年3月15日

【干货书】机器学习速查手册，135页pdf

【干货书】机器学习速查手册，135页pdf

专知会员服务

127+阅读 · 2020年11月20日

因果图，Causal Graphs，52页ppt

因果图，Causal Graphs，52页ppt

专知会员服务

250+阅读 · 2020年4月19日

【新书】数字图像(影像)处理手第二版，2176pdf，Mathematical Methods in Imaging

【新书】数字图像(影像)处理手第二版，2176pdf，Mathematical Methods in Imaging

专知会员服务

93+阅读 · 2020年2月12日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

直播 | Interpretable and Trustworthy Graph Geometric Deep Learning

直播 | Interpretable and Trustworthy Graph Geometric Deep Learning

图与推荐

2+阅读 · 2022年11月2日

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

VCIP 2022 Call for Special Session Proposals

VCIP 2022 Call for Special Session Proposals

CCF多媒体专委会

1+阅读 · 2022年4月1日

ACM TOMM Call for Papers

ACM TOMM Call for Papers

CCF多媒体专委会

2+阅读 · 2022年3月23日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

一类离散Hindmarsh-Rose模型的分支延拓

国家自然科学基金

0+阅读 · 2015年12月31日

Jacobi行列式和Hilbert变换中的若干问题及应用

国家自然科学基金

0+阅读 · 2014年12月31日

利用钙离子魔幻波长高精度测量4S-4P跃迁振子强度

国家自然科学基金

0+阅读 · 2014年12月31日

过钻头正交偶极子声波测井换能器研究

国家自然科学基金

0+阅读 · 2014年12月31日

Kronheimer-Nakajima quiver 模空间与有理曲面

国家自然科学基金

1+阅读 · 2013年12月31日

空天基MIMO-SAR地面运动目标探测方法研究

国家自然科学基金

0+阅读 · 2012年12月31日

基于多速率滤波器组的OFDM雷达波形设计

国家自然科学基金

1+阅读 · 2011年12月31日

与实数非整数基表示相关的若干分形问题

国家自然科学基金

0+阅读 · 2011年12月31日

基于振动和声频信号HHT特征提取的高速列车轨道伤损探测方法研究

国家自然科学基金

0+阅读 · 2010年12月31日

漂浮式风电机组三维流动及动态失速特性研究

国家自然科学基金

0+阅读 · 2009年12月31日

Surround-View Vision-based 3D Detection for Autonomous Driving: A Survey

Arxiv

0+阅读 · 2023年3月7日

3M3D: Multi-view, Multi-path, Multi-representation for 3D Object Detection

Arxiv

0+阅读 · 2023年3月7日

Refined Pseudo labeling for Source-free Domain Adaptive Object Detection

Arxiv

0+阅读 · 2023年3月7日

FIT: Frequency-based Image Translation for Domain Adaptive Object Detection

Arxiv

0+阅读 · 2023年3月7日

Graph-based View Motion Planning for Fruit Detection

Arxiv

0+阅读 · 2023年3月6日

BSH-Det3D: Improving 3D Object Detection with BEV Shape Heatmap

Arxiv

0+阅读 · 2023年3月3日

Confidence-driven Bounding Box Localization for Small Object Detection

Arxiv

0+阅读 · 2023年3月3日

MSMDFusion: Fusing LiDAR and Camera at Multiple Scales with Multi-Depth Seeds for 3D Object Detection

Arxiv

0+阅读 · 2023年3月3日

Towards Domain Generalization for Multi-view 3D Object Detection in Bird-Eye-View

Arxiv

0+阅读 · 2023年3月3日

Robust Collaborative 3D Object Detection in Presence of Pose Errors

Arxiv

0+阅读 · 2023年3月3日

VIP会员

文章信息

相关主题

相关VIP内容

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

专知会员服务

75+阅读 · 2022年6月28日

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

专知会员服务

78+阅读 · 2022年3月15日

【干货书】机器学习速查手册，135页pdf

【干货书】机器学习速查手册，135页pdf

专知会员服务

127+阅读 · 2020年11月20日

因果图，Causal Graphs，52页ppt

因果图，Causal Graphs，52页ppt

专知会员服务

250+阅读 · 2020年4月19日

【新书】数字图像(影像)处理手第二版，2176pdf，Mathematical Methods in Imaging

【新书】数字图像(影像)处理手第二版，2176pdf，Mathematical Methods in Imaging

专知会员服务

93+阅读 · 2020年2月12日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

《人与智能体在系统工程建模语言V2任务中的性能表现：基于用户中心化的评估方法》308页

《数据安全国家标准体系（2025版）》征求意见稿

AlphaMosaic：人工智能赋能的作战管理系统

《军事行动中通信平台的战略价值：提升战术效能与作战优势》

相关资讯

直播 | Interpretable and Trustworthy Graph Geometric Deep Learning

直播 | Interpretable and Trustworthy Graph Geometric Deep Learning

图与推荐

2+阅读 · 2022年11月2日

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

VCIP 2022 Call for Special Session Proposals

VCIP 2022 Call for Special Session Proposals

CCF多媒体专委会

1+阅读 · 2022年4月1日

ACM TOMM Call for Papers

ACM TOMM Call for Papers

CCF多媒体专委会

2+阅读 · 2022年3月23日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

相关论文

Surround-View Vision-based 3D Detection for Autonomous Driving: A Survey

Arxiv

0+阅读 · 2023年3月7日

3M3D: Multi-view, Multi-path, Multi-representation for 3D Object Detection

Arxiv

0+阅读 · 2023年3月7日

Refined Pseudo labeling for Source-free Domain Adaptive Object Detection

Arxiv

0+阅读 · 2023年3月7日

FIT: Frequency-based Image Translation for Domain Adaptive Object Detection

Arxiv

0+阅读 · 2023年3月7日

Graph-based View Motion Planning for Fruit Detection

Arxiv

0+阅读 · 2023年3月6日

BSH-Det3D: Improving 3D Object Detection with BEV Shape Heatmap

Arxiv

0+阅读 · 2023年3月3日

Confidence-driven Bounding Box Localization for Small Object Detection

Arxiv

0+阅读 · 2023年3月3日

MSMDFusion: Fusing LiDAR and Camera at Multiple Scales with Multi-Depth Seeds for 3D Object Detection

Arxiv

0+阅读 · 2023年3月3日

Towards Domain Generalization for Multi-view 3D Object Detection in Bird-Eye-View

Arxiv

0+阅读 · 2023年3月3日

Robust Collaborative 3D Object Detection in Presence of Pose Errors

Arxiv

0+阅读 · 2023年3月3日

相关基金

一类离散Hindmarsh-Rose模型的分支延拓

国家自然科学基金

0+阅读 · 2015年12月31日

Jacobi行列式和Hilbert变换中的若干问题及应用

国家自然科学基金

0+阅读 · 2014年12月31日

利用钙离子魔幻波长高精度测量4S-4P跃迁振子强度

国家自然科学基金

0+阅读 · 2014年12月31日

过钻头正交偶极子声波测井换能器研究

国家自然科学基金

0+阅读 · 2014年12月31日

Kronheimer-Nakajima quiver 模空间与有理曲面

国家自然科学基金

1+阅读 · 2013年12月31日

空天基MIMO-SAR地面运动目标探测方法研究

国家自然科学基金

0+阅读 · 2012年12月31日

基于多速率滤波器组的OFDM雷达波形设计

国家自然科学基金

1+阅读 · 2011年12月31日

与实数非整数基表示相关的若干分形问题

国家自然科学基金

0+阅读 · 2011年12月31日

基于振动和声频信号HHT特征提取的高速列车轨道伤损探测方法研究

国家自然科学基金

0+阅读 · 2010年12月31日

漂浮式风电机组三维流动及动态失速特性研究

国家自然科学基金

0+阅读 · 2009年12月31日

微信扫码咨询专知VIP会员