DeepInteraction: 3D Object Detection via Modality Interaction - 专知论文 (Zhuanzhi Papers)


August 24, 2022

DeepInteraction: 3D Object Detection via Modality Interaction


Zeyu Yang,Jiaqi Chen,Zhenwei Miao,Wei Li,Xiatian Zhu,Li Zhang

Existing top-performing 3D object detectors typically rely on a multi-modal fusion strategy. This design is fundamentally limited, however, because it overlooks useful modality-specific information, which ultimately hampers model performance. To address this limitation, we introduce a novel modality interaction strategy in which individual per-modality representations are learned and maintained throughout, so that their unique characteristics can be exploited during object detection. To realize this strategy, we design the DeepInteraction architecture, characterized by a multi-modal representational interaction encoder and a multi-modal predictive interaction decoder. Experiments on the large-scale nuScenes dataset show that the proposed method surpasses all prior art, often by a large margin. Crucially, our method ranks first on the highly competitive nuScenes object detection leaderboard.
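
As a rough illustration of the idea in the abstract (keeping one representation per modality and letting the two exchange information, rather than merging them into a single fused feature), here is a minimal toy sketch. The abstract does not describe the mechanism, so the dot-product cross-attention, the residual update, and all names (`cross_attend`, `interact`, `lidar_tokens`, `image_tokens`) are assumptions made for illustration and should not be read as the authors' implementation.

```python
import math


def softmax(xs):
    """Numerically stable softmax over a list of floats."""
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]


def cross_attend(queries, context):
    """Each query token attends over the tokens of the other modality.

    Plain scaled dot-product attention with no learned weights
    (an assumption; a real model would use learned projections).
    """
    d = len(queries[0])
    attended = []
    for q in queries:
        scores = [sum(a * b for a, b in zip(q, k)) / math.sqrt(d) for k in context]
        weights = softmax(scores)
        attended.append(
            [sum(w * tok[j] for w, tok in zip(weights, context)) for j in range(d)]
        )
    return attended


def interact(lidar, image):
    """One hypothetical interaction step.

    Both streams receive information from the other one through a
    residual update, and both are returned: nothing is fused away.
    """
    lidar_ctx = cross_attend(lidar, image)
    image_ctx = cross_attend(image, lidar)
    new_lidar = [[x + c for x, c in zip(tok, ctx)] for tok, ctx in zip(lidar, lidar_ctx)]
    new_image = [[x + c for x, c in zip(tok, ctx)] for tok, ctx in zip(image, image_ctx)]
    return new_lidar, new_image


# Made-up 2-dimensional tokens, purely for demonstration.
lidar_tokens = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
image_tokens = [[0.5, 0.5], [2.0, 0.0]]

lidar_out, image_out = interact(lidar_tokens, image_tokens)
print(len(lidar_out), len(image_out))
```

The point of the sketch is structural: after the step there are still two token sets, one per modality, each with its original number of tokens, so a later stage can still draw on either stream separately.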


