微缩Det:走向最终到尾端 3D 对象探测 (SparseDet: Towards End-to-End 3D Object Detection) - 专知论文

会员服务 ·

0

目标检测 · 3D · 端到端 · Performer · 特化 ·

2022 年 6 月 2 日

SparseDet: Towards End-to-End 3D Object Detection

翻译：微缩Det:走向最终到尾端 3D 对象探测

Jianhong Han,Zhaoyi Wan,Zhe Liu,Jie Feng,Bingfeng Zhou

In this paper, we propose SparseDet for end-to-end 3D object detection from point cloud. Existing works on 3D object detection rely on dense object candidates over all locations in a 3D or 2D grid following the mainstream methods for object detection in 2D images. However, this dense paradigm requires expertise in data to fulfill the gap between label and detection. As a new detection paradigm, SparseDet maintains a fixed set of learnable proposals to represent latent candidates and directly perform classification and localization for 3D objects through stacked transformers. It demonstrates that effective 3D object detection can be achieved with none of post-processing such as redundant removal and non-maximum suppression. With a properly designed network, SparseDet achieves highly competitive detection accuracy while running with a more efficient speed of 34.5 FPS. We believe this end-to-end paradigm of SparseDet will inspire new thinking on the sparsity of 3D object detection.

翻译：在本文中,我们建议从点云进行端到端的 3D 对象探测。 3D 对象探测的现有工作依赖于3D 或 2D 网格中所有地点的密度对象候选人, 遵循2D 图像中物体探测的主流方法。然而, 这种密集的模式要求数据方面的专门知识, 以填补标签与探测之间的差距。作为新的探测模式, Sprass Det 维持一套固定的可学习的建议, 以代表潜在对象, 通过堆叠变压器直接对 3D 对象进行分类和本地化。它表明, 有效的 3D 对象探测不可能在后处理中实现, 如多余的清除和非最大抑制等。在一个设计得当的网络中, SprassDet 取得高度竞争性的探测准确性, 同时以34.5 FPS 的更有效速度运行。我们相信, SprassD D 的端到端模式将激发对 3D 对象探测的宽度进行新的思考。

0

相关内容

目标检测

目标检测，也叫目标提取，是一种与计算机视觉和图像处理有关的计算机技术，用于检测数字图像和视频中特定类别的语义对象（例如人，建筑物或汽车）的实例。深入研究的对象检测领域包括面部检测和行人检测。对象检测在计算机视觉的许多领域都有应用，包括图像检索和视频监视。

知识荟萃

精品入门和进阶教程、论文和代码整理等

更多

查看相关VIP内容、论文、资讯等

【CVPR 2022】采用稀疏Transformer的单步法三维物体检测器，Embracing Single Stride 3D Object Detector with Sparse Transformer

【CVPR 2022】采用稀疏Transformer的单步法三维物体检测器，Embracing Single Stride 3D Object Detector with Sparse Transformer

专知会员服务

5+阅读 · 2022年3月12日

50+篇《神经架构搜索NAS》2020论文合集

专知会员服务

61+阅读 · 2020年3月19日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

95+阅读 · 2020年3月12日

抢鲜看！13篇CVPR2020论文链接/开源代码/解读

抢鲜看！13篇CVPR2020论文链接/开源代码/解读

专知会员服务

50+阅读 · 2020年2月26日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【ICIG2021】Latest News & Announcements of the Industry Talk1

【ICIG2021】Latest News & Announcements of the Industry Talk1

中国图象图形学学会CSIG

0+阅读 · 2021年7月28日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

ResNet, AlexNet, VGG, Inception：各种卷积网络架构的理解

ResNet, AlexNet, VGG, Inception：各种卷积网络架构的理解

全球人工智能

20+阅读 · 2017年12月17日

【推荐】ResNet, AlexNet, VGG, Inception：各种卷积网络架构的理解

【推荐】ResNet, AlexNet, VGG, Inception：各种卷积网络架构的理解

机器学习研究会

20+阅读 · 2017年12月17日

【推荐】YOLO实时目标检测(6fps)

【推荐】YOLO实时目标检测(6fps)

机器学习研究会

20+阅读 · 2017年11月5日

铜锌锡硫在晶界面的能带结构研究

国家自然科学基金

0+阅读 · 2014年12月31日

压缩感知域高光谱图像稀疏解混方法研究

国家自然科学基金

0+阅读 · 2013年12月31日

半导体衬底上FeSe薄膜的外延生长及界面超导

国家自然科学基金

0+阅读 · 2013年12月31日

基于Decorin基因甲基化调控的非小细胞肺癌转移的分子机制

国家自然科学基金

0+阅读 · 2011年12月31日

低温液体泵用高温超导磁悬浮轴承的性能研究

国家自然科学基金

0+阅读 · 2009年12月31日

REMg2TMx型多相合金的吸/放氢行为和衰减机理研究

国家自然科学基金

0+阅读 · 2009年12月31日

室温下金刚石晶体内N-V center单电子自旋量子比特研究

国家自然科学基金

0+阅读 · 2009年12月31日

海藻酸钠对碳纳米管的分散机理及碳纳米管纤维的液晶纺丝成型

国家自然科学基金

0+阅读 · 2009年12月31日

nano-MOy/3DOM La1-xSrxMO3 (M = Cr, Mn, Co)的可控制备及其氧化VOCs的催化性能研究

国家自然科学基金

0+阅读 · 2009年12月31日

储电型柔性有机薄膜太阳电池的基础研究

国家自然科学基金

0+阅读 · 2008年12月31日

Det6D: A Ground-Aware Full-Pose 3D Object Detector for Improving Terrain Robustness

Arxiv

0+阅读 · 2022年7月19日

Class-agnostic Object Detection with Multi-modal Transformer

Arxiv

0+阅读 · 2022年7月19日

Dense Teacher: Dense Pseudo-Labels for Semi-supervised Object Detection

Arxiv

0+阅读 · 2022年7月19日

Joint stereo 3D object detection and implicit surface reconstruction

Arxiv

0+阅读 · 2022年7月18日

DID-M3D: Decoupling Instance Depth for Monocular 3D Object Detection

Arxiv

0+阅读 · 2022年7月18日

Towards Open World Object Detection

Arxiv

13+阅读 · 2021年3月3日

Image/Video Deep Anomaly Detection: A Survey

Arxiv

16+阅读 · 2021年3月2日

Co-mining: Self-Supervised Learning for Sparsely Annotated Object Detection

Arxiv

13+阅读 · 2020年12月3日

UP-DETR: Unsupervised Pre-training for Object Detection with Transformers

UP-DETR: Unsupervised Pre-training for Object Detection with Transformers

Arxiv

19+阅读 · 2020年11月18日

Reverse Attention for Salient Object Detection

Arxiv

11+阅读 · 2019年4月15日

VIP会员

文章信息

相关主题

相关VIP内容

【CVPR 2022】采用稀疏Transformer的单步法三维物体检测器，Embracing Single Stride 3D Object Detector with Sparse Transformer

【CVPR 2022】采用稀疏Transformer的单步法三维物体检测器，Embracing Single Stride 3D Object Detector with Sparse Transformer

专知会员服务

5+阅读 · 2022年3月12日

50+篇《神经架构搜索NAS》2020论文合集

专知会员服务

61+阅读 · 2020年3月19日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

95+阅读 · 2020年3月12日

抢鲜看！13篇CVPR2020论文链接/开源代码/解读

抢鲜看！13篇CVPR2020论文链接/开源代码/解读

专知会员服务

50+阅读 · 2020年2月26日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

《复杂工程系统模型驱动设计决策支持系统：早期设计阶段挑战》最新138页

《日本陆上自卫队2040年作战方式与未来作战研究》最新23页slides

人工智能作为战争武器

《后勤保障》最新23页

相关资讯

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【ICIG2021】Latest News & Announcements of the Industry Talk1

【ICIG2021】Latest News & Announcements of the Industry Talk1

中国图象图形学学会CSIG

0+阅读 · 2021年7月28日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

ResNet, AlexNet, VGG, Inception：各种卷积网络架构的理解

ResNet, AlexNet, VGG, Inception：各种卷积网络架构的理解

全球人工智能

20+阅读 · 2017年12月17日

【推荐】ResNet, AlexNet, VGG, Inception：各种卷积网络架构的理解

【推荐】ResNet, AlexNet, VGG, Inception：各种卷积网络架构的理解

机器学习研究会

20+阅读 · 2017年12月17日

【推荐】YOLO实时目标检测(6fps)

【推荐】YOLO实时目标检测(6fps)

机器学习研究会

20+阅读 · 2017年11月5日

相关论文

Det6D: A Ground-Aware Full-Pose 3D Object Detector for Improving Terrain Robustness

Arxiv

0+阅读 · 2022年7月19日

Class-agnostic Object Detection with Multi-modal Transformer

Arxiv

0+阅读 · 2022年7月19日

Dense Teacher: Dense Pseudo-Labels for Semi-supervised Object Detection

Arxiv

0+阅读 · 2022年7月19日

Joint stereo 3D object detection and implicit surface reconstruction

Arxiv

0+阅读 · 2022年7月18日

DID-M3D: Decoupling Instance Depth for Monocular 3D Object Detection

Arxiv

0+阅读 · 2022年7月18日

Towards Open World Object Detection

Arxiv

13+阅读 · 2021年3月3日

Image/Video Deep Anomaly Detection: A Survey

Arxiv

16+阅读 · 2021年3月2日

Co-mining: Self-Supervised Learning for Sparsely Annotated Object Detection

Arxiv

13+阅读 · 2020年12月3日

UP-DETR: Unsupervised Pre-training for Object Detection with Transformers

UP-DETR: Unsupervised Pre-training for Object Detection with Transformers

Arxiv

19+阅读 · 2020年11月18日

Reverse Attention for Salient Object Detection

Arxiv

11+阅读 · 2019年4月15日

相关基金

铜锌锡硫在晶界面的能带结构研究

国家自然科学基金

0+阅读 · 2014年12月31日

压缩感知域高光谱图像稀疏解混方法研究

国家自然科学基金

0+阅读 · 2013年12月31日

半导体衬底上FeSe薄膜的外延生长及界面超导

国家自然科学基金

0+阅读 · 2013年12月31日

基于Decorin基因甲基化调控的非小细胞肺癌转移的分子机制

国家自然科学基金

0+阅读 · 2011年12月31日

低温液体泵用高温超导磁悬浮轴承的性能研究

国家自然科学基金

0+阅读 · 2009年12月31日

REMg2TMx型多相合金的吸/放氢行为和衰减机理研究

国家自然科学基金

0+阅读 · 2009年12月31日

室温下金刚石晶体内N-V center单电子自旋量子比特研究

国家自然科学基金

0+阅读 · 2009年12月31日

海藻酸钠对碳纳米管的分散机理及碳纳米管纤维的液晶纺丝成型

国家自然科学基金

0+阅读 · 2009年12月31日

nano-MOy/3DOM La1-xSrxMO3 (M = Cr, Mn, Co)的可控制备及其氧化VOCs的催化性能研究

国家自然科学基金

0+阅读 · 2009年12月31日

储电型柔性有机薄膜太阳电池的基础研究

国家自然科学基金

0+阅读 · 2008年12月31日

微信扫码咨询专知VIP会员