3D object detection from lidar or camera sensors is essential for autonomous driving. Pioneering attempts at multi-modality fusion complement the sparse lidar point clouds with rich semantic texture information from images, at the cost of extra network design and computational overhead. In this work, we propose a novel semantic passing framework, named SPNet, to boost the performance of existing lidar-based 3D detection models with the guidance of rich context painting, at no extra computational cost during inference. Our key design is to first exploit the instructive semantic knowledge latent in the ground-truth labels by training a semantic-painted teacher model, and then guide the pure-lidar network to learn the semantic-painted representation via knowledge passing modules at different granularities: class-wise passing, pixel-wise passing, and instance-wise passing. Experimental results show that the proposed SPNet can seamlessly cooperate with most existing 3D detection frameworks, yielding a 1~5% AP gain, and even achieves new state-of-the-art 3D detection performance on the KITTI test benchmark. Code is available at: https://github.com/jb892/SPNet.
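To make the three granularities of knowledge passing concrete, the following is a minimal, hypothetical sketch of what distillation losses at the class, pixel, and instance level could look like in PyTorch. The tensor shapes, function names, and the choice of plain KL/MSE terms are illustrative assumptions, not the implementation described in the paper.

```python
# Hypothetical sketch of three knowledge-passing losses (class-wise,
# pixel-wise, instance-wise) for distilling a semantic-painted teacher
# into a pure-lidar student. Shapes and loss forms are assumptions,
# not taken from the SPNet code.
import torch
import torch.nn.functional as F


def class_wise_loss(student_logits, teacher_logits, temperature=2.0):
    """KL divergence between teacher and student class-score distributions."""
    t = temperature
    p_teacher = F.softmax(teacher_logits / t, dim=-1)
    log_p_student = F.log_softmax(student_logits / t, dim=-1)
    return F.kl_div(log_p_student, p_teacher, reduction="batchmean") * (t * t)


def pixel_wise_loss(student_feat, teacher_feat):
    """MSE between dense BEV feature maps, matched pixel by pixel."""
    return F.mse_loss(student_feat, teacher_feat)


def instance_wise_loss(student_feat, teacher_feat, instance_masks):
    """MSE restricted to foreground (instance) regions of the feature map."""
    mask = instance_masks.float().unsqueeze(1)            # [B, 1, H, W]
    diff = (student_feat - teacher_feat) ** 2 * mask      # broadcast over channels
    return diff.sum() / mask.sum().clamp(min=1.0)


if __name__ == "__main__":
    # Dummy tensors: batch of 2, 64-channel 128x128 BEV maps, 100 anchors, 3 classes.
    s_feat, t_feat = torch.randn(2, 64, 128, 128), torch.randn(2, 64, 128, 128)
    s_logits, t_logits = torch.randn(2, 100, 3), torch.randn(2, 100, 3)
    masks = torch.randint(0, 2, (2, 128, 128))

    total = (class_wise_loss(s_logits, t_logits)
             + pixel_wise_loss(s_feat, t_feat)
             + instance_wise_loss(s_feat, t_feat, masks))
    print(total.item())
```

Because the teacher is only used to produce these targets during training, the student keeps its original architecture and runtime cost at inference, which matches the "no extra computation cost" claim in the abstract.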