MGTANet:3D天体探测使用长期短期动议-指导时间注意编码序列激光雷达点 (MGTANet: Encoding Sequential LiDAR Points Using Long Short-Term Motion-Guided Temporal Attention for 3D Object Detection) - 专知论文

会员服务 ·

0

LIDAR · Performer · 点云 · 3D · 目标检测 ·

2022 年 12 月 1 日

MGTANet: Encoding Sequential LiDAR Points Using Long Short-Term Motion-Guided Temporal Attention for 3D Object Detection

翻译：MGTANet:3D天体探测使用长期短期动议-指导时间注意编码序列激光雷达点

Junho Koh,Junhyung Lee,Youngwoo Lee,Jaekyum Kim,Jun Won Choi

from arxiv, Thirty-Seventh AAAI Conference on Artificial Intelligence (AAAI'23)

Most scanning LiDAR sensors generate a sequence of point clouds in real-time. While conventional 3D object detectors use a set of unordered LiDAR points acquired over a fixed time interval, recent studies have revealed that substantial performance improvement can be achieved by exploiting the spatio-temporal context present in a sequence of LiDAR point sets. In this paper, we propose a novel 3D object detection architecture, which can encode LiDAR point cloud sequences acquired by multiple successive scans. The encoding process of the point cloud sequence is performed on two different time scales. We first design a short-term motion-aware voxel encoding that captures the short-term temporal changes of point clouds driven by the motion of objects in each voxel. We also propose long-term motion-guided bird's eye view (BEV) feature enhancement that adaptively aligns and aggregates the BEV feature maps obtained by the short-term voxel encoding by utilizing the dynamic motion context inferred from the sequence of the feature maps. The experiments conducted on the public nuScenes benchmark demonstrate that the proposed 3D object detector offers significant improvements in performance compared to the baseline methods and that it sets a state-of-the-art performance for certain 3D object detection categories. Code is available at https://github.com/HYjhkoh/MGTANet.git

翻译：虽然常规的 3D 对象探测器使用一套固定时间间隔内获得的未定序的 LiDAR 点,但最近的研究显示,通过利用LIDAR 点数组序列中存在的轨迹-时空环境,可以实现显著的性能改进。在本文中,我们提议了一个新型的 3D 对象探测结构,该结构可以将连续多次扫描获得的LIDAR 点云序列编码成一个序列。点云序列的编码过程在两个不同的时间尺度上进行。我们首先设计一套短期运动- 觉知/ voxel 编码,以捕捉由每个 voxel 物体运动驱动的点云短期时间变化。我们还提出长期运动- 导航鸟眼观(BEV) 特性增强,通过利用从地貌地图序列中推断的动态运动环境来对通过短期 voxel 编码获得的 BEV 特征地图进行适应性调整和汇总。在公共 nuScenes 基准上进行的实验显示,拟议的 3D 对象探测器/D 标准显示,在基准线/ 目标探测中提供显著的性改进。

0

相关内容

LIDAR

50+篇《神经架构搜索NAS》2020论文合集

专知会员服务

61+阅读 · 2020年3月19日

抢鲜看！13篇CVPR2020论文链接/开源代码/解读

抢鲜看！13篇CVPR2020论文链接/开源代码/解读

专知会员服务

50+阅读 · 2020年2月26日

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

专知会员服务

19+阅读 · 2019年10月22日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Stabilizing Transformers for Reinforcement Learning

Stabilizing Transformers for Reinforcement Learning

专知会员服务

60+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

《DeepGCNs: Making GCNs Go as Deep as CNNs》

《DeepGCNs: Making GCNs Go as Deep as CNNs》

专知会员服务

31+阅读 · 2019年10月17日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【ICIG2021】Latest News & Announcements of the Tutorial

【ICIG2021】Latest News & Announcements of the Tutorial

中国图象图形学学会CSIG

3+阅读 · 2021年12月20日

【ICIG2021】Latest News & Announcements of the Workshop

【ICIG2021】Latest News & Announcements of the Workshop

中国图象图形学学会CSIG

0+阅读 · 2021年12月20日

【ICIG2021】Latest News & Announcements of the Plenary Talk1

【ICIG2021】Latest News & Announcements of the Plenary Talk1

中国图象图形学学会CSIG

0+阅读 · 2021年11月1日

【ICIG2021】Latest News & Announcements of the Industry Talk2

【ICIG2021】Latest News & Announcements of the Industry Talk2

中国图象图形学学会CSIG

0+阅读 · 2021年7月29日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

Sigma 1受体对血管性痴呆小鼠血脑屏障的调节作用及机制研究

国家自然科学基金

0+阅读 · 2015年12月31日

麦冬皂苷通过下调lnc-MALAT1抑制NSCLC血管生成的机制研究

国家自然科学基金

0+阅读 · 2015年12月31日

抑癌基因HOXD10及其启动子甲基化调控前列腺癌的作用及机制研究

国家自然科学基金

0+阅读 · 2015年12月31日

Ferroportin1（FPN1)基因对破骨细胞分化和功能的调控及机制研究

国家自然科学基金

0+阅读 · 2015年12月31日

PICK1在脑内氧化应激损伤中的作用及其机制研究

国家自然科学基金

0+阅读 · 2014年12月31日

SPARC对脾脏边缘带B细胞功能的调节作用及机理研究

国家自然科学基金

0+阅读 · 2014年12月31日

血管紧张素-(1-7)在动脉粥样硬化中对基质金属蛋白酶-8的调控和机制研究

国家自然科学基金

0+阅读 · 2013年12月31日

Kronheimer-Nakajima quiver 模空间与有理曲面

国家自然科学基金

1+阅读 · 2013年12月31日

CD4+T细胞亚群失衡在高眼压视神经损伤中的作用

国家自然科学基金

0+阅读 · 2012年12月31日

NOD样受体介导的免疫调控在糖尿病肾病中的作用机制及干预策略

国家自然科学基金

0+阅读 · 2011年12月31日

IMPORTANT-Net: Integrated MRI Multi-Parameter Reinforcement Fusion Generator with Attention Network for Synthesizing Absent Data

Arxiv

0+阅读 · 2023年2月3日

Leveraging task dependency and contrastive learning for Legal Judgement Prediction on the European Court of Human Rights

Leveraging task dependency and contrastive learning for Legal Judgement Prediction on the European Court of Human Rights

Arxiv

0+阅读 · 2023年2月3日

CVTNet: A Cross-View Transformer Network for Place Recognition Using LiDAR Data

Arxiv

0+阅读 · 2023年2月3日

High-resolution Iterative Feedback Network for Camouflaged Object Detection

Arxiv

0+阅读 · 2023年2月3日

Robust Camera Pose Refinement for Multi-Resolution Hash Encoding

Arxiv

0+阅读 · 2023年2月3日

Aerial Image Object Detection With Vision Transformer Detector (ViTDet)

Arxiv

0+阅读 · 2023年2月2日

Ditto in the House: Building Articulation Models of Indoor Scenes through Interactive Perception

Arxiv

0+阅读 · 2023年2月2日

AOP-Net: All-in-One Perception Network for Joint LiDAR-based 3D Object Detection and Panoptic Segmentation

Arxiv

0+阅读 · 2023年2月2日

FocalMix: Semi-Supervised Learning for 3D Medical Image Detection

FocalMix: Semi-Supervised Learning for 3D Medical Image Detection

Arxiv

10+阅读 · 2020年3月20日

Prime Sample Attention in Object Detection

Arxiv

13+阅读 · 2019年4月9日

VIP会员

文章信息

相关主题

相关VIP内容

50+篇《神经架构搜索NAS》2020论文合集

专知会员服务

61+阅读 · 2020年3月19日

抢鲜看！13篇CVPR2020论文链接/开源代码/解读

抢鲜看！13篇CVPR2020论文链接/开源代码/解读

专知会员服务

50+阅读 · 2020年2月26日

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

专知会员服务

19+阅读 · 2019年10月22日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Stabilizing Transformers for Reinforcement Learning

Stabilizing Transformers for Reinforcement Learning

专知会员服务

60+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

《DeepGCNs: Making GCNs Go as Deep as CNNs》

《DeepGCNs: Making GCNs Go as Deep as CNNs》

专知会员服务

31+阅读 · 2019年10月17日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

《利用大语言模型（LLM）优化海军陆战队经验教训学习》2025年最新103页

《加拿大陆军顶层作战概念》2025最新33页

超越第一人称视角（FPV）无人机：汲取俄乌战争的全部教训

《瓦洛伦斯（ValoRens）项目 - 预测分析：解读敌方意图》

相关资讯

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【ICIG2021】Latest News & Announcements of the Tutorial

【ICIG2021】Latest News & Announcements of the Tutorial

中国图象图形学学会CSIG

3+阅读 · 2021年12月20日

【ICIG2021】Latest News & Announcements of the Workshop

【ICIG2021】Latest News & Announcements of the Workshop

中国图象图形学学会CSIG

0+阅读 · 2021年12月20日

【ICIG2021】Latest News & Announcements of the Plenary Talk1

【ICIG2021】Latest News & Announcements of the Plenary Talk1

中国图象图形学学会CSIG

0+阅读 · 2021年11月1日

【ICIG2021】Latest News & Announcements of the Industry Talk2

【ICIG2021】Latest News & Announcements of the Industry Talk2

中国图象图形学学会CSIG

0+阅读 · 2021年7月29日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

相关论文

IMPORTANT-Net: Integrated MRI Multi-Parameter Reinforcement Fusion Generator with Attention Network for Synthesizing Absent Data

Arxiv

0+阅读 · 2023年2月3日

Leveraging task dependency and contrastive learning for Legal Judgement Prediction on the European Court of Human Rights

Leveraging task dependency and contrastive learning for Legal Judgement Prediction on the European Court of Human Rights

Arxiv

0+阅读 · 2023年2月3日

CVTNet: A Cross-View Transformer Network for Place Recognition Using LiDAR Data

Arxiv

0+阅读 · 2023年2月3日

High-resolution Iterative Feedback Network for Camouflaged Object Detection

Arxiv

0+阅读 · 2023年2月3日

Robust Camera Pose Refinement for Multi-Resolution Hash Encoding

Arxiv

0+阅读 · 2023年2月3日

Aerial Image Object Detection With Vision Transformer Detector (ViTDet)

Arxiv

0+阅读 · 2023年2月2日

Ditto in the House: Building Articulation Models of Indoor Scenes through Interactive Perception

Arxiv

0+阅读 · 2023年2月2日

AOP-Net: All-in-One Perception Network for Joint LiDAR-based 3D Object Detection and Panoptic Segmentation

Arxiv

0+阅读 · 2023年2月2日

FocalMix: Semi-Supervised Learning for 3D Medical Image Detection

FocalMix: Semi-Supervised Learning for 3D Medical Image Detection

Arxiv

10+阅读 · 2020年3月20日

Prime Sample Attention in Object Detection

Arxiv

13+阅读 · 2019年4月9日

相关基金

Sigma 1受体对血管性痴呆小鼠血脑屏障的调节作用及机制研究

国家自然科学基金

0+阅读 · 2015年12月31日

麦冬皂苷通过下调lnc-MALAT1抑制NSCLC血管生成的机制研究

国家自然科学基金

0+阅读 · 2015年12月31日

抑癌基因HOXD10及其启动子甲基化调控前列腺癌的作用及机制研究

国家自然科学基金

0+阅读 · 2015年12月31日

Ferroportin1（FPN1)基因对破骨细胞分化和功能的调控及机制研究

国家自然科学基金

0+阅读 · 2015年12月31日

PICK1在脑内氧化应激损伤中的作用及其机制研究

国家自然科学基金

0+阅读 · 2014年12月31日

SPARC对脾脏边缘带B细胞功能的调节作用及机理研究

国家自然科学基金

0+阅读 · 2014年12月31日

血管紧张素-(1-7)在动脉粥样硬化中对基质金属蛋白酶-8的调控和机制研究

国家自然科学基金

0+阅读 · 2013年12月31日

Kronheimer-Nakajima quiver 模空间与有理曲面

国家自然科学基金

1+阅读 · 2013年12月31日

CD4+T细胞亚群失衡在高眼压视神经损伤中的作用

国家自然科学基金

0+阅读 · 2012年12月31日

NOD样受体介导的免疫调控在糖尿病肾病中的作用机制及干预策略

国家自然科学基金

0+阅读 · 2011年12月31日

微信扫码咨询专知VIP会员