PVT: 用于 3D 深层学习的点Voxel 变换器 (PVT: Point-Voxel Transformer for 3D Deep Learning) - 专知论文

会员服务 ·

0

3D · 学成 · 可约的 · 变换 · INFORMS ·

2021 年 9 月 22 日

PVT: Point-Voxel Transformer for 3D Deep Learning

翻译：PVT: 用于 3D 深层学习的点Voxel 变换器

Cheng Zhang,Haocheng Wan,Shengqiang Liu,Xinyi Shen,Zizhao Wu

In this paper, we present an efficient and high-performance neural architecture, termed Point-Voxel Transformer (PVT)for 3D deep learning, which deeply integrates both 3D voxel-based and point-based self-attention computation to learn more discriminative features from 3D data. Specifically, we conduct multi-head self-attention (MSA) computation in voxels to obtain the efficient learning pattern and the coarse-grained local features while performing self-attention in points to provide finer-grained information about the global context. In addition, to reduce the cost of MSA computation with high efficiency, we design a cyclic shifted boxing scheme by limiting the MSA computation to non-overlapping local box and also preserving cross-box connection. Evaluated on classification benchmark, our method not only achieves state-of-the-art accuracy of 94.0% (no voting) but outperforms previous Transformer-based models with 7x measured speedup on average. On part and semantic segmentation, our model also obtains strong performance(86.5% and 68.2% mIoU, respectively). For 3D object detection task, we replace the primitives in Frustrum PointNet with PVT block and achieve an improvement of 8.6% AP.

翻译：在本文中,我们展示了一个高效和高性能的神经结构,称为3D深层学习的点-福克斯变异器(PVT),它深度整合了3D voxel基和点-自控计算方法,从 3D 数据中学习更加歧视的特征。具体地说,我们用三D 数据进行多头自控(MSA)计算,以获得高效的学习模式和粗粒的本地特征,同时进行点自控,以提供精确的关于全球环境的信息。此外,为了降低管理协议计算的成本,我们设计了一个循环式转换箱计划,将管理协议的计算限制在不重叠的地方框和保存交叉框连接。根据分类基准评估,我们的方法不仅实现了94.0%(无投票)的最新准确度,而且比以往的基于变压器模型高出平均7x的速率。在部分和语义分割方面,我们的模型还取得了很强的性能(86.5 % 和68.2% 移动框框), 并分别取代了FIstrainstru 目标(我们B) 3 和FIstrainstrestrestrestru 。

0

相关内容

3D是英文“Three Dimensions”的简称，中文是指三维、三个维度、三个坐标，即有长、有宽、有高，换句话说，就是立体的，是相对于只有长和宽的平面（2D）而言。

【图与几何深度学习】Graph and geometric deep learning，49页ppt

【图与几何深度学习】Graph and geometric deep learning，49页ppt

专知会员服务

65+阅读 · 2021年4月24日

最新《Transformers模型》教程，64页ppt

最新《Transformers模型》教程，64页ppt

专知会员服务

321+阅读 · 2020年11月26日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

166+阅读 · 2020年3月18日

【深度学习架构、模型和技巧集合(TensorFlow/PyTorch)】’Deep Learning Models - A collection of various deep learning architectures, models, and tips'

【深度学习架构、模型和技巧集合(TensorFlow/PyTorch)】’Deep Learning Models - A collection of various deep learning architectures, models, and tips'

专知会员服务

59+阅读 · 2020年1月25日

【厦门大学】综述：深度学习3D点云分割，Review: deep learning on 3D point clouds

【厦门大学】综述：深度学习3D点云分割，Review: deep learning on 3D point clouds

专知会员服务

71+阅读 · 2020年1月22日

【报告推荐】三维及超形体分析中的几何与数据学习（Geometry and Learning from Data in 3D and Beyond - Shape Analysis）

【报告推荐】三维及超形体分析中的几何与数据学习（Geometry and Learning from Data in 3D and Beyond - Shape Analysis）

专知会员服务

23+阅读 · 2019年11月10日

【IPAM workshops】加州大学洛杉矶分校会议：Geometry and Learning from Data in 3D and Beyond， workshop Ⅳ： Deep Geometric Learning of Big Data and Applications

【IPAM workshops】加州大学洛杉矶分校会议：Geometry and Learning from Data in 3D and Beyond， workshop Ⅳ： Deep Geometric Learning of Big Data and Applications

专知会员服务

19+阅读 · 2019年11月10日

【IPAM workshops】加州大学洛杉矶分校会议：Geometry and Learning from Data in 3D and Beyond，workshop Ⅲ：Geometry of Big Data

【IPAM workshops】加州大学洛杉矶分校会议：Geometry and Learning from Data in 3D and Beyond，workshop Ⅲ：Geometry of Big Data

专知会员服务

8+阅读 · 2019年11月10日

Stabilizing Transformers for Reinforcement Learning

Stabilizing Transformers for Reinforcement Learning

专知会员服务

60+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

BERT/Transformer/迁移学习NLP资源大列表

BERT/Transformer/迁移学习NLP资源大列表

专知

19+阅读 · 2019年6月9日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

CVPR2019| 04-23更新7篇论文及代码（1篇oral，含视频目标分割、物体检测、三维点云等）

CVPR2019| 04-23更新7篇论文及代码（1篇oral，含视频目标分割、物体检测、三维点云等）

极市平台

27+阅读 · 2019年4月23日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

STRCF for Visual Object Tracking

STRCF for Visual Object Tracking

统计学习与视觉计算组

15+阅读 · 2018年5月29日

Hierarchical Imitation - Reinforcement Learning

Hierarchical Imitation - Reinforcement Learning

CreateAMind

19+阅读 · 2018年5月25日

【论文推荐】最新5篇度量学习（Metric Learning）相关论文—人脸验证、BIER、自适应图卷积、注意力机制、单次学习

【论文推荐】最新5篇度量学习（Metric Learning）相关论文—人脸验证、BIER、自适应图卷积、注意力机制、单次学习

专知

17+阅读 · 2018年2月11日

【推荐】全卷积语义分割综述

【推荐】全卷积语义分割综述

机器学习研究会

19+阅读 · 2017年8月31日

Background-Aware 3D Point Cloud Segmentationwith Dynamic Point Feature Aggregation

Arxiv

0+阅读 · 2021年11月14日

Learning Online for Unified Segmentation and Tracking Models

Arxiv

0+阅读 · 2021年11月12日

Dense Unsupervised Learning for Video Segmentation

Arxiv

7+阅读 · 2021年11月11日

ASLFeat: Learning Local Features of Accurate Shape and Localization

Arxiv

6+阅读 · 2020年3月23日

Review: deep learning on 3D point clouds

Review: deep learning on 3D point clouds

Arxiv

5+阅读 · 2020年1月17日

Deep Learning for 3D Point Clouds: A Survey

Deep Learning for 3D Point Clouds: A Survey

Arxiv

3+阅读 · 2019年12月27日

RandLA-Net: Efficient Semantic Segmentation of Large-Scale Point Clouds

Arxiv

11+阅读 · 2019年11月25日

Few-shot 3D Multi-modal Medical Image Segmentation using Generative Adversarial Learning

Few-shot 3D Multi-modal Medical Image Segmentation using Generative Adversarial Learning

Arxiv

9+阅读 · 2018年10月29日

Learning to Generate and Reconstruct 3D Meshes with only 2D Supervision

Learning to Generate and Reconstruct 3D Meshes with only 2D Supervision

Arxiv

5+阅读 · 2018年7月24日

PointNet++: Deep Hierarchical Feature Learning on Point Sets in a Metric Space

Arxiv

4+阅读 · 2017年6月7日

VIP会员

文章信息

相关主题

相关VIP内容

【图与几何深度学习】Graph and geometric deep learning，49页ppt

【图与几何深度学习】Graph and geometric deep learning，49页ppt

专知会员服务

65+阅读 · 2021年4月24日

最新《Transformers模型》教程，64页ppt

最新《Transformers模型》教程，64页ppt

专知会员服务

321+阅读 · 2020年11月26日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

166+阅读 · 2020年3月18日

【深度学习架构、模型和技巧集合(TensorFlow/PyTorch)】’Deep Learning Models - A collection of various deep learning architectures, models, and tips'

【深度学习架构、模型和技巧集合(TensorFlow/PyTorch)】’Deep Learning Models - A collection of various deep learning architectures, models, and tips'

专知会员服务

59+阅读 · 2020年1月25日

【厦门大学】综述：深度学习3D点云分割，Review: deep learning on 3D point clouds

【厦门大学】综述：深度学习3D点云分割，Review: deep learning on 3D point clouds

专知会员服务

71+阅读 · 2020年1月22日

【报告推荐】三维及超形体分析中的几何与数据学习（Geometry and Learning from Data in 3D and Beyond - Shape Analysis）

【报告推荐】三维及超形体分析中的几何与数据学习（Geometry and Learning from Data in 3D and Beyond - Shape Analysis）

专知会员服务

23+阅读 · 2019年11月10日

【IPAM workshops】加州大学洛杉矶分校会议：Geometry and Learning from Data in 3D and Beyond， workshop Ⅳ： Deep Geometric Learning of Big Data and Applications

【IPAM workshops】加州大学洛杉矶分校会议：Geometry and Learning from Data in 3D and Beyond， workshop Ⅳ： Deep Geometric Learning of Big Data and Applications

专知会员服务

19+阅读 · 2019年11月10日

【IPAM workshops】加州大学洛杉矶分校会议：Geometry and Learning from Data in 3D and Beyond，workshop Ⅲ：Geometry of Big Data

【IPAM workshops】加州大学洛杉矶分校会议：Geometry and Learning from Data in 3D and Beyond，workshop Ⅲ：Geometry of Big Data

专知会员服务

8+阅读 · 2019年11月10日

Stabilizing Transformers for Reinforcement Learning

Stabilizing Transformers for Reinforcement Learning

专知会员服务

60+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

热门VIP内容

开通专知VIP会员享更多权益服务

《乌克兰无人机产业：志愿者与政策在构建新兴无人机产业中的协同作用》最新报告

《人工智能辅助决策中的数据可视化：系统性综述》

人工智能驱动弹药制造现代化：美国陆军转型之路

《敏捷作战部署中枢纽-辐条基地选址优化研究》80页

相关资讯

BERT/Transformer/迁移学习NLP资源大列表

BERT/Transformer/迁移学习NLP资源大列表

专知

19+阅读 · 2019年6月9日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

CVPR2019| 04-23更新7篇论文及代码（1篇oral，含视频目标分割、物体检测、三维点云等）

CVPR2019| 04-23更新7篇论文及代码（1篇oral，含视频目标分割、物体检测、三维点云等）

极市平台

27+阅读 · 2019年4月23日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

STRCF for Visual Object Tracking

STRCF for Visual Object Tracking

统计学习与视觉计算组

15+阅读 · 2018年5月29日

Hierarchical Imitation - Reinforcement Learning

Hierarchical Imitation - Reinforcement Learning

CreateAMind

19+阅读 · 2018年5月25日

【论文推荐】最新5篇度量学习（Metric Learning）相关论文—人脸验证、BIER、自适应图卷积、注意力机制、单次学习

【论文推荐】最新5篇度量学习（Metric Learning）相关论文—人脸验证、BIER、自适应图卷积、注意力机制、单次学习

专知

17+阅读 · 2018年2月11日

【推荐】全卷积语义分割综述

【推荐】全卷积语义分割综述

机器学习研究会

19+阅读 · 2017年8月31日

相关论文

Background-Aware 3D Point Cloud Segmentationwith Dynamic Point Feature Aggregation

Arxiv

0+阅读 · 2021年11月14日

Learning Online for Unified Segmentation and Tracking Models

Arxiv

0+阅读 · 2021年11月12日

Dense Unsupervised Learning for Video Segmentation

Arxiv

7+阅读 · 2021年11月11日

ASLFeat: Learning Local Features of Accurate Shape and Localization

Arxiv

6+阅读 · 2020年3月23日

Review: deep learning on 3D point clouds

Review: deep learning on 3D point clouds

Arxiv

5+阅读 · 2020年1月17日

Deep Learning for 3D Point Clouds: A Survey

Deep Learning for 3D Point Clouds: A Survey

Arxiv

3+阅读 · 2019年12月27日

RandLA-Net: Efficient Semantic Segmentation of Large-Scale Point Clouds

Arxiv

11+阅读 · 2019年11月25日

Few-shot 3D Multi-modal Medical Image Segmentation using Generative Adversarial Learning

Few-shot 3D Multi-modal Medical Image Segmentation using Generative Adversarial Learning

Arxiv

9+阅读 · 2018年10月29日

Learning to Generate and Reconstruct 3D Meshes with only 2D Supervision

Learning to Generate and Reconstruct 3D Meshes with only 2D Supervision

Arxiv

5+阅读 · 2018年7月24日

PointNet++: Deep Hierarchical Feature Learning on Point Sets in a Metric Space

Arxiv

4+阅读 · 2017年6月7日

微信扫码咨询专知VIP会员