OctFormer: Octree-based Transformers for 3D Point Clouds


Point Cloud · Attention · Microsoft Windows · Transform · 3D

May 8, 2023


Peng-Shuai Wang

From arXiv; SIGGRAPH 2023, Journal Track

We propose octree-based transformers, named OctFormer, for 3D point cloud learning. OctFormer not only serves as a general and effective backbone for 3D point cloud segmentation and object detection but also has linear complexity and scales to large point clouds. The key challenge in applying transformers to point clouds is reducing the quadratic, and thus overwhelming, computational complexity of attention. To address this issue, several works divide point clouds into non-overlapping windows and constrain attention to each local window. However, the number of points in each window varies greatly, which impedes efficient execution on GPUs. Observing that attention is robust to the shapes of local windows, we propose a novel octree attention, which leverages the sorted shuffled keys of octrees to partition point clouds into local windows containing a fixed number of points while permitting the shapes of the windows to change freely. We also introduce dilated octree attention to expand the receptive field further. Our octree attention can be implemented in 10 lines of code with open-source libraries and runs 17 times faster than other point cloud attentions when the number of points exceeds 200k. Built upon the octree attention, OctFormer can be easily scaled up and achieves state-of-the-art performance on a series of 3D segmentation and detection benchmarks, surpassing previous sparse-voxel-based CNNs and point cloud transformers in both efficiency and effectiveness. Notably, on the challenging ScanNet200 dataset, OctFormer outperforms sparse-voxel-based CNNs by 7.3 in mIoU. Our code and trained models are available at https://wang-ps.github.io/octformer.
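The window partition described in the abstract can be illustrated with a minimal sketch in plain Python. It is not the authors' implementation: the function names `shuffled_key` and `octree_windows`, the `PAD` marker, and the exact bit order of the key are assumptions made for illustration. The sketch sorts points by a bit-interleaved (z-order) key, pads the sorted list, and cuts it into windows that each hold exactly `window_size` entries. With `dilation > 1`, each window takes every `dilation`-th point from a larger block, which is one plausible reading of the dilated variant.

```python
from typing import List, Sequence, Tuple

PAD = -1  # hypothetical marker for padded slots; attention would mask these out


def shuffled_key(x: int, y: int, z: int, depth: int) -> int:
    """Interleave the low `depth` bits of x, y, z into one integer key.

    Sorting by this key follows a z-order curve, so points that are close
    in the sorted list tend to be close in space. The bit order (x, y, z
    from high to low) is an assumption of this sketch.
    """
    key = 0
    for i in range(depth):
        key |= ((x >> i) & 1) << (3 * i + 2)
        key |= ((y >> i) & 1) << (3 * i + 1)
        key |= ((z >> i) & 1) << (3 * i)
    return key


def octree_windows(
    coords: Sequence[Tuple[int, int, int]],
    depth: int,
    window_size: int,
    dilation: int = 1,
) -> List[List[int]]:
    """Partition point indices into windows of exactly `window_size` entries.

    Window shapes are free: a window is simply a run of consecutive points
    in key order, whatever region of space those points happen to cover.
    """
    # Indices of the points, sorted by their shuffled key.
    order = sorted(range(len(coords)),
                   key=lambda i: shuffled_key(*coords[i], depth))

    # Pad so the sorted list splits evenly into blocks.
    block_size = window_size * dilation
    order += [PAD] * (-len(order) % block_size)

    windows = []
    for start in range(0, len(order), block_size):
        block = order[start:start + block_size]
        # With dilation D, window j takes entries j, j + D, j + 2D, ...
        # of the block, so each window spans a wider stretch of the curve.
        for j in range(dilation):
            windows.append(block[j::dilation])
    return windows
```

Because every window has the same length, the windows can be stacked into a dense batch and passed to a standard attention routine without per-window bookkeeping, and the attention cost grows linearly with the number of points for a fixed `window_size`. For the eight corners of a 2x2x2 grid with `window_size=4`, the sketch yields two windows; with `dilation=2` it yields two interleaved windows over the same eight points.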


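Point clouds obtained by laser measurement contain 3D coordinates (XYZ) and laser reflection intensity (Intensity). Point clouds obtained by photogrammetry contain 3D coordinates (XYZ) and color information (RGB). Point clouds obtained by combining laser measurement and photogrammetry contain 3D coordinates (XYZ), laser reflection intensity (Intensity), and color information (RGB). Once the spatial coordinates of each sample point on an object's surface have been acquired, the result is a set of points, called a "point cloud".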
