以轻量量自注意为基础的模型进行分级点云编码和解码 (Hierarchical Point Cloud Encoding and Decoding with Lightweight Self-Attention based Model)

In this paper we present SA-CNN, a hierarchical and lightweight self-attention based encoding and decoding architecture for representation learning of point cloud data. The proposed SA-CNN introduces convolution and transposed convolution stacks to capture and generate contextual information among unordered 3D points. Following conventional hierarchical pipeline, the encoding process extracts feature in local-to-global manner, while the decoding process generates feature and point cloud in coarse-to-fine, multi-resolution stages. We demonstrate that SA-CNN is capable of a wide range of applications, namely classification, part segmentation, reconstruction, shape retrieval, and unsupervised classification. While achieving the state-of-the-art or comparable performance in the benchmarks, SA-CNN maintains its model complexity several order of magnitude lower than the others. In term of qualitative results, we visualize the multi-stage point cloud reconstructions and latent walks on rigid objects as well as deformable non-rigid human and robot models.

翻译：在本文中,我们介绍了一个基于分级和轻量级的自我注意编码和解码结构,用以代表点云数据的学习。拟议的SA-CNN引入了变化和转换的变换堆,以便在没有顺序的3D点中捕捉和生成背景信息。在传统的分级管道之后,编码过程提取以地方到全球的方式呈现特征,而解码过程则在粗到直线、多分辨率的阶段产生特征和点云。我们证明SA-CNN能够进行广泛的应用,即分类、部分分解、重建、形状检索和不受监督的分类。SA-CNN在达到基准中最先进或可比的性能的同时,保持其模型复杂性的若干数量级比其他的要低。在质量结果方面,我们设想了多级点云的重建以及僵硬物体上的潜在行走道,以及可变式的非硬形人类和机器人模型。

相关内容

点云

关注 49

根据激光测量原理得到的点云，包括三维坐标（XYZ）和激光反射强度（Intensity）。根据摄影测量原理得到的点云，包括三维坐标（XYZ）和颜色信息（RGB）。结合激光测量和摄影测量原理得到点云，包括三维坐标（XYZ）、激光反射强度（Intensity）和颜色信息（RGB）。在获取物体表面每个采样点的空间坐标后，得到的是一个点的集合，称之为“点云”(Point Cloud)

【教程】深度学习Keras与TensorFlow教程，Deep Learning with Keras and Tensorflow in R

专知会员服务

32+阅读 · 2022年3月9日

计算机科学课程与视频课件合集，Computer Science courses with video lectures

专知会员服务

37+阅读 · 2022年1月24日

【Google】深度学习对抗鲁棒性，43页ppt

专知会员服务

45+阅读 · 2020年10月31日

Linux导论，Introduction to Linux，96页ppt

专知会员服务

81+阅读 · 2020年7月26日