基于学习的三维点云的无损压缩 (Learning-based lossless compression of 3D point cloud geometry)

This paper presents a learning-based, lossless compression method for static point cloud geometry, based on context-adaptive arithmetic coding. Unlike most existing methods working in the octree domain, our encoder operates in a hybrid mode, mixing octree and voxel-based coding. We adaptively partition the point cloud into multi-resolution voxel blocks according to the point cloud structure, and use octree to signal the partitioning. On the one hand, octree representation can eliminate the sparsity in the point cloud. On the other hand, in the voxel domain, convolutions can be naturally expressed, and geometric information (i.e., planes, surfaces, etc.) is explicitly processed by a neural network. Our context model benefits from these properties and learns a probability distribution of the voxels using a deep convolutional neural network with masked filters, called VoxelDNN. Experiments show that our method outperforms the state-of-the-art MPEG G-PCC standard with average rate savings of 28% on a diverse set of point clouds from the Microsoft Voxelized Upper Bodies (MVUB) and MPEG. The implementation is available at https://github.com/Weafre/VoxelDNN.

翻译：本文介绍了基于上下文适应性算术编码的静点云度测量基于学习的无损压缩方法。与奥克特里域大多数现行方法不同, 我们的编码器以混合模式运作, 混合奥克特里和 voxel 编码。我们根据点云结构将点云分解成多解式 voxel 区块, 并使用 octree 信号分隔。一方面, octree 代表器可以消除点云中的松散。另一方面, 在 voxel 域中, 可以自然表达混凝土, 测量信息( 即, 平面、表面等) 由神经网络明确处理。我们的上下文模型从这些特性中获益, 并学习了通过使用隐藏过滤器的深层进化神经网络对 voxel 的概率分布。实验显示, 我们的方法超越了 MPEG- PCC 标准, 其平均速率储蓄率为 28% 。在多元的 IMV/ IMVC 上, IMVQ/ Web 上可使用。

相关内容

点云

关注 48

根据激光测量原理得到的点云，包括三维坐标（XYZ）和激光反射强度（Intensity）。根据摄影测量原理得到的点云，包括三维坐标（XYZ）和颜色信息（RGB）。结合激光测量和摄影测量原理得到点云，包括三维坐标（XYZ）、激光反射强度（Intensity）和颜色信息（RGB）。在获取物体表面每个采样点的空间坐标后，得到的是一个点的集合，称之为“点云”(Point Cloud)

【MIT】自监督几何感知，22页ppt，Self-supervised Geometric Perception

专知会员服务

23+阅读 · 2021年6月3日

【图与几何深度学习】Graph and geometric deep learning，49页ppt

专知会员服务

65+阅读 · 2021年4月24日