This paper presents a learning-based, lossless compression method for static point cloud geometry, based on context-adaptive arithmetic coding. Unlike most existing methods working in the octree domain, our encoder operates in a hybrid mode, mixing octree and voxel-based coding. We adaptively partition the point cloud into multi-resolution voxel blocks according to the point cloud structure, and use octree to signal the partitioning. On the one hand, octree representation can eliminate the sparsity in the point cloud. On the other hand, in the voxel domain, convolutions can be naturally expressed, and geometric information (i.e., planes, surfaces, etc.) is explicitly processed by a neural network. Our context model benefits from these properties and learns a probability distribution of the voxels using a deep convolutional neural network with masked filters, called VoxelDNN. Experiments show that our method outperforms the state-of-the-art MPEG G-PCC standard with average rate savings of 28% on a diverse set of point clouds from the Microsoft Voxelized Upper Bodies (MVUB) and MPEG. The implementation is available at https://github.com/Weafre/VoxelDNN.
翻译:本文介绍了基于上下文适应性算术编码的静点云度测量基于学习的无损压缩方法。 与奥克特里域大多数现行方法不同, 我们的编码器以混合模式运作, 混合奥克特里和 voxel 编码。 我们根据点云结构将点云分解成多解式 voxel 区块, 并使用 octree 信号分隔 。 一方面, octree 代表器可以消除点云中的松散。 另一方面, 在 voxel 域中, 可以自然表达混凝土, 测量信息( 即, 平面、 表面等) 由神经网络明确处理 。 我们的上下文模型从这些特性中获益, 并学习了通过使用隐藏过滤器的深层进化神经网络对 voxel 的概率分布 。 实验显示, 我们的方法超越了 MPEG- PCC 标准, 其平均速率储蓄率为 28% 。 在多元的 IMV/ IMVC 上, IMVQ/ Web 上可使用 。