We propose a deep autoencoder with graph topology inference and filtering to achieve compact representations of unorganized 3D point clouds in an unsupervised manner. Many previous works discretize 3D points to voxels and then use lattice-based methods to process and learn 3D spatial information; however, this leads to inevitable discretization errors. In this work, we handle raw 3D points without such compromise. The proposed networks follow the autoencoder framework with a focus on designing the decoder. The encoder adopts similar architectures as in PointNet. The decoder involves three novel modules. The folding module folds a canonical 2D lattice to the underlying surface of a 3D point cloud, achieving coarse reconstruction; the graph-topology-inference module learns a graph topology to represent pairwise relationships between 3D points, pushing the latent code to preserve both coordinates and pairwise relationships of points in 3D point clouds; and the graph-filtering module couples the above two modules, refining the coarse reconstruction through a learnt graph topology to obtain the final reconstruction. The proposed decoder leverages a learnable graph topology to push the codeword to preserve representative features and further improve the unsupervised-learning performance. We further provide theoretical analyses of the proposed architecture. In the experiments, we validate the proposed networks in three tasks, including 3D point cloud reconstruction, visualization, and transfer classification. The experimental results show that (1) the proposed networks outperform the state-of-the-art methods in various tasks; (2) a graph topology can be inferred as auxiliary information without specific supervision on graph topology inference; and (3) graph filtering refines the reconstruction, leading to better performances.
翻译:我们提出一个带有图示表层推断和过滤的深度自动编码器, 以不受监督的方式实现未组织的 3D 点云的缩放表达。 许多先前的工程将3D 点点分解成 voxel, 然后使用基于 lattice 的方法处理和学习 3D 空间信息; 但是, 这会导致不可避免的分解错误。 在这项工作中, 我们处理原始 3D 点的原始 3D 点。 拟议的网络遵循自动编码框架, 重点是设计解码器。 编码器采用了与PointNet 中类似的结构。 解码器包含三个新的模块。 折叠式模块将一个2D Lattice 的直径解解到 3D 点云云的底表层表面, 实现粗略重建; 图形- 图形- 图形- 图形- 图形化模块可以同时使用上述两个模块, 精细化的导性能分析, 通过一个经过学习的图表表层分析, 将数据转换到最终重建。