In this paper, we revisit the classical representation of 3D point clouds as linear shape models. Our key insight is to leverage deep learning to represent a collection of shapes as affine transformations of low-dimensional linear shape models. Each linear model is characterized by a shape prototype, a low-dimensional shape basis and two neural networks. The networks take as input a point cloud and predict the coordinates of a shape in the linear basis and the affine transformation which best approximate the input. Both linear models and neural networks are learned end-to-end using a single reconstruction loss. The main advantage of our approach is that, in contrast to many recent deep approaches which learn feature-based complex shape representations, our model is explicit and every operation occurs in 3D space. As a result, our linear shape models can be easily visualized and annotated, and failure cases can be visually understood. While our main goal is to introduce a compact and interpretable representation of shape collections, we show it leads to state of the art results for few-shot segmentation.
翻译:在本文中, 我们重新审视了3D点云的经典表达方式, 作为线性形状模型。 我们的关键洞察力是利用深层次的学习来代表一系列形状, 作为低维线性形状模型的折叠式转换。 每个线性模型的特征是形状原型、 低维形状基础和两个神经网络。 这些网络输入点云, 并预测线性基础形状的坐标和最接近输入的线性变化。 线性模型和神经网络都是用单一的重建损失来学习端对端的。 我们方法的主要优势是, 与最近许多深层次的方法相比, 这些方法学习基于地貌的复杂形状模型, 我们的模型是清晰的, 并且每个操作都在3D空间进行。 结果, 我们的线性形状模型可以很容易被视觉化和附加说明, 失败案例可以被视觉理解。 我们的主要目标是引入一个缩略图和可解释的形状收藏的缩略图, 我们的主要目的就是通过微分块来显示艺术结果的状态 。