3D morphable models are widely used for the shape representation of an object class in computer vision and graphics applications. In this work, we focus on deep 3D morphable models that directly apply deep learning on 3D mesh data with a hierarchical structure to capture information at multiple scales. While great efforts have been made to design the convolution operator, how to best aggregate vertex features across hierarchical levels deserves further attention. In contrast to resorting to mesh decimation, we propose an attention based module to learn mapping matrices for better feature aggregation across hierarchical levels. Specifically, the mapping matrices are generated by a compatibility function of the keys and queries. The keys and queries are trainable variables, learned by optimizing the target objective, and shared by all data samples of the same object class. Our proposed module can be used as a train-only drop-in replacement for the feature aggregation in existing architectures for both downsampling and upsampling. Our experiments show that through the end-to-end training of the mapping matrices, we achieve state-of-the-art results on a variety of 3D shape datasets in comparison to existing morphable models.
翻译:3D 可变模型被广泛用于计算机视觉和图形应用中对象类的形状表示。 在这项工作中,我们注重深3D可变模型,这些模型直接应用对三维网格数据的深层学习,并具有多级结构,以捕捉信息。虽然已作出巨大努力来设计组装操作员,但如何最佳地综合各等级的顶点特征值得进一步注意。与使用网状切除相比,我们提议了一个基于关注的模块,学习绘图矩阵,以更好地在各等级层次进行特征聚合。具体地说,绘图矩阵是由键和查询的兼容功能生成的。关键和查询是可训练的变量,通过优化目标目标目标目标而学习,由同一对象类的所有数据样本共享。我们提议的模块可以用作仅供火车的滴入式替换现有结构中用于下取样和上层的特征集合。我们的实验表明,通过对绘图矩阵的端到端培训,我们在与现有可变式模型比较的3D形状数据集上取得了最新结果。