半弱监督下的物体运动预测 (Semi-Weakly Supervised Object Kinematic Motion Prediction)

Given a 3D object, kinematic motion prediction aims to identify the mobile parts as well as the corresponding motion parameters. Due to the large variations in both topological structure and geometric details of 3D objects, this remains a challenging task and the lack of large scale labeled data also constrain the performance of deep learning based approaches. In this paper, we tackle the task of object kinematic motion prediction problem in a semi-weakly supervised manner. Our key observations are two-fold. First, although 3D dataset with fully annotated motion labels is limited, there are existing datasets and methods for object part semantic segmentation at large scale. Second, semantic part segmentation and mobile part segmentation is not always consistent but it is possible to detect the mobile parts from the underlying 3D structure. Towards this end, we propose a graph neural network to learn the map between hierarchical part-level segmentation and mobile parts parameters, which are further refined based on geometric alignment. This network can be first trained on PartNet-Mobility dataset with fully labeled mobility information and then applied on PartNet dataset with fine-grained and hierarchical part-level segmentation. The network predictions yield a large scale of 3D objects with pseudo labeled mobility information and can further be used for weakly-supervised learning with pre-existing segmentation. Our experiments show there are significant performance boosts with the augmented data for previous method designed for kinematic motion prediction on 3D partial scans.

翻译：给定一个3D物体，运动学运动预测旨在识别移动部件以及相应的运动参数。由于3D物体的拓扑结构和几何细节的巨大变化，这仍然是一个具有挑战性的任务，缺乏大规模标记数据也限制了基于深度学习的方法的性能。在本文中，我们以半弱监督的方式解决物体运动预测问题。我们的关键观察有两点。首先，尽管具有完全注释的运动标签的3D数据集有限，但存在用于大规模物体部分语义分割的数据集和方法。其次，语义部分分割和移动部件分割并不总是一致的，但是可以从底层3D结构中检测到移动部件。为此，我们提出了图神经网络来学习分层部分级别分割和移动部件参数之间的映射，这些参数基于几何对齐进一步得到改进。该网络可以首先在PartNet-Mobility数据集上进行训练，该数据集具有完全标记的移动性信息，然后在具有细粒度和分层部分级别分割的PartNet数据集上应用。网络预测产生了大量3D物体，具有伪标记的移动性信息，并且可以进一步用于具有预先存在分割的弱监督学习。我们的实验表明，在之前设计用于3D局部扫描的运动学运动预测方法中，通过增强数据已经显著提高了表现。