Many manipulation tasks can be naturally cast as a sequence of spatial relationships and constraints between objects. We aim to discover and scale these task-specific spatial relationships by representing manipulation tasks as operations over graphs. To do this, we pose manipulating a large, variable number of objects as a probabilistic classification problem over actions, objects and goals, learned using graph neural networks (GNNs). Our formulation first transforms the environment into a graph representation, then applies a trained GNN policy to predict which object to manipulate towards which goal state. Our GNN policies are trained using very few expert demonstrations on simple tasks, and exhibit generalization over number and configurations of objects in the environment and even to new, more complex tasks, while providing interpretable explanations for their decision-making. We present experiments which show that a single learned GNN policy can solve a variety of long-horizon blockstacking and rearrangement tasks.
翻译:许多操纵任务可以自然地成为物体之间空间关系和限制的序列。 我们的目标是通过将操纵任务作为图表上的操作来发现和扩展这些任务特有的空间关系。 为此,我们把操纵大量、可变的物体作为行动、物体和目标的概率分类问题,使用图形神经网络(GNNs)来学习。 我们的提法首先将环境转换成图示,然后应用经过培训的GNN政策来预测哪个目标是哪个目标状态。 我们的GNN政策在简单任务上几乎没有专家演示,而是在环境中对物体的数量和配置进行一般化,甚至展示新的、更复杂的任务,同时为它们的决策提供可解释的解释性解释性解释。 我们提出的实验表明,一项单一的GNN政策可以解决各种长相阻隔和重新安排任务。