Enabling robots to perform complex dynamic tasks such as picking up an object in one sweeping motion or pushing off a wall to quickly turn a corner is a challenging problem. The dynamic interactions implicit in these tasks are critical towards the successful execution of such tasks. Graph neural networks (GNNs) provide a principled way of learning the dynamics of interactive systems but can suffer from scaling issues as the number of interactions increases. Furthermore, the problem of using learned GNN-based models for optimal control is insufficiently explored. In this work, we present a method for efficiently learning the dynamics of interacting systems by simultaneously learning a dynamic graph structure and a stable and locally linear forward model of the system. The dynamic graph structure encodes evolving contact modes along a trajectory by making probabilistic predictions over the edges of the graph. Additionally, we introduce a temporal dependence in the learned graph structure which allows us to incorporate contact measurement updates during execution thus enabling more accurate forward predictions. The learned stable and locally linear dynamics enable the use of optimal control algorithms such as iLQR for long-horizon planning and control for complex interactive tasks. Through experiments in simulation and in the real world, we evaluate the performance of our method by using the learned interaction dynamics for control and demonstrate generalization to more objects and interactions not seen during training. We introduce a control scheme that takes advantage of contact measurement updates and hence is robust to prediction inaccuracies during execution.
翻译:使机器人能够执行复杂的动态任务,例如在一个整流运动中抓取一个物体,或推动一个墙壁以迅速翻转一个角落,这是一个具有挑战性的问题。这些任务中隐含的动态互动关系对于成功执行这些任务至关重要。图表神经网络(GNNs)为学习互动系统的动态提供了一种原则性方法,但随着互动次数的增加,可能会因规模问题而受到影响。此外,使用基于GNN的先进模型实现最佳控制的问题没有得到充分探讨。在这项工作中,我们提出了一个高效学习互动系统动态图结构的方法,同时学习一个稳定和本地直线的系统前方模型。动态图表结构通过在图的边缘进行概率预测,将沿轨迹不断演变的接触模式编码起来。此外,我们引入了一种时间依赖性的图表结构,使我们能够在执行期间纳入接触量更新,从而能够更准确地作出前瞻性预测。所学的稳定和本地线性动态使得能够使用像 iLQR这样的最佳控制算法,用于长期对复杂的互动任务进行规划和控制。通过模拟实验和在现实世界中进行更多的互动优势,我们用在所学到的动态分析时,我们所学到的动态分析方法来评估。