Deformable Object Manipulation (DOM) is of significant importance to both daily and industrial applications. Recent successes in differentiable physics simulators allow learning algorithms to train a policy with analytic gradients through environment dynamics, which significantly facilitates the development of DOM algorithms. However, existing DOM benchmarks are either single-object-based or non-differentiable. This leaves the questions of 1) how a task-specific algorithm performs on other tasks and 2) how a differentiable-physics-based algorithm compares with the non-differentiable ones in general. In this work, we present DaXBench, a differentiable DOM benchmark with a wide object and task coverage. DaXBench includes 9 challenging high-fidelity simulated tasks, covering rope, cloth, and liquid manipulation with various difficulty levels. To better understand the performance of general algorithms on different DOM tasks, we conduct comprehensive experiments over representative DOM methods, ranging from planning to imitation learning and reinforcement learning. In addition, we provide careful empirical studies of existing decision-making algorithms based on differentiable physics, and discuss their limitations, as well as potential future directions.
翻译:可变对象操纵(DOM)对于日常和工业应用都具有重大意义。在不同的物理模拟器中,最近的成功使学习算法能够通过环境动态对分析梯度的政策进行培训,从而极大地促进了DOM算法的发展。然而,现有的DOM基准要么是单对象制的,要么是无差别的。这留下了以下几个问题:(1)任务特定算法如何执行其他任务;(2)基于不同物理的算法如何与一般非差别的算法相比较。在这项工作中,我们提出了DaXUBench,这是一个具有广泛对象和任务覆盖面的可变DOM基准。DaXUBench包括9项具有挑战性的高纤维度模拟任务,涵盖绳索、布料和液体操纵等不同程度的困难任务。为了更好地了解不同DOM任务的一般算法的运作情况,我们从规划到模仿学习和强化学习,对具有代表性的DOM方法进行了全面试验。此外,我们对基于不同目的物理学的现有决策算法进行了仔细的经验研究,并讨论了其局限性以及未来方向。