Effective planning of long-horizon deformable object manipulation requires suitable abstractions at both the spatial and temporal levels. Previous methods typically either focus on short-horizon tasks or make strong assumptions that full-state information is available, which prevents their use on deformable objects. In this paper, we propose PlAnning with Spatial-Temporal Abstraction (PASTA), which incorporates both spatial abstraction (reasoning about objects and their relations to each other) and temporal abstraction (reasoning over skills instead of low-level actions). Our framework maps high-dimension 3D observations such as point clouds into a set of latent vectors and plans over skill sequences on top of the latent set representation. We show that our method can effectively perform challenging sequential deformable object manipulation tasks in the real world, which require combining multiple tool-use skills such as cutting with a knife, pushing with a pusher, and spreading the dough with a roller.
翻译:对长视距变形物体操纵的有效规划需要在空间和时间层面进行适当的抽象分析。 以往的方法通常侧重于短视距任务,或者对全状态信息的存在作出强烈的假设,从而防止对变形物体的使用。 在本文中,我们提议采用空间-时空抽象(PASTA)进行PlAnning,其中既包括空间抽象(对物体及其相互关系的考虑),也包括时间抽象(对技能的考虑,而不是对低级行动)。我们的框架绘制了高分界三维观测图,例如点云,将其分为一组潜伏矢量和计划,以覆盖在潜伏定形代表面上的技能序列。我们显示,我们的方法可以有效地在现实世界中执行连续变形物体操纵任务,这需要结合多种工具使用技能,例如用刀切,用推推推,用滚动法将大字袋传播。