PDDLStream solvers have recently emerged as viable solutions for Task and Motion Planning (TAMP) problems, extending PDDL to problems with continuous action spaces. Prior work has shown how PDDLStream problems can be reduced to a sequence of PDDL planning problems, which can then be solved using off-the-shelf planners. However, this approach can suffer from long runtimes. In this paper we propose LAZY, a solver for PDDLStream problems that maintains a single integrated search over action skeletons, which gets progressively more geometrically informed, as samples of possible motions are lazily drawn during motion planning. We explore how learned models of goal-directed policies and current motion sampling data can be incorporated in LAZY to adaptively guide the task planner. We show that this leads to significant speed-ups in the search for a feasible solution evaluated over unseen test environments of varying numbers of objects, goals, and initial conditions. We evaluate our TAMP approach by comparing to existing solvers for PDDLStream problems on a range of simulated 7DoF rearrangement/manipulation problems.
翻译:PDDLStream 解答器最近成为任务和动作规划问题的可行解决办法,将PDDL扩大到连续行动空间的问题;先前的工作已经表明如何将PDDLStream 问题减为PDDDLStream 规划问题序列,然后使用现成的规划器加以解决;然而,这种办法可能长期存在。在本文件中,我们提议了PDDLStream 问题解答器LAZY,该解答器对行动骨骼进行单一的综合搜索,并逐渐得到几何学上的信息,因为可能动作的样本在运动规划期间是悬浮的。我们探讨了如何将目标导向政策和当前运动抽样数据的学习模型纳入LAZY,以适应性地指导任务规划器。我们表明,这导致在寻找一种可行的解决方案的过程中,对不同数量的对象、目标和初始条件的不可见的试验环境进行了评估。我们评估了我们的TAMP 方法,通过在模拟的7DoF重新布局/manipulturing 一系列问题上将现有的PDDLSream问题解答器进行比较。</s>