PDDLStream solvers have recently emerged as viable solutions for Task and Motion Planning (TAMP) problems, extending PDDL to problems with continuous action spaces. Prior work has shown how PDDLStream problems can be reduced to a sequence of PDDL planning problems, which can then be solved using off-the-shelf planners. However, this approach can suffer from long runtimes. In this paper we propose LAZY, a solver for PDDLStream problems that maintains a single integrated search over action skeletons, which gets progressively more geometrically informed as samples of possible motions are lazily drawn during motion planning. We explore how learned models of goal-directed policies and current motion sampling data can be incorporated in LAZY to adaptively guide the task planner. We show that this leads to significant speed-ups in the search for a feasible solution evaluated over unseen test environments of varying numbers of objects, goals, and initial conditions. We evaluate our TAMP approach by comparing to existing solvers for PDDLStream problems on a range of simulated 7DoF rearrangement/manipulation problems.
翻译:PDDLStream 解答器最近成为任务和动作规划问题的可行解决办法,将PDDL扩大到连续行动空间的问题;先前的工作已经表明如何将PDDLStream 问题减为PDDDLStream 规划问题序列,然后利用现成的规划人员解决这些问题;然而,这种办法可能长期存在。在本文件中,我们提议了PDDLStream 问题解答器LAZY,该解答器对行动骨骼进行单一的综合搜索,在运动规划期间,可能动作的样本是悬浮的,从而逐步得到几何学上的信息。我们探讨了如何将目标导向政策和当前运动抽样数据的学习模型纳入LAZY,以适应性地指导任务规划人员。我们表明,这导致在寻找一种可行的解决方案的过程中,对不同数量的对象、目标和初始条件的无形测试环境进行了评估。我们评估了我们的TAMP 方法,通过将模拟的7DoF重新定位/manipulting 问题的范围与现有的PDDLStream 问题解答器进行比较。