配有任务和运动规划反馈的政策指导懒惰搜索</s> (Policy-Guided Lazy Search with Feedback for Task and Motion Planning) - 专知论文

会员服务 ·

0

可约的 · Continuity · 样本 · Integration · 值域 ·

2023 年 3 月 11 日

Policy-Guided Lazy Search with Feedback for Task and Motion Planning

翻译：配有任务和运动规划反馈的政策指导懒惰搜索

Mohamed Khodeir,Atharv Sonwane,Ruthrash Hari,Florian Shkurti

from arxiv, Camera-Ready ICRA 2023

PDDLStream solvers have recently emerged as viable solutions for Task and Motion Planning (TAMP) problems, extending PDDL to problems with continuous action spaces. Prior work has shown how PDDLStream problems can be reduced to a sequence of PDDL planning problems, which can then be solved using off-the-shelf planners. However, this approach can suffer from long runtimes. In this paper we propose LAZY, a solver for PDDLStream problems that maintains a single integrated search over action skeletons, which gets progressively more geometrically informed, as samples of possible motions are lazily drawn during motion planning. We explore how learned models of goal-directed policies and current motion sampling data can be incorporated in LAZY to adaptively guide the task planner. We show that this leads to significant speed-ups in the search for a feasible solution evaluated over unseen test environments of varying numbers of objects, goals, and initial conditions. We evaluate our TAMP approach by comparing to existing solvers for PDDLStream problems on a range of simulated 7DoF rearrangement/manipulation problems.

翻译：PDDLStream 解答器最近成为任务和动作规划问题的可行解决办法,将PDDL扩大到连续行动空间的问题;先前的工作已经表明如何将PDDLStream 问题减为PDDDLStream 规划问题序列,然后使用现成的规划器加以解决;然而,这种办法可能长期存在。在本文件中,我们提议了PDDLStream 问题解答器LAZY,该解答器对行动骨骼进行单一的综合搜索,并逐渐得到几何学上的信息,因为可能动作的样本在运动规划期间是悬浮的。我们探讨了如何将目标导向政策和当前运动抽样数据的学习模型纳入LAZY,以适应性地指导任务规划器。我们表明,这导致在寻找一种可行的解决方案的过程中,对不同数量的对象、目标和初始条件的不可见的试验环境进行了评估。我们评估了我们的TAMP 方法,通过在模拟的7DoF重新布局/manipulturing 一系列问题上将现有的PDDLSream问题解答器进行比较。</s>

0

相关内容

可约的

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

专知会员服务

75+阅读 · 2022年6月28日

不可错过！UIUC最新《统计强化学习》课程！

专知会员服务

53+阅读 · 2020年9月7日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

165+阅读 · 2020年3月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

直播 | Interpretable and Trustworthy Graph Geometric Deep Learning

直播 | Interpretable and Trustworthy Graph Geometric Deep Learning

图与推荐

2+阅读 · 2022年11月2日

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

强化学习三篇论文避免遗忘等

强化学习三篇论文避免遗忘等

CreateAMind

20+阅读 · 2019年5月24日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

【论文】变分推断（Variational inference)的总结

【论文】变分推断（Variational inference)的总结

机器学习研究会

39+阅读 · 2017年11月16日

Cidea和Fsp27蛋白调控机体脂代谢的功能研究

国家自然科学基金

0+阅读 · 2017年12月31日

RNF43介导的AKT/MDM2和NEDL1途径调控肝癌细胞恶性行为的分子机制

国家自然科学基金

0+阅读 · 2015年12月31日

Schr？dinger-Poisson方程守恒DDG方法研究

国家自然科学基金

2+阅读 · 2015年12月31日

Hedgehog信号诱导神经前体细胞恶性转化中miRNA的作用及其机制

国家自然科学基金

0+阅读 · 2014年12月31日

miRNA介导的GA调控板栗雌花形成的生理及分子机制

国家自然科学基金

0+阅读 · 2012年12月31日

Nrf2/ARE调控的乙二醛酶1在糖尿病脑病防治中的作用及芒果苷的效应和机制

国家自然科学基金

0+阅读 · 2012年12月31日

Tandem型染敏太阳电池p型光阴极准一维微纳结构调控

国家自然科学基金

0+阅读 · 2012年12月31日

肿瘤细胞中凋亡抑制蛋白CFLAR乙酰化调控的分子机制

国家自然科学基金

0+阅读 · 2012年12月31日

MiR-27a/b靶向沉默ABCA1调控胆固醇逆向转运

国家自然科学基金

0+阅读 · 2011年12月31日

Curcumin双向调控HO-1/HO-2协同抑制Aβeme复合物防治AD的分子机制

国家自然科学基金

0+阅读 · 2009年12月31日

Autonomous search of real-life environments combining dynamical system-based path planning and unsupervised learning

Arxiv

0+阅读 · 2023年5月3日

Multimodal Procedural Planning via Dual Text-Image Prompting

Arxiv

0+阅读 · 2023年5月2日

Constraint Inference in Control Tasks from Expert Demonstrations via Inverse Optimization

Arxiv

0+阅读 · 2023年5月2日

Fast Path Planning Through Large Collections of Safe Boxes

Arxiv

0+阅读 · 2023年5月1日

Multi-Fidelity Data-Driven Design and Analysis of Reactor and Tube Simulations

Arxiv

0+阅读 · 2023年5月1日

Offline RL for Natural Language Generation with Implicit Language Q Learning

Arxiv

0+阅读 · 2023年5月1日

SAM-RL: Sensing-Aware Model-Based Reinforcement Learning via Differentiable Physics-Based Simulation and Rendering

Arxiv

0+阅读 · 2023年4月30日

Bi-AM-RRT*: A Fast and Efficient Sampling-Based Motion Planning Algorithm in Dynamic Environments

Arxiv

0+阅读 · 2023年4月30日

A Direct Sampling-Based Deep Learning Approach for Inverse Medium Scattering Problems

Arxiv

0+阅读 · 2023年4月29日

Distributed and Scalable Optimization for Robust Proton Treatment Planning

Arxiv

0+阅读 · 2023年4月27日

VIP会员

文章信息

相关主题

相关VIP内容

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

专知会员服务

75+阅读 · 2022年6月28日

不可错过！UIUC最新《统计强化学习》课程！

专知会员服务

53+阅读 · 2020年9月7日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

165+阅读 · 2020年3月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

《物联网（IoT）中的无人机通信高效控制》135页

《在GNSS信号降级环境中利用共识实现无人机集群稳健协调》

中程单向攻击无人机的战略意义：俄乌战争启示

《面向无人机集群的避障动态传感器覆盖算法》最新38页

相关资讯

直播 | Interpretable and Trustworthy Graph Geometric Deep Learning

直播 | Interpretable and Trustworthy Graph Geometric Deep Learning

图与推荐

2+阅读 · 2022年11月2日

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

强化学习三篇论文避免遗忘等

强化学习三篇论文避免遗忘等

CreateAMind

20+阅读 · 2019年5月24日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

【论文】变分推断（Variational inference)的总结

【论文】变分推断（Variational inference)的总结

机器学习研究会

39+阅读 · 2017年11月16日

相关论文

Autonomous search of real-life environments combining dynamical system-based path planning and unsupervised learning

Arxiv

0+阅读 · 2023年5月3日

Multimodal Procedural Planning via Dual Text-Image Prompting

Arxiv

0+阅读 · 2023年5月2日

Constraint Inference in Control Tasks from Expert Demonstrations via Inverse Optimization

Arxiv

0+阅读 · 2023年5月2日

Fast Path Planning Through Large Collections of Safe Boxes

Arxiv

0+阅读 · 2023年5月1日

Multi-Fidelity Data-Driven Design and Analysis of Reactor and Tube Simulations

Arxiv

0+阅读 · 2023年5月1日

Offline RL for Natural Language Generation with Implicit Language Q Learning

Arxiv

0+阅读 · 2023年5月1日

SAM-RL: Sensing-Aware Model-Based Reinforcement Learning via Differentiable Physics-Based Simulation and Rendering

Arxiv

0+阅读 · 2023年4月30日

Bi-AM-RRT*: A Fast and Efficient Sampling-Based Motion Planning Algorithm in Dynamic Environments

Arxiv

0+阅读 · 2023年4月30日

A Direct Sampling-Based Deep Learning Approach for Inverse Medium Scattering Problems

Arxiv

0+阅读 · 2023年4月29日

Distributed and Scalable Optimization for Robust Proton Treatment Planning

Arxiv

0+阅读 · 2023年4月27日

相关基金

Cidea和Fsp27蛋白调控机体脂代谢的功能研究

国家自然科学基金

0+阅读 · 2017年12月31日

RNF43介导的AKT/MDM2和NEDL1途径调控肝癌细胞恶性行为的分子机制

国家自然科学基金

0+阅读 · 2015年12月31日

Schr？dinger-Poisson方程守恒DDG方法研究

国家自然科学基金

2+阅读 · 2015年12月31日

Hedgehog信号诱导神经前体细胞恶性转化中miRNA的作用及其机制

国家自然科学基金

0+阅读 · 2014年12月31日

miRNA介导的GA调控板栗雌花形成的生理及分子机制

国家自然科学基金

0+阅读 · 2012年12月31日

Nrf2/ARE调控的乙二醛酶1在糖尿病脑病防治中的作用及芒果苷的效应和机制

国家自然科学基金

0+阅读 · 2012年12月31日

Tandem型染敏太阳电池p型光阴极准一维微纳结构调控

国家自然科学基金

0+阅读 · 2012年12月31日

肿瘤细胞中凋亡抑制蛋白CFLAR乙酰化调控的分子机制

国家自然科学基金

0+阅读 · 2012年12月31日

MiR-27a/b靶向沉默ABCA1调控胆固醇逆向转运

国家自然科学基金

0+阅读 · 2011年12月31日

Curcumin双向调控HO-1/HO-2协同抑制Aβeme复合物防治AD的分子机制

国家自然科学基金

0+阅读 · 2009年12月31日

微信扫码咨询专知VIP会员