规划部分可视远地问题 (Probabilistic Inference in Planning for Partially Observable Long Horizon Problems) - 专知论文

会员服务 ·

0

Performer · 推断 · 信念传播 · Continuity · state-of-the-art ·

2021 年 10 月 18 日

Probabilistic Inference in Planning for Partially Observable Long Horizon Problems

翻译：规划部分可视远地问题

Alphonsus Adu-Bredu,Nikhil Devraj,Pin-Han Lin,Zhen Zeng,Odest Chadwicke Jenkins

from arxiv, International Conference on Intelligent Robots and Systems (IROS), 2021

For autonomous service robots to successfully perform long horizon tasks in the real world, they must act intelligently in partially observable environments. Most Task and Motion Planning approaches assume full observability of their state space, making them ineffective in stochastic and partially observable domains that reflect the uncertainties in the real world. We propose an online planning and execution approach for performing long horizon tasks in partially observable domains. Given the robot's belief and a plan skeleton composed of symbolic actions, our approach grounds each symbolic action by inferring continuous action parameters needed to execute the plan successfully. To achieve this, we formulate the problem of joint inference of action parameters as a Hybrid Constraint Satisfaction Problem (H-CSP) and solve the H-CSP using Belief Propagation. The robot executes the resulting parameterized actions, updates its belief of the world and replans when necessary. Our approach is able to efficiently solve partially observable tasks in a realistic kitchen simulation environment. Our approach outperformed an adaptation of the state-of-the-art method across our experiments.

翻译：自主服务机器人要想在现实世界中成功完成长期任务,就必须在部分可观测环境中明智地采取行动。大多数任务和运动规划方法都承担完全可观测到的状态空间,使其在反映现实世界不确定性的随机和部分可观测领域无效。我们提议了在部分可观测领域执行长期任务的在线规划和执行方法。鉴于机器人的信念和一个由象征性行动组成的计划骨架,我们的方法通过推断成功执行计划所需的连续行动参数,为每一项象征性行动提供了依据。为此,我们将行动参数的联合推断问题作为混合约束性满意度问题(H-CSP),并利用信仰促进解决H-CSP问题。机器人执行由此产生的参数化行动,更新其对世界的信念,并在必要时进行重新规划。我们的方法能够在现实的厨房模拟环境中有效解决部分可观察的任务。我们的方法超越了我们整个实验中最先进的方法的适应性。

0

相关内容

Performer

【伯克利-Pieter Abbeel】深度强化学习基础，附slides与视频

专知会员服务

29+阅读 · 2021年8月26日

深度概率图模型，Deep Probabilistic Models

专知会员服务

29+阅读 · 2021年8月2日

机器学习组合优化

机器学习组合优化

专知会员服务

110+阅读 · 2021年2月16日

深度强化学习策略梯度教程，53页ppt

深度强化学习策略梯度教程，53页ppt

专知会员服务

184+阅读 · 2020年2月1日

【微软Alekh等开放新书】强化学习理论与算法（Reinforcement Learning:Theory and Algorithms），附83页pdf

【微软Alekh等开放新书】强化学习理论与算法（Reinforcement Learning:Theory and Algorithms），附83页pdf

专知会员服务

121+阅读 · 2019年11月24日

【UAI 2019 Tutorials】可处理概率模型：表示、算法、学习和应用（Tractable Probabilistic Models: Representations, Algorithms, Learning, and Applications）

【UAI 2019 Tutorials】可处理概率模型：表示、算法、学习和应用（Tractable Probabilistic Models: Representations, Algorithms, Learning, and Applications）

专知会员服务

18+阅读 · 2019年11月16日

Risk Sensitive Portfolio Optimization with Regime-Switching and Default Contagion，香港理工大学应用数学系余翔助理教授，第八届全国社会媒体处理大会SMP2019

Risk Sensitive Portfolio Optimization with Regime-Switching and Default Contagion，香港理工大学应用数学系余翔助理教授，第八届全国社会媒体处理大会SMP2019

专知会员服务

10+阅读 · 2019年10月24日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

33+阅读 · 2019年10月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

【AAMSA 2019 | tutorial】多智能体系统中的认知推理Epistemic Reasoning In Multiagent Systems ,法国雷恩François Schwarzentruber

【AAMSA 2019 | tutorial】多智能体系统中的认知推理Epistemic Reasoning In Multiagent Systems ,法国雷恩François Schwarzentruber

专知会员服务

24+阅读 · 2019年5月14日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

逆强化学习-学习人先验的动机

逆强化学习-学习人先验的动机

CreateAMind

16+阅读 · 2019年1月18日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

【论文推荐】最新六篇强化学习相关论文—Sublinear、机器阅读理解、加速强化学习、对抗性奖励学习、人机交互

【论文推荐】最新六篇强化学习相关论文—Sublinear、机器阅读理解、加速强化学习、对抗性奖励学习、人机交互

专知

17+阅读 · 2018年4月28日

条件GAN重大改进！cGANs with Projection Discriminator

条件GAN重大改进！cGANs with Projection Discriminator

CreateAMind

8+阅读 · 2018年2月7日

Adversarial Variational Bayes: Unifying VAE and GAN 代码

Adversarial Variational Bayes: Unifying VAE and GAN 代码

CreateAMind

7+阅读 · 2017年10月4日

【学习】Hierarchical Softmax

【学习】Hierarchical Softmax

机器学习研究会

4+阅读 · 2017年8月6日

强化学习族谱

强化学习族谱

CreateAMind

26+阅读 · 2017年8月2日

强化学习 cartpole_a3c

强化学习 cartpole_a3c

CreateAMind

9+阅读 · 2017年7月21日

Interaction-Aware Trajectory Prediction and Planning for Autonomous Vehicles in Forced Merge Scenarios

Arxiv

0+阅读 · 2021年12月14日

Parameter Efficient Deep Probabilistic Forecasting

Arxiv

0+阅读 · 2021年12月14日

Sampling-Based Robust Control of Autonomous Systems with Non-Gaussian Noise

Arxiv

0+阅读 · 2021年12月13日

Multi-agent Soft Actor-Critic Based Hybrid Motion Planner for Mobile Robots

Arxiv

0+阅读 · 2021年12月13日

Online Information-Aware Motion Planning with Inertial Parameter Learning for Robotic Free-Flyers

Arxiv

0+阅读 · 2021年12月11日

Zero-Shot Uncertainty-Aware Deployment of Simulation Trained Policies on Real-World Robots

Arxiv

0+阅读 · 2021年12月10日

Imitation by Predicting Observations

Imitation by Predicting Observations

Arxiv

4+阅读 · 2021年7月8日

RNN with Particle Flow for Probabilistic Spatio-temporal Forecasting

Arxiv

5+阅读 · 2021年6月10日

Learning and Planning in Complex Action Spaces

Arxiv

4+阅读 · 2021年4月13日

The Search Problem in Mixture Models

Arxiv

3+阅读 · 2018年2月24日

VIP会员

文章信息

相关主题

state-of-the-art

相关VIP内容

【伯克利-Pieter Abbeel】深度强化学习基础，附slides与视频

专知会员服务

29+阅读 · 2021年8月26日

深度概率图模型，Deep Probabilistic Models

专知会员服务

29+阅读 · 2021年8月2日

机器学习组合优化

机器学习组合优化

专知会员服务

110+阅读 · 2021年2月16日

深度强化学习策略梯度教程，53页ppt

深度强化学习策略梯度教程，53页ppt

专知会员服务

184+阅读 · 2020年2月1日

【微软Alekh等开放新书】强化学习理论与算法（Reinforcement Learning:Theory and Algorithms），附83页pdf

【微软Alekh等开放新书】强化学习理论与算法（Reinforcement Learning:Theory and Algorithms），附83页pdf

专知会员服务

121+阅读 · 2019年11月24日

【UAI 2019 Tutorials】可处理概率模型：表示、算法、学习和应用（Tractable Probabilistic Models: Representations, Algorithms, Learning, and Applications）

【UAI 2019 Tutorials】可处理概率模型：表示、算法、学习和应用（Tractable Probabilistic Models: Representations, Algorithms, Learning, and Applications）

专知会员服务

18+阅读 · 2019年11月16日

Risk Sensitive Portfolio Optimization with Regime-Switching and Default Contagion，香港理工大学应用数学系余翔助理教授，第八届全国社会媒体处理大会SMP2019

Risk Sensitive Portfolio Optimization with Regime-Switching and Default Contagion，香港理工大学应用数学系余翔助理教授，第八届全国社会媒体处理大会SMP2019

专知会员服务

10+阅读 · 2019年10月24日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

33+阅读 · 2019年10月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

【AAMSA 2019 | tutorial】多智能体系统中的认知推理Epistemic Reasoning In Multiagent Systems ,法国雷恩François Schwarzentruber

【AAMSA 2019 | tutorial】多智能体系统中的认知推理Epistemic Reasoning In Multiagent Systems ,法国雷恩François Schwarzentruber

专知会员服务

24+阅读 · 2019年5月14日

热门VIP内容

开通专知VIP会员享更多权益服务

【CMU博士论文】迈向具有高维结果的可靠且稳健的因果推断

《美海军分布式海上作战（DMO）概念：最新情况》

Gemini 2.5：推动前沿，具备先进推理、多模态、长上下文及下一代智能体能力

【ICML2025教程】联想记忆的现代方法

相关资讯

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

逆强化学习-学习人先验的动机

逆强化学习-学习人先验的动机

CreateAMind

16+阅读 · 2019年1月18日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

【论文推荐】最新六篇强化学习相关论文—Sublinear、机器阅读理解、加速强化学习、对抗性奖励学习、人机交互

【论文推荐】最新六篇强化学习相关论文—Sublinear、机器阅读理解、加速强化学习、对抗性奖励学习、人机交互

专知

17+阅读 · 2018年4月28日

条件GAN重大改进！cGANs with Projection Discriminator

条件GAN重大改进！cGANs with Projection Discriminator

CreateAMind

8+阅读 · 2018年2月7日

Adversarial Variational Bayes: Unifying VAE and GAN 代码

Adversarial Variational Bayes: Unifying VAE and GAN 代码

CreateAMind

7+阅读 · 2017年10月4日

【学习】Hierarchical Softmax

【学习】Hierarchical Softmax

机器学习研究会

4+阅读 · 2017年8月6日

强化学习族谱

强化学习族谱

CreateAMind

26+阅读 · 2017年8月2日

强化学习 cartpole_a3c

强化学习 cartpole_a3c

CreateAMind

9+阅读 · 2017年7月21日

相关论文

Interaction-Aware Trajectory Prediction and Planning for Autonomous Vehicles in Forced Merge Scenarios

Arxiv

0+阅读 · 2021年12月14日

Parameter Efficient Deep Probabilistic Forecasting

Arxiv

0+阅读 · 2021年12月14日

Sampling-Based Robust Control of Autonomous Systems with Non-Gaussian Noise

Arxiv

0+阅读 · 2021年12月13日

Multi-agent Soft Actor-Critic Based Hybrid Motion Planner for Mobile Robots

Arxiv

0+阅读 · 2021年12月13日

Online Information-Aware Motion Planning with Inertial Parameter Learning for Robotic Free-Flyers

Arxiv

0+阅读 · 2021年12月11日

Zero-Shot Uncertainty-Aware Deployment of Simulation Trained Policies on Real-World Robots

Arxiv

0+阅读 · 2021年12月10日

Imitation by Predicting Observations

Imitation by Predicting Observations

Arxiv

4+阅读 · 2021年7月8日

RNN with Particle Flow for Probabilistic Spatio-temporal Forecasting

Arxiv

5+阅读 · 2021年6月10日

Learning and Planning in Complex Action Spaces

Arxiv

4+阅读 · 2021年4月13日

The Search Problem in Mixture Models

Arxiv

3+阅读 · 2018年2月24日

微信扫码咨询专知VIP会员