复杂环境影响增强的在线规划 (Influence-Augmented Online Planning for Complex Environments) - 专知论文

会员服务 ·

0

回合 · Performer · 在线 · Principle · Less ·

2021 年 6 月 9 日

Influence-Augmented Online Planning for Complex Environments

翻译：复杂环境影响增强的在线规划

Jinke He,Miguel Suau,Frans A. Oliehoek

from arxiv, NeurIPS2020 - results have been updated after fixing minor bugs in the code

How can we plan efficiently in real time to control an agent in a complex environment that may involve many other agents? While existing sample-based planners have enjoyed empirical success in large POMDPs, their performance heavily relies on a fast simulator. However, real-world scenarios are complex in nature and their simulators are often computationally demanding, which severely limits the performance of online planners. In this work, we propose influence-augmented online planning, a principled method to transform a factored simulator of the entire environment into a local simulator that samples only the state variables that are most relevant to the observation and reward of the planning agent and captures the incoming influence from the rest of the environment using machine learning methods. Our main experimental results show that planning on this less accurate but much faster local simulator with POMCP leads to higher real-time planning performance than planning on the simulator that models the entire environment.

翻译：在可能涉及许多其他代理人的复杂环境中,我们如何能够实时有效地计划控制一个代理人?虽然现有的抽样规划者在大型POMDPs中取得了经验性的成功,但其性能在很大程度上依赖于快速模拟器。然而,现实世界的情景在性质上是复杂的,其模拟器往往在计算上要求很高,这严重限制了在线规划者的业绩。在这项工作中,我们建议采用影响力增强的在线规划,这是将整个环境的因子模拟器转换成一个当地模拟器的一种原则方法,该模拟器只对与规划者观察和奖励最相关的状态变量进行取样,并利用机器学习方法捕捉环境其余部分的影响。我们的主要实验结果显示,在这种不太准确但速度更快的地方模拟器上与POMCP一起进行规划会提高实时规划性,而不是对模拟整个环境的模拟器进行规划。

0

相关内容

【2020Manning新书】微型化Python项目，325页pdf，Tiny Python Projects

【2020Manning新书】微型化Python项目，325页pdf，Tiny Python Projects

专知会员服务

45+阅读 · 2020年8月18日

Linux导论，Introduction to Linux，96页ppt

Linux导论，Introduction to Linux，96页ppt

专知会员服务

81+阅读 · 2020年7月26日

【Manning新书】微服务安全实战，616页pdf，Microservices Security in Action

【Manning新书】微服务安全实战，616页pdf，Microservices Security in Action

专知会员服务

46+阅读 · 2020年7月22日

深度强化学习策略梯度教程，53页ppt

深度强化学习策略梯度教程，53页ppt

专知会员服务

184+阅读 · 2020年2月1日

视觉惯性SLAM综述

专知会员服务

87+阅读 · 2019年12月13日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

160+阅读 · 2019年10月12日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

【CVPR 2019 | tutorial】视觉识别Visual Recognition and Beyond，Facebook|Ross Girshick，Justin Johnson（李飞飞高徒）

【CVPR 2019 | tutorial】视觉识别Visual Recognition and Beyond，Facebook|Ross Girshick，Justin Johnson（李飞飞高徒）

专知会员服务

29+阅读 · 2019年6月16日

【资源】问答阅读理解资源列表

【资源】问答阅读理解资源列表

专知

3+阅读 · 2020年7月25日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

RL 真经

CreateAMind

5+阅读 · 2018年12月28日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

Hierarchical Disentangled Representations

Hierarchical Disentangled Representations

CreateAMind

4+阅读 · 2018年4月15日

计算机视觉近一年进展综述

计算机视觉近一年进展综述

机器学习研究会

9+阅读 · 2017年11月25日

【推荐】深度学习目标检测全面综述

【推荐】深度学习目标检测全面综述

机器学习研究会

21+阅读 · 2017年9月13日

强化学习族谱

强化学习族谱

CreateAMind

26+阅读 · 2017年8月2日

Leveraging Multiple Environments for Learning and Decision Making: a Dismantling Use Case

Arxiv

0+阅读 · 2021年8月3日

Augmented World Models Facilitate Zero-Shot Dynamics Generalization From a Single Offline Environment

Arxiv

0+阅读 · 2021年8月3日

Learning-based Preference Prediction for Constrained Multi-Criteria Path-Planning

Arxiv

0+阅读 · 2021年8月2日

PANTHER: Perception-Aware Trajectory Planner in Dynamic Environments

Arxiv

0+阅读 · 2021年8月2日

Multi-objective Conflict-based Search Using Safe-interval Path Planning

Arxiv

0+阅读 · 2021年8月2日

3D Reactive Control and Frontier-Based Exploration for Unstructured Environments

Arxiv

0+阅读 · 2021年8月1日

PILOT: Efficient Planning by Imitation Learning and Optimisation for Safe Autonomous Driving

Arxiv

0+阅读 · 2021年7月30日

Path Planning using Neural A* Search

Arxiv

5+阅读 · 2021年2月8日

Type-augmented Relation Prediction in Knowledge Graphs

Type-augmented Relation Prediction in Knowledge Graphs

Arxiv

6+阅读 · 2020年9月16日

IQA: Visual Question Answering in Interactive Environments

Arxiv

5+阅读 · 2018年4月5日

VIP会员

文章信息

相关主题

相关VIP内容

【2020Manning新书】微型化Python项目，325页pdf，Tiny Python Projects

【2020Manning新书】微型化Python项目，325页pdf，Tiny Python Projects

专知会员服务

45+阅读 · 2020年8月18日

Linux导论，Introduction to Linux，96页ppt

Linux导论，Introduction to Linux，96页ppt

专知会员服务

81+阅读 · 2020年7月26日

【Manning新书】微服务安全实战，616页pdf，Microservices Security in Action

【Manning新书】微服务安全实战，616页pdf，Microservices Security in Action

专知会员服务

46+阅读 · 2020年7月22日

深度强化学习策略梯度教程，53页ppt

深度强化学习策略梯度教程，53页ppt

专知会员服务

184+阅读 · 2020年2月1日

视觉惯性SLAM综述

专知会员服务

87+阅读 · 2019年12月13日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

160+阅读 · 2019年10月12日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

【CVPR 2019 | tutorial】视觉识别Visual Recognition and Beyond，Facebook|Ross Girshick，Justin Johnson（李飞飞高徒）

【CVPR 2019 | tutorial】视觉识别Visual Recognition and Beyond，Facebook|Ross Girshick，Justin Johnson（李飞飞高徒）

专知会员服务

29+阅读 · 2019年6月16日

热门VIP内容

开通专知VIP会员享更多权益服务

《人与智能体在系统工程建模语言V2任务中的性能表现：基于用户中心化的评估方法》308页

《数据安全国家标准体系（2025版）》征求意见稿

AlphaMosaic：人工智能赋能的作战管理系统

《军事行动中通信平台的战略价值：提升战术效能与作战优势》

相关资讯

【资源】问答阅读理解资源列表

【资源】问答阅读理解资源列表

专知

3+阅读 · 2020年7月25日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

RL 真经

CreateAMind

5+阅读 · 2018年12月28日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

Hierarchical Disentangled Representations

Hierarchical Disentangled Representations

CreateAMind

4+阅读 · 2018年4月15日

计算机视觉近一年进展综述

计算机视觉近一年进展综述

机器学习研究会

9+阅读 · 2017年11月25日

【推荐】深度学习目标检测全面综述

【推荐】深度学习目标检测全面综述

机器学习研究会

21+阅读 · 2017年9月13日

强化学习族谱

强化学习族谱

CreateAMind

26+阅读 · 2017年8月2日

相关论文

Leveraging Multiple Environments for Learning and Decision Making: a Dismantling Use Case

Arxiv

0+阅读 · 2021年8月3日

Augmented World Models Facilitate Zero-Shot Dynamics Generalization From a Single Offline Environment

Arxiv

0+阅读 · 2021年8月3日

Learning-based Preference Prediction for Constrained Multi-Criteria Path-Planning

Arxiv

0+阅读 · 2021年8月2日

PANTHER: Perception-Aware Trajectory Planner in Dynamic Environments

Arxiv

0+阅读 · 2021年8月2日

Multi-objective Conflict-based Search Using Safe-interval Path Planning

Arxiv

0+阅读 · 2021年8月2日

3D Reactive Control and Frontier-Based Exploration for Unstructured Environments

Arxiv

0+阅读 · 2021年8月1日

PILOT: Efficient Planning by Imitation Learning and Optimisation for Safe Autonomous Driving

Arxiv

0+阅读 · 2021年7月30日

Path Planning using Neural A* Search

Arxiv

5+阅读 · 2021年2月8日

Type-augmented Relation Prediction in Knowledge Graphs

Type-augmented Relation Prediction in Knowledge Graphs

Arxiv

6+阅读 · 2020年9月16日

IQA: Visual Question Answering in Interactive Environments

Arxiv

5+阅读 · 2018年4月5日

微信扫码咨询专知VIP会员