利用自我改进模拟器进行POMDPs的在线规划 (Online Planning in POMDPs with Self-Improving Simulators) - 专知论文

会员服务 ·

0

近似 · 模型评估 · 回合 · 原点 · Integration ·

2022 年 1 月 27 日

Online Planning in POMDPs with Self-Improving Simulators

翻译：利用自我改进模拟器进行POMDPs的在线规划

Jinke He,Miguel Suau,Hendrik Baier,Michael Kaisers,Frans A. Oliehoek

How can we plan efficiently in a large and complex environment when the time budget is limited? Given the original simulator of the environment, which may be computationally very demanding, we propose to learn online an approximate but much faster simulator that improves over time. To plan reliably and efficiently while the approximate simulator is learning, we develop a method that adaptively decides which simulator to use for every simulation, based on a statistic that measures the accuracy of the approximate simulator. This allows us to use the approximate simulator to replace the original simulator for faster simulations when it is accurate enough under the current context, thus trading off simulation speed and accuracy. Experimental results in two large domains show that when integrated with POMCP, our approach allows to plan with improving efficiency over time.

翻译：当时间预算有限时,我们如何在大而复杂的环境中有效规划?鉴于最初的环境模拟器,其计算要求可能很高,我们提议在网上学习一个近似但更快的模拟器,随着时间的推移不断改进。在大约模拟器正在学习的同时,为了可靠和高效地规划,我们开发了一种方法,根据测量近似模拟器准确性的统计数据,在每次模拟中使用哪种模拟器时,以适应性的方式决定该模拟器。这使我们能够使用近似模拟器取代原始模拟器,以更快地进行模拟,而在当前情况下它足够准确,从而交换模拟速度和准确性。两个大领域的实验结果显示,与POMCP相结合时,我们的方法允许在与POMCP相结合时,以提高效率的方式进行规划。

0

相关内容

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

专知会员服务

78+阅读 · 2022年3月15日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

166+阅读 · 2020年3月18日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

96+阅读 · 2020年3月12日

社交网络上议题社群的公共焦虑研究，中国人民大学新闻学院塔娜讲师，第八届全国社会媒体处理大会SMP2019

社交网络上议题社群的公共焦虑研究，中国人民大学新闻学院塔娜讲师，第八届全国社会媒体处理大会SMP2019

专知会员服务

15+阅读 · 2019年10月23日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

160+阅读 · 2019年10月12日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

IEEE ICKG 2022: Call for Papers

IEEE ICKG 2022: Call for Papers

机器学习与推荐算法

3+阅读 · 2022年3月30日

IEEE TII Call For Papers

IEEE TII Call For Papers

CCF多媒体专委会

3+阅读 · 2022年3月24日

【ICIG2021】Latest News & Announcements of the Workshop

【ICIG2021】Latest News & Announcements of the Workshop

中国图象图形学学会CSIG

0+阅读 · 2021年12月20日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium6

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium6

中国图象图形学学会CSIG

2+阅读 · 2021年11月12日

【ICIG2021】Latest News & Announcements of the Plenary Talk2

【ICIG2021】Latest News & Announcements of the Plenary Talk2

中国图象图形学学会CSIG

0+阅读 · 2021年11月2日

【ICIG2021】Latest News & Announcements of the Plenary Talk1

【ICIG2021】Latest News & Announcements of the Plenary Talk1

中国图象图形学学会CSIG

0+阅读 · 2021年11月1日

强化学习三篇论文避免遗忘等

强化学习三篇论文避免遗忘等

CreateAMind

20+阅读 · 2019年5月24日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

以PqsR为靶点筛选铜绿假单胞菌群体感应调控抑制剂及联合用药研究

国家自然科学基金

0+阅读 · 2015年12月31日

腺病毒介导的miRNA干扰策略抗MERS-CoV的研究

国家自然科学基金

0+阅读 · 2013年12月31日

褐家鼠群体MHC遗传多态性与SEOV感染相关性的研究

国家自然科学基金

0+阅读 · 2013年12月31日

BAFF干扰的树突状细胞参与自身免疫性关节炎免疫耐受的作用和机制

国家自然科学基金

0+阅读 · 2012年12月31日

从中性粒细胞自释DNA经TLR9激活自身免疫探讨解毒祛瘀滋肾法对系统性红斑狼疮的作用机理

国家自然科学基金

0+阅读 · 2012年12月31日

单分子自旋电子器件输运特性的理论表征和调控

国家自然科学基金

0+阅读 · 2012年12月31日

小麦抗麦红吸浆虫QTL定位与关联分析

国家自然科学基金

0+阅读 · 2012年12月31日

抗体阻断效应在恒河猴感染日本血吸虫自愈中的作用研究

国家自然科学基金

0+阅读 · 2011年12月31日

Tim-3/Tim-3L信号通路在幽门螺杆菌感染免疫致病和免疫防治中的作用

国家自然科学基金

0+阅读 · 2009年12月31日

幽门螺杆菌益生菌型口服疫苗的研制

国家自然科学基金

0+阅读 · 2008年12月31日

MooAFEM: An object oriented Matlab code for higher-order (nonlinear) adaptive FEM

Arxiv

0+阅读 · 2022年4月20日

Differentiable Collision Avoidance Using Collision Primitives

Arxiv

0+阅读 · 2022年4月20日

Learning to Retrieve Relevant Experiences for Motion Planning

Arxiv

0+阅读 · 2022年4月18日

Imitating, Fast and Slow: Robust learning from demonstrations via decision-time planning

Arxiv

0+阅读 · 2022年4月18日

Configuration-Aware Safe Control for Mobile Robotic Arm with Control Barrier Functions

Configuration-Aware Safe Control for Mobile Robotic Arm with Control Barrier Functions

Arxiv

1+阅读 · 2022年4月18日

M-Estimation based on quasi-processes from discrete samples of Levy processes

Arxiv

0+阅读 · 2022年4月18日

Augmentation Invariance and Adaptive Sampling in Semantic Segmentation of Agricultural Aerial Images

Arxiv

0+阅读 · 2022年4月17日

Improving Frame-Online Neural Speech Enhancement with Overlapped-Frame Prediction

Arxiv

0+阅读 · 2022年4月15日

Prefix-Free Coding for LQG Control

Prefix-Free Coding for LQG Control

Arxiv

0+阅读 · 2022年4月15日

PL-VINS: Real-Time Monocular Visual-Inertial SLAM with Point and Line Features

Arxiv

1+阅读 · 2022年4月15日

VIP会员

文章信息

相关主题

相关VIP内容

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

专知会员服务

78+阅读 · 2022年3月15日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

166+阅读 · 2020年3月18日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

96+阅读 · 2020年3月12日

社交网络上议题社群的公共焦虑研究，中国人民大学新闻学院塔娜讲师，第八届全国社会媒体处理大会SMP2019

社交网络上议题社群的公共焦虑研究，中国人民大学新闻学院塔娜讲师，第八届全国社会媒体处理大会SMP2019

专知会员服务

15+阅读 · 2019年10月23日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

160+阅读 · 2019年10月12日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

热门VIP内容

开通专知VIP会员享更多权益服务

【NeurIPS 2025】视觉指令瓶颈微调

什么是模块化开放系统方法（MOSA）？从美陆军新型倾转旋翼机视角解读

【牛津博士论文】面向视觉、物理与语言应用的可信机器学习模型

医学领域大型语言模型的新进展

相关资讯

IEEE ICKG 2022: Call for Papers

IEEE ICKG 2022: Call for Papers

机器学习与推荐算法

3+阅读 · 2022年3月30日

IEEE TII Call For Papers

IEEE TII Call For Papers

CCF多媒体专委会

3+阅读 · 2022年3月24日

【ICIG2021】Latest News & Announcements of the Workshop

【ICIG2021】Latest News & Announcements of the Workshop

中国图象图形学学会CSIG

0+阅读 · 2021年12月20日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium6

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium6

中国图象图形学学会CSIG

2+阅读 · 2021年11月12日

【ICIG2021】Latest News & Announcements of the Plenary Talk2

【ICIG2021】Latest News & Announcements of the Plenary Talk2

中国图象图形学学会CSIG

0+阅读 · 2021年11月2日

【ICIG2021】Latest News & Announcements of the Plenary Talk1

【ICIG2021】Latest News & Announcements of the Plenary Talk1

中国图象图形学学会CSIG

0+阅读 · 2021年11月1日

强化学习三篇论文避免遗忘等

强化学习三篇论文避免遗忘等

CreateAMind

20+阅读 · 2019年5月24日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

相关论文

MooAFEM: An object oriented Matlab code for higher-order (nonlinear) adaptive FEM

Arxiv

0+阅读 · 2022年4月20日

Differentiable Collision Avoidance Using Collision Primitives

Arxiv

0+阅读 · 2022年4月20日

Learning to Retrieve Relevant Experiences for Motion Planning

Arxiv

0+阅读 · 2022年4月18日

Imitating, Fast and Slow: Robust learning from demonstrations via decision-time planning

Arxiv

0+阅读 · 2022年4月18日

Configuration-Aware Safe Control for Mobile Robotic Arm with Control Barrier Functions

Configuration-Aware Safe Control for Mobile Robotic Arm with Control Barrier Functions

Arxiv

1+阅读 · 2022年4月18日

M-Estimation based on quasi-processes from discrete samples of Levy processes

Arxiv

0+阅读 · 2022年4月18日

Augmentation Invariance and Adaptive Sampling in Semantic Segmentation of Agricultural Aerial Images

Arxiv

0+阅读 · 2022年4月17日

Improving Frame-Online Neural Speech Enhancement with Overlapped-Frame Prediction

Arxiv

0+阅读 · 2022年4月15日

Prefix-Free Coding for LQG Control

Prefix-Free Coding for LQG Control

Arxiv

0+阅读 · 2022年4月15日

PL-VINS: Real-Time Monocular Visual-Inertial SLAM with Point and Line Features

Arxiv

1+阅读 · 2022年4月15日

相关基金

以PqsR为靶点筛选铜绿假单胞菌群体感应调控抑制剂及联合用药研究

国家自然科学基金

0+阅读 · 2015年12月31日

腺病毒介导的miRNA干扰策略抗MERS-CoV的研究

国家自然科学基金

0+阅读 · 2013年12月31日

褐家鼠群体MHC遗传多态性与SEOV感染相关性的研究

国家自然科学基金

0+阅读 · 2013年12月31日

BAFF干扰的树突状细胞参与自身免疫性关节炎免疫耐受的作用和机制

国家自然科学基金

0+阅读 · 2012年12月31日

从中性粒细胞自释DNA经TLR9激活自身免疫探讨解毒祛瘀滋肾法对系统性红斑狼疮的作用机理

国家自然科学基金

0+阅读 · 2012年12月31日

单分子自旋电子器件输运特性的理论表征和调控

国家自然科学基金

0+阅读 · 2012年12月31日

小麦抗麦红吸浆虫QTL定位与关联分析

国家自然科学基金

0+阅读 · 2012年12月31日

抗体阻断效应在恒河猴感染日本血吸虫自愈中的作用研究

国家自然科学基金

0+阅读 · 2011年12月31日

Tim-3/Tim-3L信号通路在幽门螺杆菌感染免疫致病和免疫防治中的作用

国家自然科学基金

0+阅读 · 2009年12月31日

幽门螺杆菌益生菌型口服疫苗的研制

国家自然科学基金

0+阅读 · 2008年12月31日

微信扫码咨询专知VIP会员