We present the extension of the Remember and Forget for Experience Replay (ReF-ER) algorithm to Multi-Agent Reinforcement Learning (MARL). ReF-ER was shown to outperform state-of-the-art algorithms for continuous control in problems ranging from the OpenAI Gym to complex fluid flows. In MARL, the dependencies between the agents are included in the state-value estimator, and the environment dynamics are modeled via the importance weights used by ReF-ER. In collaborative environments, we find the best performance when the value is estimated using individual rewards and the effects of other agents' actions on the transition map are ignored. We benchmark the performance of ReF-ER MARL on the Stanford Intelligent Systems Laboratory (SISL) environments. We find that employing a single feed-forward neural network for the policy and the value function in ReF-ER MARL outperforms state-of-the-art algorithms that rely on complex neural network architectures.
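To make the importance-weight mechanism mentioned above more concrete, the following is a minimal sketch, not the authors' implementation, of how ReF-ER-style importance weights could gate replayed experiences per agent. The threshold `c_max`, the helper names, and the per-agent factorization (each agent using only its own action's weight, consistent with ignoring other agents' effects on the transition map) are illustrative assumptions.

```python
import numpy as np

def importance_weight(logp_new: float, logp_behavior: float) -> float:
    """Per-agent importance weight rho = pi(a|s) / mu(a|s), where mu is the
    behavior policy stored with the replayed sample (assumed log-probabilities)."""
    return float(np.exp(logp_new - logp_behavior))

def is_near_policy(rho: float, c_max: float = 4.0) -> bool:
    """ReF-ER-style gate: a replayed sample counts as 'near-policy' if its
    importance weight lies within [1/c_max, c_max]; far-policy samples are
    excluded from the policy/value updates (the 'Forget' part)."""
    return (1.0 / c_max) < rho < c_max

# Illustrative multi-agent usage: each agent is gated by its own weight only.
agents_logps = [(-1.2, -1.0), (-0.3, -0.9)]  # (log pi, log mu) per agent
for i, (logp_new, logp_old) in enumerate(agents_logps):
    rho = importance_weight(logp_new, logp_old)
    print(f"agent {i}: rho={rho:.3f}, near-policy={is_near_policy(rho)}")
```

In this sketch, the decentralized gating mirrors the collaborative setting described in the abstract, where each agent's value is estimated from its individual reward.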