Reduce, Reuse, Recycle: Selective Reincarnation in Multi-Agent Reinforcement Learning - Zhuanzhi Papers

Agents · Multi-agent · Multi-agent reinforcement learning · Reinforcement learning · Heterogeneous systems

31 March 2023

Reduce, Reuse, Recycle: Selective Reincarnation in Multi-Agent Reinforcement Learning

Claude Formanek, Callum Rhys Tilbury, Jonathan Shock, Kale-ab Tessera, Arnu Pretorius

From arXiv. Accepted as an oral presentation at the Reincarnating Reinforcement Learning workshop at ICLR 2023.

'Reincarnation' in reinforcement learning has been proposed as a formalisation of reusing prior computation from past experiments when training an agent in an environment. In this paper, we present a brief foray into the paradigm of reincarnation in the multi-agent (MA) context. We consider the case where only some agents are reincarnated, whereas the others are trained from scratch -- selective reincarnation. In the fully-cooperative MA setting with heterogeneous agents, we demonstrate that selective reincarnation can lead to higher returns than training fully from scratch, and faster convergence than training with full reincarnation. However, the choice of which agents to reincarnate in a heterogeneous system is vitally important to the outcome of the training -- in fact, a poor choice can lead to considerably worse results than the alternatives. We argue that a rich field of work exists here, and we hope that our effort catalyses further energy in bringing the topic of reincarnation to the multi-agent realm.
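
To make the space of choices concrete, the following is a minimal sketch (not the authors' code) that enumerates every way of splitting a team into reincarnated agents and agents trained from scratch. The agent names and the helper functions `reincarnation_configs` and `label` are hypothetical and introduced only for illustration; the abstract does not specify how the authors set up their experiments.

```python
from itertools import combinations


def reincarnation_configs(agents):
    """Yield every (reincarnated, from_scratch) split of the agent list."""
    for k in range(len(agents) + 1):
        for chosen in combinations(agents, k):
            scratch = tuple(a for a in agents if a not in chosen)
            yield chosen, scratch


def label(chosen, agents):
    """Name the regime that a given choice of reincarnated agents falls under."""
    if not chosen:
        return "from scratch"
    if len(chosen) == len(agents):
        return "full reincarnation"
    return "selective reincarnation"


# Hypothetical three-agent heterogeneous team (names are assumptions).
agents = ["agent_0", "agent_1", "agent_2"]
configs = list(reincarnation_configs(agents))

for chosen, scratch in configs:
    print(label(chosen, agents), "| reincarnated:", chosen, "| from scratch:", scratch)
```

With n agents there are 2^n such splits, of which 2^n - 2 are selective. Because the agents are heterogeneous, two splits of the same size are not interchangeable, which is why the choice of which agents to reincarnate matters.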
