进一步探索与混合行动空间一起深层多机构强化学习 (A further exploration of deep Multi-Agent Reinforcement Learning with Hybrid Action Space) - 专知论文

会员服务 ·

0

Learning · 强化学习 · 回合 · 确定性策略 · 深度强化学习 ·

2022 年 8 月 30 日

A further exploration of deep Multi-Agent Reinforcement Learning with Hybrid Action Space

翻译：进一步探索与混合行动空间一起深层多机构强化学习

Hongzhi Hua,Guixuan Wen,Kaigui Wu

from arxiv, arXiv admin note: substantial text overlap with arXiv:2206.05108

The research of extending deep reinforcement learning (drl) to multi-agent field has solved many complicated problems and made great achievements. However, almost all these studies only focus on discrete or continuous action space and there are few works having ever used multi-agent deep reinforcement learning to real-world environment problems which mostly have a hybrid action space. Therefore, in this paper, we propose two algorithms: deep multi-agent hybrid soft actor-critic (MAHSAC) and multi-agent hybrid deep deterministic policy gradients (MAHDDPG) to fill this gap. This two algorithms follow the centralized training and decentralized execution (CTDE) paradigm and could handle hybrid action space problems. Our experiences are running on multi-agent particle environment which is an easy multi-agent particle world, along with some basic simulated physics. The experimental results show that these algorithms have good performances.

翻译：将深度强化学习(drl)的研究扩大到多试剂领域,解决了许多复杂的问题并取得了巨大成就,然而,几乎所有这些研究都只侧重于离散或连续的行动空间,而且很少有工作曾将多剂深度强化学习用于现实世界环境问题,而现实世界环境问题大多具有混合行动空间。因此,我们在本文件中提出了两种算法:深多剂混合软体行为者-critic(MAHSAC)和多剂混合深海确定性政策梯度(MAHDDPG),以填补这一空白。这两种算法遵循集中培训和分散执行模式(CTDE),可以处理混合行动空间问题。我们的经验是在多剂粒子环境中运行的,这是一个容易的多剂粒子世界,以及一些基本的模拟物理学。实验结果表明,这些算法具有良好的性能。

0

相关内容

Learning

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

专知会员服务

78+阅读 · 2022年3月15日

深度学习优化算法，73页ppt，Optimization Algorithms on Deep Learning

深度学习优化算法，73页ppt，Optimization Algorithms on Deep Learning

专知会员服务

135+阅读 · 2021年6月16日

【牛津大学】深度残差强化学习，Deep Residual Reinforcement Learning

【牛津大学】深度残差强化学习，Deep Residual Reinforcement Learning

专知会员服务

84+阅读 · 2020年2月18日

【强化学习论文推荐集合】2019年必读的10篇TOP强化学习论文，My Top 10 Deep RL Papers of 2019

【强化学习论文推荐集合】2019年必读的10篇TOP强化学习论文，My Top 10 Deep RL Papers of 2019

专知会员服务

42+阅读 · 2020年1月15日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Stabilizing Transformers for Reinforcement Learning

Stabilizing Transformers for Reinforcement Learning

专知会员服务

60+阅读 · 2019年10月17日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

161+阅读 · 2019年10月12日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

IEEE ICKG 2022: Call for Papers

IEEE ICKG 2022: Call for Papers

机器学习与推荐算法

3+阅读 · 2022年3月30日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

IEEE TII Call For Papers

IEEE TII Call For Papers

CCF多媒体专委会

3+阅读 · 2022年3月24日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium6

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium6

中国图象图形学学会CSIG

2+阅读 · 2021年11月12日

强化学习三篇论文避免遗忘等

强化学习三篇论文避免遗忘等

CreateAMind

20+阅读 · 2019年5月24日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

强化学习族谱

强化学习族谱

CreateAMind

26+阅读 · 2017年8月2日

AlGaN极化场调控对内量子效率的影响

国家自然科学基金

1+阅读 · 2016年12月31日

应用膜蛋白纳米组装研究EGFR/HER2过表达致癌的分子机理与结构

国家自然科学基金

0+阅读 · 2014年12月31日

共轭高分子界面层的结构及其形成过程研究

国家自然科学基金

0+阅读 · 2013年12月31日

大变形结构无网格拓扑优化方法研究

国家自然科学基金

0+阅读 · 2013年12月31日

miR-146a靶向IRAK1与TRAF6调控非小细胞肺癌转移的机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

求解具有张量积结构系统的算法研究

国家自然科学基金

1+阅读 · 2012年12月31日

万寿菊花器官同源异型突变的转录组分析及相关基因功能研究

国家自然科学基金

0+阅读 · 2012年12月31日

同步辐射高分辨X射线发射谱方法及其在材料电子结构中应用

国家自然科学基金

0+阅读 · 2011年12月31日

C0-029诱导上皮间变促进肝癌侵袭转移的研究

国家自然科学基金

0+阅读 · 2011年12月31日

基于本体的Deep Web搜索技术

国家自然科学基金

2+阅读 · 2009年12月31日

Anchor-Changing Regularized Natural Policy Gradient for Multi-Objective Reinforcement Learning

Anchor-Changing Regularized Natural Policy Gradient for Multi-Objective Reinforcement Learning

Arxiv

0+阅读 · 2022年10月18日

RPM: Generalizable Behaviors for Multi-Agent Reinforcement Learning

Arxiv

0+阅读 · 2022年10月18日

Factored Adaptation for Non-Stationary Reinforcement Learning

Arxiv

0+阅读 · 2022年10月18日

Learning Control Admissibility Models with Graph Neural Networks for Multi-Agent Navigation

Arxiv

0+阅读 · 2022年10月17日

Distributional Reward Estimation for Effective Multi-Agent Deep Reinforcement Learning

Arxiv

0+阅读 · 2022年10月14日

A Scalable Finite Difference Method for Deep Reinforcement Learning

Arxiv

0+阅读 · 2022年10月14日

Harfang3D Dog-Fight Sandbox: A Reinforcement Learning Research Platform for the Customized Control Tasks of Fighter Aircrafts

Arxiv

1+阅读 · 2022年10月13日

Emergent Bartering Behaviour in Multi-Agent Reinforcement Learning

Emergent Bartering Behaviour in Multi-Agent Reinforcement Learning

Arxiv

19+阅读 · 2022年5月13日

Automated Reinforcement Learning (AutoRL): A Survey and Open Problems

Automated Reinforcement Learning (AutoRL): A Survey and Open Problems

Arxiv

33+阅读 · 2022年1月11日

A Multi-Objective Deep Reinforcement Learning Framework

A Multi-Objective Deep Reinforcement Learning Framework

Arxiv

16+阅读 · 2018年6月27日

VIP会员

文章信息

相关主题

确定性策略

深度强化学习

相关VIP内容

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

专知会员服务

78+阅读 · 2022年3月15日

深度学习优化算法，73页ppt，Optimization Algorithms on Deep Learning

深度学习优化算法，73页ppt，Optimization Algorithms on Deep Learning

专知会员服务

135+阅读 · 2021年6月16日

【牛津大学】深度残差强化学习，Deep Residual Reinforcement Learning

【牛津大学】深度残差强化学习，Deep Residual Reinforcement Learning

专知会员服务

84+阅读 · 2020年2月18日

【强化学习论文推荐集合】2019年必读的10篇TOP强化学习论文，My Top 10 Deep RL Papers of 2019

【强化学习论文推荐集合】2019年必读的10篇TOP强化学习论文，My Top 10 Deep RL Papers of 2019

专知会员服务

42+阅读 · 2020年1月15日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Stabilizing Transformers for Reinforcement Learning

Stabilizing Transformers for Reinforcement Learning

专知会员服务

60+阅读 · 2019年10月17日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

161+阅读 · 2019年10月12日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

《运用大语言模型支持空天防御系统工程项目》2025最新208页

《美空军转型：打造分布式空战力量以应对大国竞争》2025最新报告

消耗性无人机：认识战争演变中的技术特性与本质特征

《人体状态多模态推断·美陆军报告：风险环境下的认知追踪研究》2025最新100页

相关资讯

IEEE ICKG 2022: Call for Papers

IEEE ICKG 2022: Call for Papers

机器学习与推荐算法

3+阅读 · 2022年3月30日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

IEEE TII Call For Papers

IEEE TII Call For Papers

CCF多媒体专委会

3+阅读 · 2022年3月24日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium6

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium6

中国图象图形学学会CSIG

2+阅读 · 2021年11月12日

强化学习三篇论文避免遗忘等

强化学习三篇论文避免遗忘等

CreateAMind

20+阅读 · 2019年5月24日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

强化学习族谱

强化学习族谱

CreateAMind

26+阅读 · 2017年8月2日

相关论文

Anchor-Changing Regularized Natural Policy Gradient for Multi-Objective Reinforcement Learning

Anchor-Changing Regularized Natural Policy Gradient for Multi-Objective Reinforcement Learning

Arxiv

0+阅读 · 2022年10月18日

RPM: Generalizable Behaviors for Multi-Agent Reinforcement Learning

Arxiv

0+阅读 · 2022年10月18日

Factored Adaptation for Non-Stationary Reinforcement Learning

Arxiv

0+阅读 · 2022年10月18日

Learning Control Admissibility Models with Graph Neural Networks for Multi-Agent Navigation

Arxiv

0+阅读 · 2022年10月17日

Distributional Reward Estimation for Effective Multi-Agent Deep Reinforcement Learning

Arxiv

0+阅读 · 2022年10月14日

A Scalable Finite Difference Method for Deep Reinforcement Learning

Arxiv

0+阅读 · 2022年10月14日

Harfang3D Dog-Fight Sandbox: A Reinforcement Learning Research Platform for the Customized Control Tasks of Fighter Aircrafts

Arxiv

1+阅读 · 2022年10月13日

Emergent Bartering Behaviour in Multi-Agent Reinforcement Learning

Emergent Bartering Behaviour in Multi-Agent Reinforcement Learning

Arxiv

19+阅读 · 2022年5月13日

Automated Reinforcement Learning (AutoRL): A Survey and Open Problems

Automated Reinforcement Learning (AutoRL): A Survey and Open Problems

Arxiv

33+阅读 · 2022年1月11日

A Multi-Objective Deep Reinforcement Learning Framework

A Multi-Objective Deep Reinforcement Learning Framework

Arxiv

16+阅读 · 2018年6月27日

相关基金

AlGaN极化场调控对内量子效率的影响

国家自然科学基金

1+阅读 · 2016年12月31日

应用膜蛋白纳米组装研究EGFR/HER2过表达致癌的分子机理与结构

国家自然科学基金

0+阅读 · 2014年12月31日

共轭高分子界面层的结构及其形成过程研究

国家自然科学基金

0+阅读 · 2013年12月31日

大变形结构无网格拓扑优化方法研究

国家自然科学基金

0+阅读 · 2013年12月31日

miR-146a靶向IRAK1与TRAF6调控非小细胞肺癌转移的机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

求解具有张量积结构系统的算法研究

国家自然科学基金

1+阅读 · 2012年12月31日

万寿菊花器官同源异型突变的转录组分析及相关基因功能研究

国家自然科学基金

0+阅读 · 2012年12月31日

同步辐射高分辨X射线发射谱方法及其在材料电子结构中应用

国家自然科学基金

0+阅读 · 2011年12月31日

C0-029诱导上皮间变促进肝癌侵袭转移的研究

国家自然科学基金

0+阅读 · 2011年12月31日

基于本体的Deep Web搜索技术

国家自然科学基金

2+阅读 · 2009年12月31日

微信扫码咨询专知VIP会员