使用进化游戏理论来查找多代理程序路径 (Multi Agent Path Finding using Evolutionary Game Theory) - 专知论文

会员服务 ·

0

Agent · 路径 · 博弈论 · 情景 · Performer ·

2022 年 12 月 5 日

Multi Agent Path Finding using Evolutionary Game Theory

翻译：使用进化游戏理论来查找多代理程序路径

Sheryl Paul,Jyotirmoy V. Deshmukh

In this paper, we consider the problem of path finding for a set of homogeneous and autonomous agents navigating a previously unknown stochastic environment. In our problem setting, each agent attempts to maximize a given utility function while respecting safety properties. Our solution is based on ideas from evolutionary game theory, namely replicating policies that perform well and diminishing ones that do not. We do a comprehensive comparison with related multiagent planning methods, and show that our technique beats state of the art RL algorithms in minimizing path length by nearly 30% in large spaces. We show that our algorithm is computationally faster than deep RL methods by at least an order of magnitude. We also show that it scales better with an increase in the number of agents as compared to other methods, path planning methods in particular. Lastly, we empirically prove that the policies that we learn are evolutionarily stable and thus impervious to invasion by any other policy.

翻译：在本文中,我们考虑了寻找一组在以往未知的随机环境中航行的同质和自主的代理商的路径问题。在我们的问题环境中, 每个代理商都试图在尊重安全特性的同时最大限度地增加一个特定的实用功能。我们的解决方案是基于进化游戏理论的理念, 即复制运作良好的政策, 并减少不起作用的政策。我们与相关的多试剂规划方法进行全面比较, 并表明我们的技术在将大空间的路径长度减少近30%方面胜过最先进的RL算法。我们显示我们的算法比深RL方法的计算速度要快得多, 至少要一个数量级。我们还表明它比其他方法, 特别是路径规划方法, 还要用更多的代理商数量来衡量得更好。最后, 我们从经验上证明我们学到的政策是进化稳定的, 因而不会受到任何其他政策的入侵。

0

相关内容

Agent

【2022新书】强化学习工业应用，408页pdf

【2022新书】强化学习工业应用，408页pdf

专知会员服务

231+阅读 · 2022年2月3日

【干货书】机器学习速查手册，135页pdf

【干货书】机器学习速查手册，135页pdf

专知会员服务

127+阅读 · 2020年11月20日

不可错过！UIUC最新《统计强化学习》课程！

专知会员服务

53+阅读 · 2020年9月7日

【新书】人工智能Python代码，227页pdf，Python code for Artificial Intelligence: Foundations of Computational Agents

【新书】人工智能Python代码，227页pdf，Python code for Artificial Intelligence: Foundations of Computational Agents

专知会员服务

102+阅读 · 2020年6月21日

【新书】数字图像(影像)处理手第二版，2176pdf，Mathematical Methods in Imaging

【新书】数字图像(影像)处理手第二版，2176pdf，Mathematical Methods in Imaging

专知会员服务

93+阅读 · 2020年2月12日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

160+阅读 · 2019年10月12日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【ICIG2021】Latest News & Announcements of the Workshop

【ICIG2021】Latest News & Announcements of the Workshop

中国图象图形学学会CSIG

0+阅读 · 2021年12月20日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium9

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium9

中国图象图形学学会CSIG

0+阅读 · 2021年12月17日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium4

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium4

中国图象图形学学会CSIG

0+阅读 · 2021年11月10日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium1

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium1

中国图象图形学学会CSIG

0+阅读 · 2021年11月3日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

掺杂的稀土氧化物非晶态纳米管可控制备及其热电性能研究

国家自然科学基金

0+阅读 · 2014年12月31日

Anderson型多酸的不对称修饰及可控组装研究

国家自然科学基金

1+阅读 · 2014年12月31日

量子群与Tewilliger代数的相关问题研究

国家自然科学基金

1+阅读 · 2013年12月31日

基于Exemplar-Classifier思想的高分辨率光学遥感影像目标识别研究

国家自然科学基金

2+阅读 · 2013年12月31日

聚合物光敏的小分子宽光谱有机太阳能电池的研究

国家自然科学基金

0+阅读 · 2013年12月31日

Kronheimer-Nakajima quiver 模空间与有理曲面

国家自然科学基金

1+阅读 · 2013年12月31日

离子束石墨烯-半导体氧化物复合光催化材料的可控制备及性能研究

国家自然科学基金

0+阅读 · 2012年12月31日

银屑病中皮肤DC的免疫调节机制

国家自然科学基金

0+阅读 · 2012年12月31日

高表面能晶面暴露的金属氧化物纳米晶体合成及性能研究

国家自然科学基金

0+阅读 · 2012年12月31日

编码密码学中若干组合对象研究

国家自然科学基金

0+阅读 · 2009年12月31日

Learning to Shape Rewards using a Game of Two Partners

Arxiv

0+阅读 · 2023年2月6日

Offline Learning in Markov Games with General Function Approximation

Arxiv

0+阅读 · 2023年2月6日

Quantized-Constraint Concatenation and the Covering Radius of Constrained Systems

Arxiv

0+阅读 · 2023年2月5日

Learning-based Collision-free Planning on Arbitrary Optimization Criteria in the Latent Space through cGANs

Arxiv

0+阅读 · 2023年2月5日

A Game-Theoretic Approach to Solving the Roman Domination Problem

Arxiv

0+阅读 · 2023年2月5日

Numerical methods for backward stochastic differential equations: A survey

Arxiv

0+阅读 · 2023年2月4日

Variational Latent Branching Model for Off-Policy Evaluation

Arxiv

0+阅读 · 2023年2月3日

Exploration in Deep Reinforcement Learning: From Single-Agent to Multiagent Domain

Arxiv

0+阅读 · 2023年2月2日

The Principles of Deep Learning Theory

Arxiv

65+阅读 · 2021年6月18日

Multiagent Soft Q-Learning

Arxiv

11+阅读 · 2018年4月25日

VIP会员

文章信息

相关主题

相关VIP内容

【2022新书】强化学习工业应用，408页pdf

【2022新书】强化学习工业应用，408页pdf

专知会员服务

231+阅读 · 2022年2月3日

【干货书】机器学习速查手册，135页pdf

【干货书】机器学习速查手册，135页pdf

专知会员服务

127+阅读 · 2020年11月20日

不可错过！UIUC最新《统计强化学习》课程！

专知会员服务

53+阅读 · 2020年9月7日

【新书】人工智能Python代码，227页pdf，Python code for Artificial Intelligence: Foundations of Computational Agents

【新书】人工智能Python代码，227页pdf，Python code for Artificial Intelligence: Foundations of Computational Agents

专知会员服务

102+阅读 · 2020年6月21日

【新书】数字图像(影像)处理手第二版，2176pdf，Mathematical Methods in Imaging

【新书】数字图像(影像)处理手第二版，2176pdf，Mathematical Methods in Imaging

专知会员服务

93+阅读 · 2020年2月12日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

160+阅读 · 2019年10月12日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

热门VIP内容

开通专知VIP会员享更多权益服务

【CMU博士论文】以人为中心的强化学习

任务规划与地形分析：现代复杂环境作战导航体系

认知优势：人工智能在国家安全决策中的核心作用

大模型赋能的具身智能：决策与具身学习综述

相关资讯

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【ICIG2021】Latest News & Announcements of the Workshop

【ICIG2021】Latest News & Announcements of the Workshop

中国图象图形学学会CSIG

0+阅读 · 2021年12月20日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium9

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium9

中国图象图形学学会CSIG

0+阅读 · 2021年12月17日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium4

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium4

中国图象图形学学会CSIG

0+阅读 · 2021年11月10日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium1

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium1

中国图象图形学学会CSIG

0+阅读 · 2021年11月3日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

相关论文

Learning to Shape Rewards using a Game of Two Partners

Arxiv

0+阅读 · 2023年2月6日

Offline Learning in Markov Games with General Function Approximation

Arxiv

0+阅读 · 2023年2月6日

Quantized-Constraint Concatenation and the Covering Radius of Constrained Systems

Arxiv

0+阅读 · 2023年2月5日

Learning-based Collision-free Planning on Arbitrary Optimization Criteria in the Latent Space through cGANs

Arxiv

0+阅读 · 2023年2月5日

A Game-Theoretic Approach to Solving the Roman Domination Problem

Arxiv

0+阅读 · 2023年2月5日

Numerical methods for backward stochastic differential equations: A survey

Arxiv

0+阅读 · 2023年2月4日

Variational Latent Branching Model for Off-Policy Evaluation

Arxiv

0+阅读 · 2023年2月3日

Exploration in Deep Reinforcement Learning: From Single-Agent to Multiagent Domain

Arxiv

0+阅读 · 2023年2月2日

The Principles of Deep Learning Theory

Arxiv

65+阅读 · 2021年6月18日

Multiagent Soft Q-Learning

Arxiv

11+阅读 · 2018年4月25日

相关基金

掺杂的稀土氧化物非晶态纳米管可控制备及其热电性能研究

国家自然科学基金

0+阅读 · 2014年12月31日

Anderson型多酸的不对称修饰及可控组装研究

国家自然科学基金

1+阅读 · 2014年12月31日

量子群与Tewilliger代数的相关问题研究

国家自然科学基金

1+阅读 · 2013年12月31日

基于Exemplar-Classifier思想的高分辨率光学遥感影像目标识别研究

国家自然科学基金

2+阅读 · 2013年12月31日

聚合物光敏的小分子宽光谱有机太阳能电池的研究

国家自然科学基金

0+阅读 · 2013年12月31日

Kronheimer-Nakajima quiver 模空间与有理曲面

国家自然科学基金

1+阅读 · 2013年12月31日

离子束石墨烯-半导体氧化物复合光催化材料的可控制备及性能研究

国家自然科学基金

0+阅读 · 2012年12月31日

银屑病中皮肤DC的免疫调节机制

国家自然科学基金

0+阅读 · 2012年12月31日

高表面能晶面暴露的金属氧化物纳米晶体合成及性能研究

国家自然科学基金

0+阅读 · 2012年12月31日

编码密码学中若干组合对象研究

国家自然科学基金

0+阅读 · 2009年12月31日

微信扫码咨询专知VIP会员