商店市场运动会 (Stochastic Market Games) - 专知论文

会员服务 ·

0

Agent · Learning · ForCES · motivation · 情景 ·

2022 年 7 月 18 日

Stochastic Market Games

翻译：商店市场运动会

Kyrill Schmid,Lenz Belzner,Robert Müller,Johannes Tochtermann,Claudia Linhoff-Popien

from arxiv, IJCAI-21

Some of the most relevant future applications of multi-agent systems like autonomous driving or factories as a service display mixed-motive scenarios, where agents might have conflicting goals. In these settings agents are likely to learn undesirable outcomes in terms of cooperation under independent learning, such as overly greedy behavior. Motivated from real world societies, in this work we propose to utilize market forces to provide incentives for agents to become cooperative. As demonstrated in an iterated version of the Prisoner's Dilemma, the proposed market formulation can change the dynamics of the game to consistently learn cooperative policies. Further we evaluate our approach in spatially and temporally extended settings for varying numbers of agents. We empirically find that the presence of markets can improve both the overall result and agent individual returns via their trading activities.

翻译：多试剂系统(如自主驾驶或工厂,作为一种服务形式)今后最相关的一些应用,如自主驾驶或工厂,显示混合动机的情景,其中代理商可能具有相互冲突的目标。在这些环境中,代理商有可能在独立学习的合作中学到不良的结果,例如过度贪婪的行为。我们从现实世界的社会出发,在这项工作中提议利用市场力量激励代理商成为合作者。正如《囚犯困境》的迭代版所显示的那样,拟议的市场配置可以改变游戏的动态,以不断学习合作政策。我们进一步评估我们在空间和时间上为不同代理商扩展的环境下的做法。我们从经验中发现,市场的存在能够通过交易活动改善总体结果和代理商个人回报。

0

相关内容

Agent

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【ICIG2021】Latest News & Announcements of the Plenary Talk2

【ICIG2021】Latest News & Announcements of the Plenary Talk2

中国图象图形学学会CSIG

0+阅读 · 2021年11月2日

【ICIG2021】Latest News & Announcements of the Plenary Talk1

【ICIG2021】Latest News & Announcements of the Plenary Talk1

中国图象图形学学会CSIG

0+阅读 · 2021年11月1日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

分布式混合SAR/ISAR对复杂运动舰船目标成像关键技术与新方法研究

国家自然科学基金

2+阅读 · 2014年12月31日

考虑观测值时空相关性的InSAR三维形变估计方法

国家自然科学基金

0+阅读 · 2013年12月31日

基于梯度Kriging方法的轮胎花纹形状优化

国家自然科学基金

0+阅读 · 2011年12月31日

高光度blazar的甚高能伽马射线辐射研究

国家自然科学基金

0+阅读 · 2009年12月31日

干涉SAR与LIDAR森林参数协同反演模型与方法

国家自然科学基金

0+阅读 · 2008年12月31日

Evaluating tests for cluster-randomized trials with few clusters under generalized linear mixed models with covariate adjustment: a simulation study

Arxiv

0+阅读 · 2022年9月9日

Optimal Offloading Strategies for Edge-Computing via Mean-Field Games and Control

Optimal Offloading Strategies for Edge-Computing via Mean-Field Games and Control

Arxiv

0+阅读 · 2022年9月8日

Taking Advice from (Dis)Similar Machines: The Impact of Human-Machine Similarity on Machine-Assisted Decision-Making

Arxiv

0+阅读 · 2022年9月8日

Stochastic gradient descent with gradient estimator for categorical features

Arxiv

0+阅读 · 2022年9月8日

DAVE Aquatic Virtual Environment: Toward a General Underwater Robotics Simulator

Arxiv

0+阅读 · 2022年9月6日

VIP会员

文章信息

相关主题

相关VIP内容

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

前沿人工智能趋势报告（Frontier AI Trends Report）

【AAAI2026】善始则事半功倍：基于前缀优化的大语言模型推理强化学习

Andrej Karpathy：2025 年 LLM 年度回顾（2025 LLM Year in Review）

音退化问题：基于输入操控的鲁棒语音转换综述

相关资讯

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【ICIG2021】Latest News & Announcements of the Plenary Talk2

【ICIG2021】Latest News & Announcements of the Plenary Talk2

中国图象图形学学会CSIG

0+阅读 · 2021年11月2日

【ICIG2021】Latest News & Announcements of the Plenary Talk1

【ICIG2021】Latest News & Announcements of the Plenary Talk1

中国图象图形学学会CSIG

0+阅读 · 2021年11月1日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

相关论文

Evaluating tests for cluster-randomized trials with few clusters under generalized linear mixed models with covariate adjustment: a simulation study

Arxiv

0+阅读 · 2022年9月9日

Optimal Offloading Strategies for Edge-Computing via Mean-Field Games and Control

Optimal Offloading Strategies for Edge-Computing via Mean-Field Games and Control

Arxiv

0+阅读 · 2022年9月8日

Taking Advice from (Dis)Similar Machines: The Impact of Human-Machine Similarity on Machine-Assisted Decision-Making

Arxiv

0+阅读 · 2022年9月8日

Stochastic gradient descent with gradient estimator for categorical features

Arxiv

0+阅读 · 2022年9月8日

DAVE Aquatic Virtual Environment: Toward a General Underwater Robotics Simulator

Arxiv

0+阅读 · 2022年9月6日

相关基金

分布式混合SAR/ISAR对复杂运动舰船目标成像关键技术与新方法研究

国家自然科学基金

2+阅读 · 2014年12月31日

考虑观测值时空相关性的InSAR三维形变估计方法

国家自然科学基金

0+阅读 · 2013年12月31日

基于梯度Kriging方法的轮胎花纹形状优化

国家自然科学基金

0+阅读 · 2011年12月31日

高光度blazar的甚高能伽马射线辐射研究

国家自然科学基金

0+阅读 · 2009年12月31日

干涉SAR与LIDAR森林参数协同反演模型与方法

国家自然科学基金

0+阅读 · 2008年12月31日

微信扫码咨询专知VIP会员