在多人游戏中学习可合理实现的平衡 (Learning Rationalizable Equilibria in Multiplayer Games) - 专知论文

会员服务 ·

0

Learning · 样本复杂度 · 相关系数 · 样本 · 赌博机/老虎机 ·

2022 年 10 月 20 日

Learning Rationalizable Equilibria in Multiplayer Games

翻译：在多人游戏中学习可合理实现的平衡

Yuanhao Wang,Dingwen Kong,Yu Bai,Chi Jin

from arxiv, 33 pages

A natural goal in multiagent learning besides finding equilibria is to learn rationalizable behavior, where players learn to avoid iteratively dominated actions. However, even in the basic setting of multiplayer general-sum games, existing algorithms require a number of samples exponential in the number of players to learn rationalizable equilibria under bandit feedback. This paper develops the first line of efficient algorithms for learning rationalizable Coarse Correlated Equilibria (CCE) and Correlated Equilibria (CE) whose sample complexities are polynomial in all problem parameters including the number of players. To achieve this result, we also develop a new efficient algorithm for the simpler task of finding one rationalizable action profile (not necessarily an equilibrium), whose sample complexity substantially improves over the best existing results of Wu et al. (2021). Our algorithms incorporate several novel techniques to guarantee rationalizability and no (swap-)regret simultaneously, including a correlated exploration scheme and adaptive learning rates, which may be of independent interest. We complement our results with a sample complexity lower bound showing the sharpness of our guarantees.

翻译：多试剂学习的自然目标,除了找到平衡之外,还包括学习可合理化的行为,让玩家学会避免反复支配的行为。然而,即使在多玩家普通游戏和游戏的基本设置中,现有的算法也需要一些玩家数目的样本指数,以便根据土匪反馈学习可合理化的平衡。本文开发了第一行高效算法,用于学习可合理化的Coarse Corcontal Equilibria(CCCE)和Corcontern Equilibria(CE),其样本复杂性在所有问题参数(包括玩家数目)中是多元的。为了实现这一结果,我们还开发了一种新的高效算法,用于更简单的任务,即找到一个可合理化的行动特征(不一定是均衡的),其样本复杂性大大高于Wu等人(2021年)的现有最佳结果。我们的算法包含若干新技术,以保证合理化,而没有(swap)regret,包括一个相关的勘探计划和适应性学习率,这可能具有独立的兴趣。我们用一个比较复杂度较低的样本来补充我们的结果,显示我们的保证的精确度。

0

相关内容

Learning

宾夕法尼亚大学最新《不确定性估计》课程笔记，134页pdf，附Slides

宾夕法尼亚大学最新《不确定性估计》课程笔记，134页pdf，附Slides

专知会员服务

49+阅读 · 2022年11月13日

INRIA 最新《机器学习理论》课程笔记，176页pdf

专知会员服务

51+阅读 · 2020年12月14日

【跨语言BERT模型大集合】Transfer learning is increasingly going multilingual with language-specific BERT models

专知会员服务

54+阅读 · 2020年1月30日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium4

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium4

中国图象图形学学会CSIG

0+阅读 · 2021年11月10日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium3

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium3

中国图象图形学学会CSIG

0+阅读 · 2021年11月9日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium2

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium2

中国图象图形学学会CSIG

0+阅读 · 2021年11月8日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium1

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium1

中国图象图形学学会CSIG

0+阅读 · 2021年11月3日

【ICIG2021】Latest News & Announcements of the Plenary Talk1

【ICIG2021】Latest News & Announcements of the Plenary Talk1

中国图象图形学学会CSIG

0+阅读 · 2021年11月1日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

阵列式BiOCl/C60单晶面暴露自组装薄膜的制备与产氢性能研究

国家自然科学基金

0+阅读 · 2015年12月31日

Decorin对急性缺血性卒中后血脑屏障中ZO-1蛋白的作用及机制研究

国家自然科学基金

0+阅读 · 2015年12月31日

非凸稀疏正则化模型与算法的研究

国家自然科学基金

3+阅读 · 2015年12月31日

EGF对猪小肠磷吸收通道蛋白NPT-IIb调控的分子机制

国家自然科学基金

0+阅读 · 2014年12月31日

Prohibitin1在胆管癌中的作用及分子机制

国家自然科学基金

0+阅读 · 2013年12月31日

钛酸钡负载纳米银微粒填充的聚合物基复合材料介电性能研究

国家自然科学基金

0+阅读 · 2012年12月31日

Mg2SiO4纳米晶陶瓷及其纳米复合微波介质的制备、介电性能调控

国家自然科学基金

0+阅读 · 2012年12月31日

新型免疫负调控分子TIPE2调控CD4+T细胞的功能及在HBV感染中的作用研究

国家自然科学基金

0+阅读 · 2012年12月31日

函数域中的Vinogradov中值定理

国家自然科学基金

0+阅读 · 2012年12月31日

钛基铂系、镍系双金属纳米电极材料的一步法制备与电催化活性

国家自然科学基金

0+阅读 · 2008年12月31日

Private Multiparty Perception for Navigation

Arxiv

0+阅读 · 2022年12月2日

Learning to Select from Multiple Options

Arxiv

0+阅读 · 2022年12月1日

Polynomial-Time Optimal Equilibria with a Mediator in Extensive-Form Games

Arxiv

0+阅读 · 2022年12月1日

Differentially-private Distributed Algorithms for Aggregative Games with Guaranteed Convergence

Arxiv

0+阅读 · 2022年11月30日

Newton Method with Variable Selection by the Proximal Gradient Method

Arxiv

0+阅读 · 2022年11月30日

Understanding transit ridership in an equity context through a comparison of statistical and machine learning algorithms

Arxiv

0+阅读 · 2022年11月30日

Regret Pruning for Learning Equilibria in Simulation-Based Games

Arxiv

0+阅读 · 2022年11月30日

A Survey of Decision Making in Adversarial Games

Arxiv

84+阅读 · 2022年7月16日

Balanced Multimodal Learning via On-the-fly Gradient Modulation

Arxiv

13+阅读 · 2022年3月29日

Automated Graph Machine Learning: Approaches, Libraries and Directions

Arxiv

20+阅读 · 2022年1月4日

VIP会员

文章信息

相关主题

样本复杂度

赌博机/老虎机

相关VIP内容

宾夕法尼亚大学最新《不确定性估计》课程笔记，134页pdf，附Slides

宾夕法尼亚大学最新《不确定性估计》课程笔记，134页pdf，附Slides

专知会员服务

49+阅读 · 2022年11月13日

INRIA 最新《机器学习理论》课程笔记，176页pdf

专知会员服务

51+阅读 · 2020年12月14日

【跨语言BERT模型大集合】Transfer learning is increasingly going multilingual with language-specific BERT models

专知会员服务

54+阅读 · 2020年1月30日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

面向向量的机器学习系统：跨栈方法

【ICCV2025】SO(3) 上连续非保守动力系统的预测

【ICCV2025】Lay2Story：扩展扩散 Transformer 以实现可切换布局的故事生成

人工智能治理全景综述

相关资讯

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium4

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium4

中国图象图形学学会CSIG

0+阅读 · 2021年11月10日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium3

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium3

中国图象图形学学会CSIG

0+阅读 · 2021年11月9日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium2

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium2

中国图象图形学学会CSIG

0+阅读 · 2021年11月8日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium1

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium1

中国图象图形学学会CSIG

0+阅读 · 2021年11月3日

【ICIG2021】Latest News & Announcements of the Plenary Talk1

【ICIG2021】Latest News & Announcements of the Plenary Talk1

中国图象图形学学会CSIG

0+阅读 · 2021年11月1日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

相关论文

Private Multiparty Perception for Navigation

Arxiv

0+阅读 · 2022年12月2日

Learning to Select from Multiple Options

Arxiv

0+阅读 · 2022年12月1日

Polynomial-Time Optimal Equilibria with a Mediator in Extensive-Form Games

Arxiv

0+阅读 · 2022年12月1日

Differentially-private Distributed Algorithms for Aggregative Games with Guaranteed Convergence

Arxiv

0+阅读 · 2022年11月30日

Newton Method with Variable Selection by the Proximal Gradient Method

Arxiv

0+阅读 · 2022年11月30日

Understanding transit ridership in an equity context through a comparison of statistical and machine learning algorithms

Arxiv

0+阅读 · 2022年11月30日

Regret Pruning for Learning Equilibria in Simulation-Based Games

Arxiv

0+阅读 · 2022年11月30日

A Survey of Decision Making in Adversarial Games

Arxiv

84+阅读 · 2022年7月16日

Balanced Multimodal Learning via On-the-fly Gradient Modulation

Arxiv

13+阅读 · 2022年3月29日

Automated Graph Machine Learning: Approaches, Libraries and Directions

Arxiv

20+阅读 · 2022年1月4日

相关基金

阵列式BiOCl/C60单晶面暴露自组装薄膜的制备与产氢性能研究

国家自然科学基金

0+阅读 · 2015年12月31日

Decorin对急性缺血性卒中后血脑屏障中ZO-1蛋白的作用及机制研究

国家自然科学基金

0+阅读 · 2015年12月31日

非凸稀疏正则化模型与算法的研究

国家自然科学基金

3+阅读 · 2015年12月31日

EGF对猪小肠磷吸收通道蛋白NPT-IIb调控的分子机制

国家自然科学基金

0+阅读 · 2014年12月31日

Prohibitin1在胆管癌中的作用及分子机制

国家自然科学基金

0+阅读 · 2013年12月31日

钛酸钡负载纳米银微粒填充的聚合物基复合材料介电性能研究

国家自然科学基金

0+阅读 · 2012年12月31日

Mg2SiO4纳米晶陶瓷及其纳米复合微波介质的制备、介电性能调控

国家自然科学基金

0+阅读 · 2012年12月31日

新型免疫负调控分子TIPE2调控CD4+T细胞的功能及在HBV感染中的作用研究

国家自然科学基金

0+阅读 · 2012年12月31日

函数域中的Vinogradov中值定理

国家自然科学基金

0+阅读 · 2012年12月31日

钛基铂系、镍系双金属纳米电极材料的一步法制备与电催化活性

国家自然科学基金

0+阅读 · 2008年12月31日

微信扫码咨询专知VIP会员