一种简单的适应程序, 与免罪相关的平衡性相融合 (A Simple Adaptive Procedure Converging to Forgiving Correlated Equilibria) - 专知论文

会员服务 ·

0

相关系数 · SimPLe · Extensibility · 知识 (knowledge) · INFORMS ·

2022 年 7 月 13 日

A Simple Adaptive Procedure Converging to Forgiving Correlated Equilibria

翻译：一种简单的适应程序, 与免罪相关的平衡性相融合

from arxiv, Originally published in 2020 as a senior honors thesis under the supervision of Gabriel Carroll at https://purl.stanford.edu/hk596cg1085

Simple adaptive procedures that converge to correlated equilibria are known to exist for normal form games (Hart and Mas-Colell 2000), but no such analogue exists for extensive-form games. Leveraging inspiration from Zinkevich et al. (2008), we show that any internal regret minimization procedure designed for normal-form games can be efficiently extended to finite extensive-form games of perfect recall. Our procedure converges to the set of forgiving correlated equilibria, a refinement of various other proposed extensions of the correlated equilibrium solution concept to extensive-form games (Forges 1986a; Forges 1986b; von Stengel and Forges 2008). In a forgiving correlated equilibrium, players receive move recommendations only upon reaching the relevant information set instead of all at once at the beginning of the game. Assuming all other players follow their recommendations, each player is incentivized to follow her recommendations regardless of whether she has done so at previous infosets. The resulting procedure is completely decentralized: players need neither knowledge of their opponents' actions nor even a complete understanding of the game itself beyond their own payoffs and strategies.

翻译：已知普通形式游戏(Hart和Mas-Colell 2000)存在与相关平衡相趋合的简单适应程序,但大型形式游戏却不存在这种类比。利用Zinkevich等人(2008年)的灵感,我们显示,为普通形式游戏设计的任何内部最小遗憾程序都可以有效地扩大到有限的、完全回想的广型游戏。我们的程序与一套原谅相关平衡概念的组合相趋一致,这是对广泛形式游戏相关平衡解决方案概念的其他各种拟议扩展的改进( Forges 1986a; Forges 1986b; von Stengel和Forges 2008) 。在宽容的关联平衡中,玩家只有在获得相关信息,而不是在游戏开始时一次性收到移动建议。假设所有其他玩家都遵循他们的建议,每个玩家都被鼓励遵循她的建议,而不管她是否在以前的组合中这样做了。由此产生的程序是完全分散化的:玩家不需要了解他们的对手的行动,甚至完全理解游戏本身的付款和战略。

0

相关内容

相关系数

ICLR 2022杰出论文公布：7篇论文获得，清华朱军课题组摘得

ICLR 2022杰出论文公布：7篇论文获得，清华朱军课题组摘得

专知会员服务

60+阅读 · 2022年4月22日

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

专知会员服务

78+阅读 · 2022年3月15日

ICLR 2021杰出论文奖出炉，8篇论文上榜！

专知会员服务

26+阅读 · 2021年4月2日

因果图，Causal Graphs，52页ppt

因果图，Causal Graphs，52页ppt

专知会员服务

253+阅读 · 2020年4月19日

【跨语言BERT模型大集合】Transfer learning is increasingly going multilingual with language-specific BERT models

专知会员服务

54+阅读 · 2020年1月30日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日

Stabilizing Transformers for Reinforcement Learning

Stabilizing Transformers for Reinforcement Learning

专知会员服务

60+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

VCIP 2022 Call for Special Session Proposals

VCIP 2022 Call for Special Session Proposals

CCF多媒体专委会

1+阅读 · 2022年4月1日

IEEE ICKG 2022: Call for Papers

IEEE ICKG 2022: Call for Papers

机器学习与推荐算法

3+阅读 · 2022年3月30日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

IEEE TII Call For Papers

IEEE TII Call For Papers

CCF多媒体专委会

3+阅读 · 2022年3月24日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

少汗型外胚叶发育不全综合征患者突变基因的分子生物学研究

国家自然科学基金

0+阅读 · 2015年12月31日

基于相依数据的梯度学习理论研究

国家自然科学基金

1+阅读 · 2015年12月31日

利用小鼠模型研究PCOS对小鼠卵母细胞及后代生殖细胞DNA甲基化印迹影响

国家自然科学基金

0+阅读 · 2014年12月31日

茉莉素调控水稻花分生组织发育分子机制的研究

国家自然科学基金

0+阅读 · 2014年12月31日

黎曼流形上椭圆算子的谱估计

国家自然科学基金

0+阅读 · 2013年12月31日

渐近锥流形上色散方程的研究

国家自然科学基金

0+阅读 · 2013年12月31日

幂零李群上热核估计的几个问题

国家自然科学基金

0+阅读 · 2012年12月31日

La2-xSrxCuO4双层薄膜制备及载流子调控超导电性研究

国家自然科学基金

0+阅读 · 2012年12月31日

Navier-Stokes方程解的适定性和粘性消失问题

国家自然科学基金

0+阅读 · 2011年12月31日

微分算子自共轭域的实谱参数解刻画及谱分析

国家自然科学基金

0+阅读 · 2009年12月31日

Sequential Information Design: Learning to Persuade in the Dark

Arxiv

0+阅读 · 2022年9月8日

Strong Optimistic Solving for Dynamic Symbolic Execution

Arxiv

0+阅读 · 2022年9月8日

Scheduling Operator Assistance for Shared Autonomy in Multi-Robot Teams

Arxiv

0+阅读 · 2022年9月7日

On the Sparse DAG Structure Learning Based on Adaptive Lasso

Arxiv

0+阅读 · 2022年9月7日

An augmented fully-mixed formulation for the quasistatic Navier--Stokes--Biot model

Arxiv

0+阅读 · 2022年9月7日

Convergence and error estimates of a penalization finite volume method for the compressible Navier-Stokes system

Arxiv

0+阅读 · 2022年9月6日

Multi-Armed Bandits with Self-Information Rewards

Arxiv

0+阅读 · 2022年9月6日

Multiobjective Ranking and Selection Using Stochastic Kriging

Arxiv

0+阅读 · 2022年9月5日

Resolving Infeasibility of Linear Systems: A Parameterized Approach

Arxiv

0+阅读 · 2022年9月5日

Learn to Adapt to New Environment from Past Experience and Few Pilot

Arxiv

0+阅读 · 2022年9月2日

VIP会员

文章信息

相关主题

知识 (knowledge)

相关VIP内容

ICLR 2022杰出论文公布：7篇论文获得，清华朱军课题组摘得

ICLR 2022杰出论文公布：7篇论文获得，清华朱军课题组摘得

专知会员服务

60+阅读 · 2022年4月22日

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

专知会员服务

78+阅读 · 2022年3月15日

ICLR 2021杰出论文奖出炉，8篇论文上榜！

专知会员服务

26+阅读 · 2021年4月2日

因果图，Causal Graphs，52页ppt

因果图，Causal Graphs，52页ppt

专知会员服务

253+阅读 · 2020年4月19日

【跨语言BERT模型大集合】Transfer learning is increasingly going multilingual with language-specific BERT models

专知会员服务

54+阅读 · 2020年1月30日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日

Stabilizing Transformers for Reinforcement Learning

Stabilizing Transformers for Reinforcement Learning

专知会员服务

60+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

《城市滨海地区：理解复杂多变环境下的指挥控制框架》50页报告

《理解城市战及其在俄乌战争中的表现》报告

美空军“顶点2025”实验：推进AI在C2、动态目标锁定与联盟集成中的应用

《建设式兵棋模拟作为战术集群配置优化的关键组成部分》

相关资讯

VCIP 2022 Call for Special Session Proposals

VCIP 2022 Call for Special Session Proposals

CCF多媒体专委会

1+阅读 · 2022年4月1日

IEEE ICKG 2022: Call for Papers

IEEE ICKG 2022: Call for Papers

机器学习与推荐算法

3+阅读 · 2022年3月30日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

IEEE TII Call For Papers

IEEE TII Call For Papers

CCF多媒体专委会

3+阅读 · 2022年3月24日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

相关论文

Sequential Information Design: Learning to Persuade in the Dark

Arxiv

0+阅读 · 2022年9月8日

Strong Optimistic Solving for Dynamic Symbolic Execution

Arxiv

0+阅读 · 2022年9月8日

Scheduling Operator Assistance for Shared Autonomy in Multi-Robot Teams

Arxiv

0+阅读 · 2022年9月7日

On the Sparse DAG Structure Learning Based on Adaptive Lasso

Arxiv

0+阅读 · 2022年9月7日

An augmented fully-mixed formulation for the quasistatic Navier--Stokes--Biot model

Arxiv

0+阅读 · 2022年9月7日

Convergence and error estimates of a penalization finite volume method for the compressible Navier-Stokes system

Arxiv

0+阅读 · 2022年9月6日

Multi-Armed Bandits with Self-Information Rewards

Arxiv

0+阅读 · 2022年9月6日

Multiobjective Ranking and Selection Using Stochastic Kriging

Arxiv

0+阅读 · 2022年9月5日

Resolving Infeasibility of Linear Systems: A Parameterized Approach

Arxiv

0+阅读 · 2022年9月5日

Learn to Adapt to New Environment from Past Experience and Few Pilot

Arxiv

0+阅读 · 2022年9月2日

相关基金

少汗型外胚叶发育不全综合征患者突变基因的分子生物学研究

国家自然科学基金

0+阅读 · 2015年12月31日

基于相依数据的梯度学习理论研究

国家自然科学基金

1+阅读 · 2015年12月31日

利用小鼠模型研究PCOS对小鼠卵母细胞及后代生殖细胞DNA甲基化印迹影响

国家自然科学基金

0+阅读 · 2014年12月31日

茉莉素调控水稻花分生组织发育分子机制的研究

国家自然科学基金

0+阅读 · 2014年12月31日

黎曼流形上椭圆算子的谱估计

国家自然科学基金

0+阅读 · 2013年12月31日

渐近锥流形上色散方程的研究

国家自然科学基金

0+阅读 · 2013年12月31日

幂零李群上热核估计的几个问题

国家自然科学基金

0+阅读 · 2012年12月31日

La2-xSrxCuO4双层薄膜制备及载流子调控超导电性研究

国家自然科学基金

0+阅读 · 2012年12月31日

Navier-Stokes方程解的适定性和粘性消失问题

国家自然科学基金

0+阅读 · 2011年12月31日

微分算子自共轭域的实谱参数解刻画及谱分析

国家自然科学基金

0+阅读 · 2009年12月31日

微信扫码咨询专知VIP会员