被约束的 Min-Max 运动会的交替后台 (Alternating Mirror Descent for Constrained Min-Max Games) - 专知论文

会员服务 ·

0

单纯形 · 国际学习理论会议 · 能量函数 · 约束 · 离散化 ·

2022 年 6 月 8 日

Alternating Mirror Descent for Constrained Min-Max Games

翻译：被约束的 Min-Max 运动会的交替后台

Andre Wibisono,Molei Tao,Georgios Piliouras

In this paper we study two-player bilinear zero-sum games with constrained strategy spaces. An instance of natural occurrences of such constraints is when mixed strategies are used, which correspond to a probability simplex constraint. We propose and analyze the alternating mirror descent algorithm, in which each player takes turns to take action following the mirror descent algorithm for constrained optimization. We interpret alternating mirror descent as an alternating discretization of a skew-gradient flow in the dual space, and use tools from convex optimization and modified energy function to establish an $O(K^{-2/3})$ bound on its average regret after $K$ iterations. This quantitatively verifies the algorithm's better behavior than the simultaneous version of mirror descent algorithm, which is known to diverge and yields an $O(K^{-1/2})$ average regret bound. In the special case of an unconstrained setting, our results recover the behavior of alternating gradient descent algorithm for zero-sum games which was studied in (Bailey et al., COLT 2020).

翻译：在本文中,我们研究了具有限制战略空间的双球双线零和游戏。这种制约的自然发生实例是使用混合策略,这与概率简单度限制相对应。我们提出并分析交替反镜下游算法,其中每个玩家转而根据镜下游算法采取行动以优化限制优化。我们把交替反镜下游解释为双重空间中一个扭曲的分流,并使用来自二次优化和经修改的能量函数的工具,按美元外延后的平均遗憾来设定一个美元(K ⁇ -2/3})美元。这在数量上验证了算法的好于反镜下游算法的同步版本,该版本已知差异并产生一个美元(K ⁇ -1/2})平均遗憾。在未受控制的特殊情况下,我们的结果恢复了在研究的零和零和游戏中(Bailey等人,COLT,2020年)的交替梯梯下下基下游算法的行为。

0

相关内容

单纯形

ICLR 2021杰出论文奖出炉，8篇论文上榜！

专知会员服务

26+阅读 · 2021年4月2日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium9

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium9

中国图象图形学学会CSIG

0+阅读 · 2021年12月17日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium8

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium8

中国图象图形学学会CSIG

0+阅读 · 2021年11月16日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium5

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium5

中国图象图形学学会CSIG

1+阅读 · 2021年11月11日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium2

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium2

中国图象图形学学会CSIG

0+阅读 · 2021年11月8日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium1

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium1

中国图象图形学学会CSIG

0+阅读 · 2021年11月3日

Resveratrol联合MSCs移植对阿尔茨海默鼠的干预效果及Sirt1分子信号的介导作用

国家自然科学基金

0+阅读 · 2014年12月31日

Calderon问题和边界刚性问题

国家自然科学基金

0+阅读 · 2013年12月31日

组蛋白去乙酰化酶（HDAC）在肝细胞再生表观遗传调控中的作用机制研究

国家自然科学基金

0+阅读 · 2009年12月31日

Adiponectin在肝脏缺血再灌注损伤中的抗肝细胞凋亡机制

国家自然科学基金

0+阅读 · 2009年12月31日

TR3相互作用新蛋白机理研究

国家自然科学基金

1+阅读 · 2008年12月31日

Stochastic Gradient Descent with Exponential Convergence Rates of Expected Classification Errors

Arxiv

0+阅读 · 2022年7月25日

Exploration in Linear Bandits with Rich Action Sets and its Implications for Inference

Arxiv

0+阅读 · 2022年7月23日

Efficient Stackelberg Strategies for Finitely Repeated Games

Arxiv

0+阅读 · 2022年7月22日

On the stability of totally upwind schemes for the hyperbolic initial boundary value problem

Arxiv

0+阅读 · 2022年7月22日

High-Dimensional $L_2$Boosting: Rate of Convergence

Arxiv

0+阅读 · 2022年7月21日

VIP会员

文章信息

相关主题

国际学习理论会议

相关VIP内容

ICLR 2021杰出论文奖出炉，8篇论文上榜！

专知会员服务

26+阅读 · 2021年4月2日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

热门VIP内容

开通专知VIP会员享更多权益服务

《将空中力量带向海洋：美国海军航空发展的四条竞争路径及其教训》报告

【MIT博士论文】以语言为中心的医学影像理解

《无人机系统 - 反无人机系统：测试方法》364页

《无人机蜂群攻击防御的预测建模：面向美军战备的人工智能轨迹预测与最优拦截策略设计》最新报告

相关资讯

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium9

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium9

中国图象图形学学会CSIG

0+阅读 · 2021年12月17日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium8

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium8

中国图象图形学学会CSIG

0+阅读 · 2021年11月16日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium5

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium5

中国图象图形学学会CSIG

1+阅读 · 2021年11月11日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium2

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium2

中国图象图形学学会CSIG

0+阅读 · 2021年11月8日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium1

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium1

中国图象图形学学会CSIG

0+阅读 · 2021年11月3日

相关论文

Stochastic Gradient Descent with Exponential Convergence Rates of Expected Classification Errors

Arxiv

0+阅读 · 2022年7月25日

Exploration in Linear Bandits with Rich Action Sets and its Implications for Inference

Arxiv

0+阅读 · 2022年7月23日

Efficient Stackelberg Strategies for Finitely Repeated Games

Arxiv

0+阅读 · 2022年7月22日

On the stability of totally upwind schemes for the hyperbolic initial boundary value problem

Arxiv

0+阅读 · 2022年7月22日

High-Dimensional $L_2$Boosting: Rate of Convergence

Arxiv

0+阅读 · 2022年7月21日

相关基金

Resveratrol联合MSCs移植对阿尔茨海默鼠的干预效果及Sirt1分子信号的介导作用

国家自然科学基金

0+阅读 · 2014年12月31日

Calderon问题和边界刚性问题

国家自然科学基金

0+阅读 · 2013年12月31日

组蛋白去乙酰化酶（HDAC）在肝细胞再生表观遗传调控中的作用机制研究

国家自然科学基金

0+阅读 · 2009年12月31日

Adiponectin在肝脏缺血再灌注损伤中的抗肝细胞凋亡机制

国家自然科学基金

0+阅读 · 2009年12月31日

TR3相互作用新蛋白机理研究

国家自然科学基金

1+阅读 · 2008年12月31日

微信扫码咨询专知VIP会员