中巴-巴伊夫对立运动会的易变性和强健性 (Fragility and Robustness in Mean-Payoff Adversarial Stackelberg Games) - 专知论文

会员服务 ·

0

稳健性 · Weight · MoDELS · CASE · HER ·

2021 年 8 月 2 日

Fragility and Robustness in Mean-Payoff Adversarial Stackelberg Games

翻译：中巴-巴伊夫对立运动会的易变性和强健性

Mrudula Balachander,Shibashis Guha,Jean-François Raskin

from arxiv, Full version of paper accepted in CONCUR 2021

Two-player mean-payoff Stackelberg games are nonzero-sum infinite duration games played on a bi-weighted graph by Leader (Player 0) and Follower (Player 1). Such games are played sequentially: first, Leader announces her strategy, second, Follower chooses his best-response. If we cannot impose which best-response is chosen by Follower, we say that Follower, though strategic, is adversarial towards Leader. The maximal value that Leader can get in this nonzero-sum game is called the adversarial Stackelberg value (ASV) of the game. We study the robustness of strategies for Leader in these games against two types of deviations: (i) Modeling imprecision - the weights on the edges of the game arena may not be exactly correct, they may be delta-away from the right one. (ii) Sub-optimal response - Follower may play epsilon-optimal best-responses instead of perfect best-responses. First, we show that if the game is zero-sum then robustness is guaranteed while in the nonzero-sum case, optimal strategies for ASV are fragile. Second, we provide a solution concept to obtain strategies for Leader that are robust to both modeling imprecision, and as well as to the epsilon-optimal responses of Follower, and study several properties and algorithmic problems related to this solution concept.

翻译：玩家 Stackelberg 游戏不是零和无限的游戏, 由领导者( Player 0) 和追随者( Player 1) 在双加权图表上玩这种游戏。这种游戏按顺序进行: 首先, 领导者宣布其战略, 第二, 追随者选择他的最佳反应。如果我们不能强制实施跟踪者选择的最佳反应, 我们说, 追随者虽然具有战略意义, 却对领导者持对立态度。领导者在这场非零和游戏中能得到的最大价值被称为游戏的对抗性Stackelberg 值( ASV ) 。我们研究这些游戏中领导者战略的稳健性, 对抗两种偏差:( 一) 建模不精度 — 游戏场边缘的权重可能不完全正确。如果我们无法强制实施什么最佳反应, 跟踪者可能会对领导者做出最优性反应, 而不是完美的最佳反应。首先, 我们显示, 如果游戏是零和随后的稳性策略, 我们就会保证这些游戏的策略的稳健性, 而在不稳的亚行者的策略中, 将获得一个最稳性的策略。

0

相关内容

稳健性

【Google】平滑对抗训练，Smooth Adversarial Training

【Google】平滑对抗训练，Smooth Adversarial Training

专知会员服务

49+阅读 · 2020年7月4日

【SIGIR2020】多检索系统的贝叶斯推理风险评估，Bayesian Inferential Risk Evaluation On Multiple IR Systems

【SIGIR2020】多检索系统的贝叶斯推理风险评估，Bayesian Inferential Risk Evaluation On Multiple IR Systems

专知会员服务

9+阅读 · 2020年6月10日

【ACL2020】对抗性文本生成，Improving Adversarial Text Generation

专知会员服务

52+阅读 · 2020年5月5日

【硬核书】数学博弈论与应用，431页pdf，Mathematical Game Theory and Applications

【硬核书】数学博弈论与应用，431页pdf，Mathematical Game Theory and Applications

专知会员服务

170+阅读 · 2020年4月18日

【MIT】对抗鲁棒性的流形正则化，Manifold Regularization for Adversarial Robustness

【MIT】对抗鲁棒性的流形正则化，Manifold Regularization for Adversarial Robustness

专知会员服务

28+阅读 · 2020年3月11日

深度强化学习策略梯度教程，53页ppt

深度强化学习策略梯度教程，53页ppt

专知会员服务

184+阅读 · 2020年2月1日

【O'Reilly TensorFlow Conference 2019】不要打败市场；击败机器人：金融对抗网络（Don’t beat the market; beat the bots: Adversarial networks in finance），Manceps机器学习架构师Garrett Lander、首席执行官兼首席顾问Al Kari

【O'Reilly TensorFlow Conference 2019】不要打败市场；击败机器人：金融对抗网络（Don’t beat the market; beat the bots: Adversarial networks in finance），Manceps机器学习架构师Garrett Lander、首席执行官兼首席顾问Al Kari

专知会员服务

16+阅读 · 2019年11月13日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

19篇ICML2019论文摘录选读！

19篇ICML2019论文摘录选读！

专知

28+阅读 · 2019年4月28日

【TED】生命中的每一年的智慧

【TED】生命中的每一年的智慧

英语演讲视频每日一推

10+阅读 · 2019年1月29日

逆强化学习-学习人先验的动机

逆强化学习-学习人先验的动机

CreateAMind

16+阅读 · 2019年1月18日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

RL 真经

CreateAMind

5+阅读 · 2018年12月28日

【SIGIR2018】五篇对抗训练文章

【SIGIR2018】五篇对抗训练文章

专知

12+阅读 · 2018年7月9日

Hierarchical Imitation - Reinforcement Learning

Hierarchical Imitation - Reinforcement Learning

CreateAMind

19+阅读 · 2018年5月25日

最新5篇生成对抗网络相关论文推荐—FusedGAN、DeblurGAN、AdvGAN、CipherGAN、MMD GANS

最新5篇生成对抗网络相关论文推荐—FusedGAN、DeblurGAN、AdvGAN、CipherGAN、MMD GANS

专知

23+阅读 · 2018年1月18日

Adversarial Variational Bayes: Unifying VAE and GAN 代码

Adversarial Variational Bayes: Unifying VAE and GAN 代码

CreateAMind

7+阅读 · 2017年10月4日

Auto-Encoding GAN

Auto-Encoding GAN

CreateAMind

7+阅读 · 2017年8月4日

Characterizing Omega-Regularity through Finite-Memory Determinacy of Games on Infinite Graphs

Arxiv

0+阅读 · 2021年10月4日

Inducing Equilibria via Incentives: Simultaneous Design-and-Play Finds Global Optima

Arxiv

0+阅读 · 2021年10月4日

Simulations and the Lamplighter group

Arxiv

0+阅读 · 2021年10月1日

Robustness of non-computability

Arxiv

0+阅读 · 2021年9月30日

Privacy and Robustness in Federated Learning: Attacks and Defenses

Arxiv

35+阅读 · 2020年12月7日

Attribute-Guided Adversarial Training for Robustness to Natural Perturbations

Arxiv

15+阅读 · 2020年12月3日

Opportunities and Challenges in Deep Learning Adversarial Robustness: A Survey

Arxiv

3+阅读 · 2020年7月3日

Deflecting Adversarial Attacks

Deflecting Adversarial Attacks

Arxiv

8+阅读 · 2020年2月18日

Sequential Attacks on Agents for Long-Term Adversarial Goals

Sequential Attacks on Agents for Long-Term Adversarial Goals

Arxiv

5+阅读 · 2018年7月5日

Adversarial Meta-Learning

Arxiv

7+阅读 · 2018年6月8日

VIP会员

文章信息

相关主题

相关VIP内容

【Google】平滑对抗训练，Smooth Adversarial Training

【Google】平滑对抗训练，Smooth Adversarial Training

专知会员服务

49+阅读 · 2020年7月4日

【SIGIR2020】多检索系统的贝叶斯推理风险评估，Bayesian Inferential Risk Evaluation On Multiple IR Systems

【SIGIR2020】多检索系统的贝叶斯推理风险评估，Bayesian Inferential Risk Evaluation On Multiple IR Systems

专知会员服务

9+阅读 · 2020年6月10日

【ACL2020】对抗性文本生成，Improving Adversarial Text Generation

专知会员服务

52+阅读 · 2020年5月5日

【硬核书】数学博弈论与应用，431页pdf，Mathematical Game Theory and Applications

【硬核书】数学博弈论与应用，431页pdf，Mathematical Game Theory and Applications

专知会员服务

170+阅读 · 2020年4月18日

【MIT】对抗鲁棒性的流形正则化，Manifold Regularization for Adversarial Robustness

【MIT】对抗鲁棒性的流形正则化，Manifold Regularization for Adversarial Robustness

专知会员服务

28+阅读 · 2020年3月11日

深度强化学习策略梯度教程，53页ppt

深度强化学习策略梯度教程，53页ppt

专知会员服务

184+阅读 · 2020年2月1日

【O'Reilly TensorFlow Conference 2019】不要打败市场；击败机器人：金融对抗网络（Don’t beat the market; beat the bots: Adversarial networks in finance），Manceps机器学习架构师Garrett Lander、首席执行官兼首席顾问Al Kari

【O'Reilly TensorFlow Conference 2019】不要打败市场；击败机器人：金融对抗网络（Don’t beat the market; beat the bots: Adversarial networks in finance），Manceps机器学习架构师Garrett Lander、首席执行官兼首席顾问Al Kari

专知会员服务

16+阅读 · 2019年11月13日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

热门VIP内容

开通专知VIP会员享更多权益服务

《复杂工程系统模型驱动设计决策支持系统：早期设计阶段挑战》最新138页

《日本陆上自卫队2040年作战方式与未来作战研究》最新23页slides

人工智能作为战争武器

《后勤保障》最新23页

相关资讯

19篇ICML2019论文摘录选读！

19篇ICML2019论文摘录选读！

专知

28+阅读 · 2019年4月28日

【TED】生命中的每一年的智慧

【TED】生命中的每一年的智慧

英语演讲视频每日一推

10+阅读 · 2019年1月29日

逆强化学习-学习人先验的动机

逆强化学习-学习人先验的动机

CreateAMind

16+阅读 · 2019年1月18日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

RL 真经

CreateAMind

5+阅读 · 2018年12月28日

【SIGIR2018】五篇对抗训练文章

【SIGIR2018】五篇对抗训练文章

专知

12+阅读 · 2018年7月9日

Hierarchical Imitation - Reinforcement Learning

Hierarchical Imitation - Reinforcement Learning

CreateAMind

19+阅读 · 2018年5月25日

最新5篇生成对抗网络相关论文推荐—FusedGAN、DeblurGAN、AdvGAN、CipherGAN、MMD GANS

最新5篇生成对抗网络相关论文推荐—FusedGAN、DeblurGAN、AdvGAN、CipherGAN、MMD GANS

专知

23+阅读 · 2018年1月18日

Adversarial Variational Bayes: Unifying VAE and GAN 代码

Adversarial Variational Bayes: Unifying VAE and GAN 代码

CreateAMind

7+阅读 · 2017年10月4日

Auto-Encoding GAN

Auto-Encoding GAN

CreateAMind

7+阅读 · 2017年8月4日

相关论文

Characterizing Omega-Regularity through Finite-Memory Determinacy of Games on Infinite Graphs

Arxiv

0+阅读 · 2021年10月4日

Inducing Equilibria via Incentives: Simultaneous Design-and-Play Finds Global Optima

Arxiv

0+阅读 · 2021年10月4日

Simulations and the Lamplighter group

Arxiv

0+阅读 · 2021年10月1日

Robustness of non-computability

Arxiv

0+阅读 · 2021年9月30日

Privacy and Robustness in Federated Learning: Attacks and Defenses

Arxiv

35+阅读 · 2020年12月7日

Attribute-Guided Adversarial Training for Robustness to Natural Perturbations

Arxiv

15+阅读 · 2020年12月3日

Opportunities and Challenges in Deep Learning Adversarial Robustness: A Survey

Arxiv

3+阅读 · 2020年7月3日

Deflecting Adversarial Attacks

Deflecting Adversarial Attacks

Arxiv

8+阅读 · 2020年2月18日

Sequential Attacks on Agents for Long-Term Adversarial Goals

Sequential Attacks on Agents for Long-Term Adversarial Goals

Arxiv

5+阅读 · 2018年7月5日

Adversarial Meta-Learning

Arxiv

7+阅读 · 2018年6月8日

微信扫码咨询专知VIP会员