In this paper, we study the framework of two-player Stackelberg games played on graphs in which Player 0 announces a strategy and Player 1 responds rationally with a strategy that is an optimal response. While it is usually assumed that Player 1 has a single objective, we consider here the new setting where he has several. In this context, after responding with his strategy, Player 1 gets a payoff in the form of a vector of Booleans corresponding to his satisfied objectives. Rationality of Player 1 is encoded by the fact that his response must produce a Pareto-optimal payoff given the strategy of Player 0. We study the Stackelberg-Pareto Synthesis problem which asks whether Player 0 can announce a strategy which satisfies his objective, whatever the rational response of Player 1. For games in which objectives are either all parity or all reachability objectives, we show that this problem is fixed-parameter tractable and NEXPTIME-complete. This problem is already NP-complete in the simple case of reachability objectives and graphs that are trees.


翻译:在本文中,我们研究了玩家0在图表上播放的双玩家Stackelberg游戏的框架,玩家0在图表中宣布策略,玩家1以最佳回应策略做出合理反应。虽然通常假定玩家1有一个单一目标,但我们在这里考虑他有几个目标的新环境。在这个背景下,玩家1在响应其战略后,以符合其满意目标的布尔人矢量的形式获得报酬。玩家1的合理性被编译为他的反应必须产生符合玩家0策略的Pareto最佳回报。我们研究Stackelberg-Pareto合成问题,其中询问玩家0能否宣布一个满足他目标的战略,而不管玩家1的合理反应是什么。对于目标完全对等或所有可达目标的游戏,我们表明这个问题是固定的参数可绘制和 NEXPTIME-完成的。在可达目标和树形图的简单例子中,这个问题已经是NPNP-完整的。

0
下载
关闭预览

相关内容

Fariz Darari简明《博弈论Game Theory》介绍,35页ppt
专知会员服务
109+阅读 · 2020年5月15日
Stabilizing Transformers for Reinforcement Learning
专知会员服务
58+阅读 · 2019年10月17日
强化学习最新教程,17页pdf
专知会员服务
174+阅读 · 2019年10月11日
已删除
将门创投
5+阅读 · 2018年1月24日
Adversarial Variational Bayes: Unifying VAE and GAN 代码
CreateAMind
7+阅读 · 2017年10月4日
Auto-Encoding GAN
CreateAMind
7+阅读 · 2017年8月4日
VIP会员
相关资讯
已删除
将门创投
5+阅读 · 2018年1月24日
Adversarial Variational Bayes: Unifying VAE and GAN 代码
CreateAMind
7+阅读 · 2017年10月4日
Auto-Encoding GAN
CreateAMind
7+阅读 · 2017年8月4日
Top
微信扫码咨询专知VIP会员