Stackelberg equilibrium is a solution concept that describes optimal strategies to commit: Player 1 (the leader) first commits to a strategy that is publicly announced, then Player 2 (the follower) plays a best response to the leader's commitment. We study the problem of computing Stackelberg equilibria in sequential games with finite and indefinite horizons, when players can play history-dependent strategies. Using the alternate formulation called strategies with memory, we establish that strategy profiles with polynomial memory size can be described efficiently. We prove that there exist a polynomial time algorithm which computes the Strong Stackelberg Equilibrium in sequential games defined on directed acyclic graphs, where the strategies depend only on the memory states from a set which is linear in the size of the graph. We extend this result to games on general directed graphs which may contain cycles. We also analyze the setting for approximate version of Strong Stackelberg Equilibrium in the games with chance nodes.
翻译:Stakkelberg 平衡是一个描述最佳承诺战略的解决办法概念:玩家1(领导者)首先承诺实施公开宣布的战略,然后玩家2(追随者)对领导者的承诺做出最佳反应。我们研究在连续游戏中计算Stackelberg 平衡的问题,游戏者可以玩以历史为依存的战略。我们使用称为记忆战略的替代配方,确定可有效描述多式内存大小的战略剖面。我们证明,在定向圆形图定义的连续游戏中,存在着计算强力Stackelberg Equilibrium的多元时间算法,在连续游戏中,战略仅依赖图尺寸线性的一组记忆状态。我们把这个结果推广到一般定向图上,这些图中可能包含循环。我们还用机会节点分析游戏中强 Stackelberg Equilibrium的大致版本的设置。