From large-scale organizations to decentralized political systems, hierarchical strategic decision making is commonplace. We introduce a novel class of structured hierarchical games (SHGs) that formally capture such hierarchical strategic interactions. In an SHG, each player is a node in a tree, and strategic choices of players are sequenced from root to leaves, with root moving first, followed by its children, then followed by their children, and so on until the leaves. A player's utility in an SHG depends on its own decision, and on the choices of its parent and all the tree leaves. SHGs thus generalize simultaneous-move games, as well as Stackelberg games with many followers. We leverage the structure of both the sequence of player moves as well as payoff dependence to develop a gradient-based back propagation-style algorithm, which we call Differential Backward Induction (DBI), for approximating equilibria of SHGs. We provide a sufficient condition for convergence of DBI and demonstrate its efficacy in finding approximate equilibrium solutions to several SHG models of hierarchical policy-making problems.
翻译:从大型组织到分散的政治系统,等级战略决策是司空见惯的。我们引入了新型的结构性等级游戏(SHGs),正式捕捉到这种等级战略互动。在SHG中,每个玩家都是树上的节点,玩家的战略选择从根到叶的顺序,先是根,后是孩子,然后是孩子,然后是树叶。玩家在SHG中的效用取决于自己的决定、父母的选择和所有树叶。SHGs因此将同时移动游戏以及Stackelberg游戏与许多追随者普遍化。我们利用玩家运动的顺序结构以及回报依赖性来开发一种基于梯度的后传播型算法,我们称之为“梯度向后演化 ” (DBI), 以适应SHGs 的偏向后演化。我们为DBI的趋同提供了充分的条件,并展示其为SHG的几类决策问题寻找近平衡解决方案的效力。