We study incentive designs for a class of stochastic Stackelberg games with one leader and a large number of (finite as well as infinite population of) followers. We investigate whether the leader can craft a strategy under a dynamic information structure that induces a desired behavior among the followers. For the finite population setting, under sufficient conditions, we show that there exist symmetric incentive strategies for the leader that attain approximately optimal performance from the leader's viewpoint and lead to an approximate symmetric (pure) Nash best response among the followers. Driving the follower population to infinity, we arrive at the interesting result that in this infinite-population regime the leader cannot design a smooth "finite-energy" incentive strategy, namely, a mean-field limit for such games is not well-defined. As a way around this, we introduce a class of stochastic Stackelberg games with a leader, a major follower, and a finite or infinite population of minor followers, where the leader provides an incentive only for the major follower, who in turn influences the rest of the followers through her strategy. For this class of problems, we are able to establish the existence of an incentive strategy with finitely many minor followers. We also show that if the leader's strategy with finitely many minor followers converges as their population size grows, then the limit defines an incentive strategy for the corresponding mean-field Stackelberg game. Examples of quadratic Gaussian games are provided to illustrate both positive and negative results. In addition, as a byproduct of our analysis, we establish existence of a randomized incentive strategy for the class mean-field Stackelberg games, which in turn provides an approximation for an incentive strategy of the corresponding finite population Stackelberg game.
翻译:我们研究与一位领导人和大量(无限和无限的)追随者一起进行一系列Stackelberg游戏的激励设计。 我们调查领导者能否在动态信息结构下制定战略,让追随者产生理想的行为。 对于有限的人口环境, 在足够的条件下, 我们显示, 领导者有对称激励战略, 从领导的观点中可以达到大致最佳的性能, 并导致追随者之间大致的对称( 纯) Nash 最佳反应。 引导追随者人口走向无限, 我们得出有趣的结果, 在这个无限的人口结构制度中, 领导者无法设计一个顺畅的“ 固定能源” 激励战略, 也就是说, 这种游戏的平均场限制没有被很好地界定。 作为环绕,我们引入了一组与领导者、 主要的追随者、 有限或无限的追随者, 领导者只能为主要追随者提供一种随机的激励因素, 而主要追随者又通过对应的战略影响追随者的其他部分。 对于这个游戏的游戏, 领导者们无法设计一个平坦的策略, 我们也可以以固定的策略来定义一个固定的策略。