In this paper, we consider a discrete-time Stackelberg graphon mean field game with a finite number of leaders, a finite number of major followers and an infinite number of minor followers. The leaders and the followers each observe types privately that evolve as conditionally independent controlled Markov processes. The leaders and the followers sequentially make strategic decisions where each follower's actions affect her neighbors, which is captured in a graph generated by a known graphon, however, the leaders' actions affect everyone. The leaders are of "Stackelberg" kind which means each of them commits to a dynamic policy and all the followers(both major and minor) best respond to that policy and each other. Knowing that the minor followers would best respond (in the sense of a mean-field game) while the major followers will best respond (in the sense of NAsh) based on their policies, each leader chooses a policy that maximizes her reward knowing that other leader's are doing the same. We refer to the resulting outcome as a Stackelberg Graphon Mean Field Equilibrium with multiple leaders (SGMFE-ML). In this paper, we provide a master equation of this game that allows one to compute all SGMFE-ML. We further extend this notion to the case when there are an infinite number of leaders.
翻译:在本文中,我们认为一个离散时间的Stackelberg石墨图意味着场外游戏,由数量有限的领导人、数量有限的主要追随者和数量极小的追随者组成。领导人和追随者各自观察作为有条件独立控制的Markov过程而演变的私人类型。领导人和追随者依次作出战略决定,让每个追随者的行动影响到她的邻居,但领导人的行动却影响到每个人。领导人们的“斯塔克尔贝格”是一种“斯塔克尔贝格”类型的“斯塔克尔贝格”意味着他们每个人都承诺执行动态政策,所有追随者(主要和次要的)都最好对这项政策和其他政策作出反应。在本文中,认识到小追随者最好(从中一种平均游戏的意义上)作出反应,而主要追随者则根据他们的政策(从纳什的意义上)作出最佳反应。 每位领导人选择了一种政策,使她知道其他领导人也这样做了。我们称之为“斯塔克尔贝格·古地平面场的平衡”, 意味着他们每个人都承诺执行动态政策,所有(主要和次要的)追随者都对政策作出反应。在本文中,我们提供了这个无限的游戏的主方程式,让这个游戏能够把这个游戏的游戏扩大到。