This work considers a novel information design problem and studies how the craft of payoff-relevant environmental signals solely can influence the behaviors of intelligent agents. The agents' strategic interactions are captured by an incomplete-information Markov game, in which each agent first selects one environmental signal from multiple signal sources as additional payoff-relevant information and then takes an action. There is a rational information designer (principal) who possesses one signal source and aims to control the equilibrium behaviors of the agents by designing the information structure of her signals sent to the agents. An obedient principle is established which states that it is without loss of generality to focus on the direct information design when the information design incentivizes each agent to select the signal sent by the principal, such that the design process avoids the predictions of the agents' strategic selection behaviors. Based on the obedient principle, we introduce the design protocol given a goal of the principal referred to as obedient implementability (OIL) and study a Myersonian information design that characterizes the OIL in a class of obedient sequential Markov perfect Bayesian equilibria (O-SMPBE). A framework is proposed based on an approach which we refer to as the fixed-point alignment that incentivizes the agents to choose the signal sent by the principal, makes sure that the agents' policy profile of taking actions is the policy component of an O-SMPBE, and the principal's goal is achieved. The proposed approach can be applied to elicit desired behaviors of multi-agent systems in competing as well as cooperating settings and be extended to heterogeneous stochastic games in the complete- and the incomplete-information environments.
翻译:这项工作考虑到一个新的信息设计问题和研究,与支付相关的环境信号的手法完全能够影响智能剂的行为。代理商的战略互动被一个不完整的信息马可夫游戏所捕捉,在这个游戏中,每个代理商首先从多个信号源中选择一个环境信号作为额外的与支付相关的补充信息,然后采取行动。有一个理性的信息设计师(Principal)拥有一个信号源,目的是通过设计向代理商发送的信号的信息结构来控制代理商的平衡行为。确立了一个服从原则,该原则指出,当信息设计激励每个代理商选择委托人发送的信号时,直接信息设计是没有失失失的。在设计过程中,每个代理商首先从多个信号中选择一个环境,即根据执行原则,我们引入了设计协议,本公司的一个目标被称为服从性执行(OIL),并研究Myersonian信息设计,这是OIL在顺从的完整Bayesian equiliria (O-SMBEE) 时,一个框架是我们所推荐的精细的精细的代理商,这个框架是用来选择一个方向的精细的精细的精细的精细的动作,,这个框架是我们所发送的精细的精细的精细的精细的精细的精确的精细的精细的精细的精细的精细的精细的精细的精细的精细的精细的逻辑,它是用来用来用来作为我们所所所所所所 的精细的精细的精细的精细的精细的精细的精细的精细的精细的精细的精细的精细的精细的精细的精细的精细的精细的精细的精细的精细的精细的精细的精细的精细的精细的精细的精细的精细的精细的精细的精细的精细的精细的精细的精细的精细的精细的精细的精细的精细的精细的精细的精细的精细的精细的精细的精细的精细的精细的精细的精细的精细的精细的精细的精细的精细的精细的精细的精细的精