Systems of interacting agents can often be modeled as contextual games, where the context encodes additional information, beyond the control of any agent (e.g. weather for traffic and fiscal policy for market economies). In such systems, the most likely outcome is given by a Nash equilibrium. In many practical settings, only game equilibria are observed, while the optimal parameters for a game model are unknown. This work introduces Nash Fixed Point Networks (N-FPNs), a class of implicit-depth neural networks that output Nash equilibria of contextual games. The N-FPN architecture fuses data-driven modeling with provided constraints. Given equilibrium observations of a contextual game, N-FPN parameters are learnt to predict equilibria outcomes given only the context. We present an end-to-end training scheme for N-FPNs that is simple and memory efficient to implement with existing autodifferentiation tools. N-FPNs also exploit a novel constraint decoupling scheme to avoid costly projections. Provided numerical examples show the efficacy of N-FPNs on atomic and non-atomic games (e.g. traffic routing).
翻译:互动代理器的系统往往可以模拟为背景游戏,其背景将额外信息编码,超出任何代理器的控制范围(例如交通的天气和市场经济的财政政策)。在这种系统中,最可能的结果是由纳什平衡提供的。在许多实际环境中,只观察到游戏的平衡,而游戏模式的最佳参数则未知。这项工作引入了Nash固定点网络(N-FPNs),这是一种隐性深入的神经网络,产生Nash环境游戏的平衡。N-FPN结构将数据驱动的模型结合到所提供的限制。鉴于对背景游戏的均衡观察,N-FPN参数被学会预测平衡结果,仅根据背景预测平衡结果。我们为N-FPN提供了一种简单和记忆高效的培训计划,以便利用现有的自动差异工具执行。N-FPNs还利用一种新的约束性脱钩计划,以避免昂贵的预测。提供数字例子,说明N-FPN在原子和非原子游戏(例如交通路)上的效率。