Theory of mean-field games (MFGs) has recently experienced an exponential growth. Existing analytical approaches to find Nash equilibrium (NE) solutions for MFGs are, however, by and large restricted to contractive or monotone settings, or rely on the uniqueness of the NE. This paper proposes a new mathematical paradigm to analyze discrete-time MFGs without any of these restrictions. The key idea is to reformulate the problem of finding NE solutions in MFGs as solving an equivalent optimization problem, called MF-OMO, with bounded variables and trivial convex constraints. It is built on the classical work of reformulating a Markov decision process as a linear program, and by adding the consistency constraint for MFGs in terms of occupation measures, and by exploiting the complementarity structure of the linear program. This equivalence framework enables finding multiple (and possibly all) NE solutions of MFGs by standard algorithms such as projected gradient descent, and with convergence guarantees under appropriate conditions. In particular, analyzing MFGs with linear rewards and with mean-field independent dynamics is reduced to solving a finite number of linear programs, hence solvable in finite time. This optimization reformulation of MFGs can be extended to variants of MFGs such as personalized MFGs.
翻译:中场游戏的理论最近出现了指数性增长,但现有为中场游戏找到纳什平衡(NE)解决方案的分析方法基本上局限于合同性或单调或依赖NE的独特性。本文提出了一个新的数学模式,用于分析不设任何这些限制的离散时间MFG(MFG),关键的想法是重新界定在MFG(MF-OMO)中找到NE解决方案的问题,作为解决类似优化问题的办法,称为MF-OMO(MF-OMO),具有受约束变量和微小的锥形限制。它建立在将Markov决策过程重新拟订成线性方案的典型工作的基础上,在占领措施方面增加了对MFG的一致性限制,并利用线性方案的互补性结构。这一等同框架使得能够通过标准算法(如预测梯度下降)和在适当条件下获得融合保证来找到MFG(MFG)的多重(可能全部)NE解决方案。特别是以线性奖励和中中中位独立动态分析MFG(MFMF)的中,因此,可以将MFFMF的定数减为MFF的固定性升级。