The field of multi-agent reinforcement learning (MARL) has made considerable progress towards controlling challenging multi-agent systems by employing various learning methods. Numerous of these approaches focus on empirical and algorithmic aspects of the MARL problems and lack a rigorous theoretical foundation. Graphon mean field games (GMFGs) on the other hand provide a scalable and mathematically well-founded approach to learning problems that involve a large number of connected agents. In standard GMFGs, the connections between agents are undirected, unweighted and invariant over time. Our paper introduces colored digraphon mean field games (CDMFGs) which allow for weighted and directed links between agents that are also adaptive over time. Thus, CDMFGs are able to model more complex connections than standard GMFGs. Besides a rigorous theoretical analysis including both existence and convergence guarantees, we provide a learning scheme and illustrate our findings with an epidemics model and a model of the systemic risk in financial markets.
翻译:多试剂强化学习领域通过采用各种学习方法,在控制具有挑战性的多试剂系统方面取得了相当大的进展,其中许多方法侧重于MARL问题的经验和算法方面,缺乏严格的理论基础。另一方面,Greamon平均野外游戏(GMFGs)为学习涉及大量关联剂的问题提供了可扩缩和有数学依据的学习方法。在标准的GMFGs中,代理器之间的联系是无方向的、不加权的和不变化的。我们的论文介绍了彩色平面平均野外游戏(CDMFGs),这些游戏可以使代理人之间的加权和定向联系能够随着时间的适应。因此,CDMFGs能够模拟比标准GFGs更复杂的连接。除了严格的理论分析,包括存在和趋同保证,我们还提供学习计划,用一种流行病模型和金融市场系统性风险模型来说明我们的调查结果。