Agent based modelling (ABM) is a computational approach to modelling complex systems by specifying the behaviour of autonomous decision-making components or agents in the system and allowing the system dynamics to emerge from their interactions. Recent advances in the field of Multi-agent reinforcement learning (MARL) have made it feasible to study the equilibrium of complex environments where multiple agents learn simultaneously. However, most ABM frameworks are not RL-native, in that they do not offer concepts and interfaces that are compatible with the use of MARL to learn agent behaviours. In this paper, we introduce a new open-source framework, Phantom, to bridge the gap between ABM and MARL. Phantom is an RL-driven framework for agent-based modelling of complex multi-agent systems including, but not limited to economic systems and markets. The framework aims to provide the tools to simplify the ABM specification in a MARL-compatible way - including features to encode dynamic partial observability, agent utility functions, heterogeneity in agent preferences or types, and constraints on the order in which agents can act (e.g. Stackelberg games, or more complex turn-taking environments). In this paper, we present these features, their design rationale and present two new environments leveraging the framework.
翻译:基于代理的建模(ABM)是模拟复杂系统的一种计算方法,具体指明系统中自主决策组成部分或代理人的行为,并允许系统动态从其相互作用中产生。在多剂加固学习领域最近的进展使得可以研究复杂环境的平衡,多重剂同时学习。然而,大多数反弹道导弹框架不是RL-Native,因为它们没有提供与MARL使用兼容的概念和界面,以学习代理行为。在本文件中,我们引入了新的开放源框架,即Phantom,以弥合反弹道导弹和MARL之间的差距。幻影是基于代理人的复杂多剂系统建模框架,包括但不局限于经济系统和市场。该框架旨在提供工具,以与MARL兼容的方式简化反弹道导弹的规格,包括将动态部分易用性编码、代理使用功能功能功能、代理人偏好或种类的异质以及代理人可以采取行动的顺序限制(例如Stakelberg游戏,或更复杂的现有理论设计环境)。