The analysis and control of large-population systems is of great interest to diverse areas of research and engineering, ranging from epidemiology over robotic swarms to economics and finance. An increasingly popular and effective approach to realizing sequential decision-making in multi-agent systems is through multi-agent reinforcement learning, as it allows for an automatic and model-free analysis of highly complex systems. However, the key issue of scalability complicates the design of control and reinforcement learning algorithms particularly in systems with large populations of agents. While reinforcement learning has found resounding empirical success in many scenarios with few agents, problems with many agents quickly become intractable and necessitate special consideration. In this survey, we will shed light on current approaches to tractably understanding and analyzing large-population systems, both through multi-agent reinforcement learning and through adjacent areas of research such as mean-field games, collective intelligence, or complex network theory. These classically independent subject areas offer a variety of approaches to understanding or modeling large-population systems, which may be of great use for the formulation of tractable MARL algorithms in the future. Finally, we survey potential areas of application for large-scale control and identify fruitful future applications of learning algorithms in practical systems. We hope that our survey could provide insight and future directions to junior and senior researchers in theoretical and applied sciences alike.
翻译:对大型人口系统的分析和控制,对于从机器人群流行病学到经济和金融等不同研究和工程领域来说,对大型人口系统的分析和控制是十分有意义的,从对机器人群的流行病学到经济和金融。在多试剂系统中实现连续决策的日益流行和有效的方法是通过多试剂强化学习来实现的,因为它允许对高度复杂的系统进行自动和不使用模型的分析。然而,可扩缩性的关键问题使控制和强化学习算法的设计复杂化,特别是在拥有大量物剂的系统里。虽然加强学习发现在许多情况中与少数物剂相比,经验非常成功,但许多物剂的问题很快变得难以解决,需要特别考虑。在这次调查中,我们将说明目前通过多试剂强化学习和通过类似研究领域,例如中场游戏、集体情报或复杂的网络理论,来深入理解和分析大型人口系统。这些典型的独立主题领域为了解或建模大型人口系统提供了各种方法,这些方法对于今后制定可扩增的MAL算法可能大有用处,因此需要特别考虑。我们将通过多试剂强化技术强化研究,阐明目前对大型人口系统进行深入了解和分析的方法,并查明今后对高级研究人员进行实际研究的用途。