In this paper we take the first steps in studying a new approach to synthesis of efficient communication schemes in multi-agent systems, trained via reinforcement learning. We combine symbolic methods with machine learning, in what is referred to as a neuro-symbolic system. The agents are not restricted to only use initial primitives: reinforcement learning is interleaved with steps to extend the current language with novel higher-level concepts, allowing generalisation and more informative communication via shorter messages. We demonstrate that this approach allow agents to converge more quickly on a small collaborative construction task.
翻译:在本文中,我们采取初步步骤,研究一种通过强化学习培训的综合性多试剂系统高效通信计划的新办法,我们把象征性方法与机械学习相结合,称之为神经-精神-共体系统。代理不仅限于使用初始原始系统:强化学习与以新的更高层次概念扩展现有语言的步骤相互交织,允许通过更短的信息进行通俗化和更加信息化的交流。我们证明,这种方法可以使代理更快地将小型的合作建设任务集中到一起。