Reinforcement learning provides effective results with agents learning from their observations, received rewards, and internal interactions between agents. This study proposes a new open-source MARL framework, called COGMENT, to efficiently leverage human and agent interactions as a source of learning. We demonstrate these innovations by using a designed real-time environment with unmanned aerial vehicles driven by RL agents, collaborating with a human. The results of this study show that the proposed collaborative paradigm and the open-source framework leads to significant reductions in both human effort and exploration costs.
翻译:强化学习能够提供有效的成果,使代理商从其观察、得到的奖励和代理商之间的内部互动中学习。本研究报告提出一个新的开放源码MARL框架,称为COGment,以有效地利用人和代理商的互动作为学习的源泉。我们通过利用由RL代理商驱动的无人驾驶飞行器与人合作而设计的实时环境来展示这些创新。本研究结果表明,拟议的协作模式和开放源码框架导致人类努力和勘探成本的大幅降低。