An eco-system of agents each having their own policy with some, but limited, generalizability has proven to be a reliable approach to increase generalization across procedurally generated environments. In such an approach, new agents are regularly added to the eco-system when encountering a new environment that is outside of the scope of the eco-system. The speed of adaptation and general effectiveness of the eco-system approach highly depends on the initialization of new agents. In this paper we propose different techniques for such initialization and study their impact.
翻译:实践证明,一种由每个机构各自制定政策的生态系统,其政策有一些但有限的通用性,这是在程序产生的环境中增加普遍化的可靠方法;在这种方法中,当遇到生态系统范围以外的新环境时,经常在生态系统中增加新的代理物;生态系统方法的适应速度和一般效力在很大程度上取决于新代理物的初始化。在本文件中,我们为这种初始化和研究其影响提出了不同的技术。