In this paper, we provide an overview of the existing methods for integrating human advice into a Reinforcement Learning process. We first propose a taxonomy of the different forms of advice that can be provided to a learning agent. We then describe the methods that can be used for interpreting advice when its meaning is not determined beforehand. Finally, we review different approaches for integrating advice into the learning process.
翻译:在本文中,我们概述了将人类建议纳入强化学习进程的现有方法,我们首先建议对可以提供给学习机构的不同形式建议进行分类,然后我们描述在建议的意义没有事先确定时可用于解释建议的方法,最后,我们审查将建议纳入学习进程的不同方法。