Reinforcement learning methods have recently been very successful at performing complex sequential tasks like playing Atari games, Go and Poker. These algorithms have outperformed humans in several tasks by learning from scratch, using only scalar rewards obtained through interaction with their environment. While there certainly has been considerable independent innovation to produce such results, many core ideas in reinforcement learning are inspired by phenomena in animal learning, psychology and neuroscience. In this paper, we comprehensively review a large number of findings in both neuroscience and psychology that evidence reinforcement learning as a promising candidate for modeling learning and decision making in the brain. In doing so, we construct a mapping between various classes of modern RL algorithms and specific findings in both neurophysiological and behavioral literature. We then discuss the implications of this observed relationship between RL, neuroscience and psychology and its role in advancing research in both AI and brain science.
翻译:强化学习方法最近非常成功地完成了诸如玩Atari游戏、Go和Poker等复杂的连续任务。这些算法通过从零起步学习,只利用与环境互动获得的天平奖励,在几项任务中比人类表现得更好。虽然在产生这种结果方面确实有相当大的独立创新,但在强化学习的许多核心想法都受到动物学习、心理学和神经科学现象的启发。在本文件中,我们全面审查了神经科学和心理学方面的大量发现,这些发现证明强化了学习,成为模拟大脑学习和决策的有希望的候选者。在这样做的时候,我们在各种现代RL算法和神经生理学和行为学文献中的具体结果之间绘制了一张地图。然后我们讨论了观察到的RL、神经科学和心理学之间的关系及其在推进AI和脑科学研究中的作用。