Reinforcement learning methods have recently been very successful at performing complex sequential tasks like playing Atari games, Go and Poker. These algorithms have outperformed humans in several tasks by learning from scratch, using only scalar rewards obtained through interaction with their environment. While there certainly has been considerable independent innovation in the area to produce such results, many core ideas in reinforcement learning are inspired by phenomena of animal learning, psychology and neuroscience. In this paper, we comprehensively review a number of findings in neuroscience and psychology that provide evidence for the plausibility of reinforcement learning being a promising model for phenomena in human learning, decision making and behavior. We do so by a) exploring neuroscientific evidence for various classes of RL algorithms along with their building blocks, and b) mapping specific RL ideas to findings in neuroscience and psychology. Finally, we discuss the implications of these findings and their role in advancing research in both AI and brain science.
翻译:强化学习方法最近非常成功地完成了诸如玩Atari游戏、Go和Poker等复杂的相继任务。这些算法通过从零开始学习,只使用与环境互动获得的天平奖励,在几项任务中比人类表现得更好。虽然在这一领域中确实有大量独立的创新来产生这种结果,但加强学习的许多核心想法是动物学习、心理学和神经科学等现象所启发的。在本文件中,我们全面审查了神经科学和心理学方面的一些发现,这些发现证明强化学习是人类学习、决策和行为中一种有希望的现象的典范。我们这样做的方法是:(a) 探索各种RL算法的神经科学证据,连同其构件,以及(b) 将具体的RL概念与神经科学与心理学的发现联系起来。最后,我们讨论了这些发现的影响及其在推进AI和脑科学研究中的作用。