In this work we propose Neuro-Nav, an open-source library for neurally plausible reinforcement learning (RL). RL is among the most common modeling frameworks for studying decision making, learning, and navigation in biological organisms. In utilizing RL, cognitive scientists often handcraft environments and agents to meet the needs of their particular studies. On the other hand, artificial intelligence researchers often struggle to find benchmarks for neurally and biologically plausible representation and behavior (e.g., in decision making or navigation). In order to streamline this process across both fields with transparency and reproducibility, Neuro-Nav offers a set of standardized environments and RL algorithms drawn from canonical behavioral and neural studies in rodents and humans. We demonstrate that the toolkit replicates relevant findings from a number of studies across both cognitive science and RL literatures. We furthermore describe ways in which the library can be extended with novel algorithms (including deep RL) and environments to address future research needs of the field.
翻译:在这项工作中,我们建议Neuro-Nav(一个开放源码图书馆,用于神经上可信的强化学习),Ruro-Nav(RL)是研究生物生物中的决策、学习和导航的最常见示范框架。在利用RL时,认知科学家往往使用手工艺环境和代理人来满足其特定研究的需要。另一方面,人工智能研究人员常常努力寻找神经和生物上合理代表性和行为的基准(例如在决策或导航中)。为了以透明和可再生的方式简化这两个领域的这一过程,Neuro-Nav提供了一套标准化的环境和RL算法,这些算法来自对老鼠和人类的罐体行为学和神经学研究。我们证明,该工具包复制了从一系列关于认知科学和RL文献的研究中得出的相关结果。我们进一步描述了如何利用新的算法(包括深RL)和环境扩展图书馆,以满足该领域的未来研究需要。