Recent experimental observations have shown that the reactivation of hippocampal place cells (PC) during sleep or immobility depicts trajectories that can go around barriers and can flexibly adapt to a changing maze layout. Such layout-conforming replay sheds a light on how the activity of place cells supports the learning of flexible navigation of an animal in a dynamically changing maze. However, existing computational models of replay fall short of generating layout-conforming replay, restricting their usage to simple environments, like linear tracks or open fields. In this paper, we propose a computational model that generates layout-conforming replay and explains how such replay drives the learning of flexible navigation in a maze. First, we propose a Hebbian-like rule to learn the inter-PC synaptic strength during exploring a maze. Then we use a continuous attractor network (CAN) with feedback inhibition to model the interaction among place cells and hippocampal interneurons. The activity bump of place cells drifts along a path in the maze, which models layout-conforming replay. During replay in rest, the synaptic strengths from place cells to striatal medium spiny neurons (MSN) are learned by a novel dopamine-modulated three-factor rule to store place-reward associations. During goal-directed navigation, the CAN periodically generates replay trajectories from the animal's location for path planning, and the trajectory leading to a maximal MSN activity is followed by the animal. We have implemented our model into a high-fidelity virtual rat in the MuJoCo physics simulator. Extensive experiments have demonstrated that its superior flexibility during navigation in a maze is due to a continuous re-learning of inter-PC and PC-MSN synaptic strength.
翻译:最近实验观测显示,在睡眠或运动状态下恢复河马运动地点细胞(PC)在睡眠或运动状态期间的恢复(PC)可以描绘出一个轨迹,可以围绕屏障而灵活地适应变化中的迷宫布局。这种布局化的重放可以显示一个亮点,说明地方细胞的活动如何支持在动态变化的迷宫中学习动物的灵活导航。然而,现有的回放计算模型在生成布局整形的回放功能方面做得不够,将其使用限制在简单的环境,如线性轨道或开地。在本文中,我们提议了一个计算模型模型,产生布局整形的复放功能,并解释这种回放是如何驱动在迷宫内学习弹性导航的。首先,我们提议了一个类似于Hebbian的规则,用以在探索迷宫迷宫时学习PC之间的同步强度。然后,我们用一个连续的吸引网络(CAN)来模拟地方细胞和运动型中间中中中流体内内积体的相互作用。 活动中位细胞会沿一个路径流流流,用来模拟变动的变形变形位置,然后制的轨道,在中进行调的轨道运动内运动运动运动运动运动运动运动运动运动运动运动运动运动运动运动运动运动运动运动运动,在运动运动运动运动运动,在休息期间进行。在运动运动期间,在运动内进行。在运动内进行中,在运动内,在运动内,在运动内,在运动内,在运动内进行运动内进行。