Model-based planning and prospection are widely studied in both cognitive neuroscience and artificial intelligence (AI), but from different perspectives - and with different desiderata in mind (biological realism versus scalability) that are difficult to reconcile. Here, we introduce a novel method to plan in large POMDPs - Active Tree Search - that combines the normative character and biological realism of a leading planning theory in neuroscience (Active Inference) and the scalability of Monte-Carlo methods in AI. This unification is beneficial for both approaches. On the one hand, using Monte-Carlo planning permits scaling up the biologically grounded approach of Active Inference to large-scale problems. On the other hand, the theory of Active Inference provides a principled solution to the balance of exploration and exploitation, which is often addressed heuristically in Monte-Carlo methods. Our simulations show that Active Tree Search successfully navigates binary trees that are challenging for sampling-based methods, problems that require adaptive exploration, and the large POMDP problem Rocksample. Furthermore, we illustrate how Active Tree Search can be used to simulate neurophysiological responses (e.g., in the hippocampus and prefrontal cortex) of humans and other animals that contain large planning problems. These simulations show that Active Tree Search is a principled realisation of neuroscientific and AI theories of planning, which offers both biological realism and scalability.
翻译:在认知神经科学和人工智能(AI)中广泛研究基于模型的规划和前景,但从不同的角度研究模型的规划和前景,但从不同的角度看,以及从难以调和的不同思维上(生物现实主义与伸缩性)的不同偏差(生物现实主义与伸缩性)来研究。在这里,我们引入了一种新的方法,在大型POMDPs(积极树搜索)中进行规划,将神经科学(感知误判)中主要规划理论的规范性和生物现实主义与蒙特-卡洛方法的伸缩性结合起来。这种统一对这两种方法都有好处。一方面,利用蒙特-卡洛(Monte-Carlo)计划可以扩大以生物为基础的积极推断方法对大规模问题(生物现实性推论与伸缩性)的生物学方法。另一方面,主动推论理论为探索与开发之间的平衡提供了一种原则性解决方案,这常常在蒙特-卡洛(Monte-Carlo)方法中被过分地论述。我们的模拟表明,积极的树木搜索成功地引导了对基于采样方法的挑战、需要适应性探索的问题,以及大型的POMDP问题岩标。此外,我们还说明了如何利用积极的树木模拟神经物理和结构的深度反应的大规模预测。