The ability to plan ahead efficiently is key for both living organisms and artificial systems. Model-based planning and prospection are widely studied in cognitive neuroscience and artificial intelligence (AI), but from different perspectives--and with different desiderata in mind (biological realism versus scalability) that are difficult to reconcile. Here, we introduce a novel method to plan in POMDPs--Active Inference Tree Search (AcT)--that combines the normative character and biological realism of a leading planning theory in neuroscience (Active Inference) and the scalability of tree search methods in AI. This unification enhances both approaches. On the one hand, tree searches enable the biologically grounded, first principle method of active inference to be applied to large-scale problems. On the other hand, active inference provides a principled solution to the exploration-exploitation dilemma, which is often addressed heuristically in tree search methods. Our simulations show that AcT successfully navigates binary trees that are challenging for sampling-based methods, problems that require adaptive exploration, and the large POMDP problem 'RockSample'--in which AcT reproduces state-of-the-art POMDP solutions. Furthermore, we illustrate how AcT can be used to simulate neurophysiological responses (e.g., in the hippocampus and prefrontal cortex) of humans and other animals that solve large planning problems. These numerical analyses show that Active Tree Search is a principled realisation of neuroscientific and AI planning theories, which offer both biological realism and scalability.
翻译:对活生物体和人工系统来说,有效提前规划的能力是关键。基于模型的规划和前景规划在认知神经科学和人工智能(AI)中得到了广泛研究,但从不同的视角和不同的思维偏差(生物现实主义与伸缩性)中,很难调和。在这里,我们在POMDPs-感动推断树搜索(AcT)中引入了一种新的方法,将神经科学(感知推断)中领先规划理论的规范性和生物现实主义结合起来。在AI中,树木搜索方法的伸缩性得到了广泛的研究。这一统一加强了这两种方法。一方面,树木搜索使生物基础得以将主动推断的首要方法(生物现实主义与伸缩性)应用于大规模问题。另一方面,积极的推论为探索前的难题提供了一种原则性解决方案,这通常在树类搜索方法中以超常论方式加以解决。我们的模拟表明,Act成功地引导了基于采样方法的二进式树,需要适应性探索的问题,以及大规模的POMDP问题, 大规模的Sal-real-real-alalalalalal 分析,我们使用了另一种直径-al-alial-alial-alial-deal-deal-de-Prolial-Prolial-de-deal-Procial-ex-ex-Silence-ex-ex-ex-ex-ex-Silence-Sex-ex-ex-ex-ex-我们使用了了另一种方法。