We propose a hybrid combination of active inference and behavior trees (BTs) for reactive action planning and execution in dynamic environments, showing how robotic tasks can be formulated as a free-energy minimization problem. The proposed approach allows to handle partially observable initial states and improves the robustness of classical BTs against unexpected contingencies while at the same time reducing the number of nodes in a tree. In this work, the general nominal behavior is specified offline through BTs, where a new type of leaf node, the prior node, is introduced to specify the desired state to be achieved rather than an action to be executed as typically done in BTs. The decision of which action to execute to reach the desired state is performed online through active inference. This results in the combination of continual online planning and hierarchical deliberation, that is an agent is able to follow a predefined offline plan while still being able to locally adapt and take autonomous decisions at runtime. The properties of our algorithm, such as convergence and robustness, are thoroughly analyzed, and the theoretical results are validated in two different mobile manipulators performing similar tasks, both in a simulated and real retail environment.
翻译:我们建议将活跃的推断与行为树(BTs)混合起来,以便在动态环境中进行反应性行动规划和执行,表明机器人任务如何被设计成一个自由能源最小化的问题。拟议方法允许处理部分可观测的初步状态,提高传统BTs对意外意外事故的稳健性,同时减少树上节点的数量。在这项工作中,一般名义行为通过BTs在离线时通过BTs具体指明,引入了一种新的叶节点,即前节点,以具体说明需要达到的状态,而不是像在BTs通常所做的那样执行的行动。决定执行达到预期状态的行动是通过积极推断在网上完成的。这导致连续的在线规划和等级评议相结合,该代理人能够遵循预先确定的离线计划,同时仍然能够本地适应并在运行时做出自主决定。我们算法的特性,例如趋同性和稳健性,经过彻底分析,理论结果在两个不同的移动操纵器中得到了验证,在模拟和真实零售环境中都执行类似的任务。