Even if path planning can be solved using standard techniques from dynamic programming and control, the problem can also be approached using probabilistic inference. The algorithms that emerge using the latter framework bear some appealing characteristics that qualify the probabilistic approach as a powerful alternative to the more traditional control formulations. The idea of using estimation on stochastic models to solve control problems is not new and the inference approach considered here falls under the rubric of Active Inference (AI) and Control as Inference (CAI). In this work, we look at the specific recursions that arise from various cost functions that, although they may appear similar in scope, bear noticeable differences, at least when applied to typical path planning problems. We start by posing the path planning problem on a probabilistic factor graph, and show how the various algorithms translate into specific message composition rules. We then show how this unified approach, presented both in probability space and in log space, provides a very general framework that includes the Sum-product, the Max-product, Dynamic programming and mixed Reward/Entropy criteria-based algorithms. The framework also expands algorithmic design options for smoother or sharper policy distributions, including generalized Sum/Max-product algorithm, a Smooth Dynamic programming algorithm and modified versions of the Reward/Entropy recursions. We provide a comprehensive table of recursions and a comparison through simulations, first on a synthetic small grid with a single goal with obstacles, and then on a grid extrapolated from a real-world scene with multiple goals and a semantic map.
翻译:即使利用动态编程和控制的标准技术可以解决路径规划问题,问题也可以通过概率推算来解决。使用后一个框架的算法具有一些具有吸引力的特点,这些特点使概率推算法能够作为较传统的控制配方的有力替代物。使用随机推算模型估算解决控制问题的想法并不新鲜,此处考虑的推论方法属于主动推论和控制推理(AI)和逻辑空间(CAI)的范畴。在这项工作中,我们审视了各种成本函数产生的具体循环,这些功能虽然在范围上看起来相似,但具有明显的差异,至少在应用到典型路径规划问题时。我们首先在概率推算法要素图上提出路径规划问题,表明各种算法如何转化成具体的电文构成规则。然后,我们展示了这种统一方法如何在概率空间和日志空间中呈现出一个非常笼统的框架,包括Sum-Producal产品、动态编程和混合的Reward/Entocrial标准算法。框架还扩展了Sentral-assal-assal-assalal-assalal-assal ligal assal ligal sqal 和Sal-assal-assal-assal-assal-lieval-ligal-liversal listral 和Sqal-lational-lal-lational-lational-lational-lipal-viewsal 和Slational-sal-sal-sal-sal-sal-slational-sal-sal-sal-smal-lational-lational-lational-lational-smal-smal-vial-smal-sal-sal-sal-sal-sal-sal-sal-sal-sal-sal-l-l-sal-sal-sal-sal-sal-vial-vial-lal-sal-l-l-l-l-l-l-l-l-l-lal-sal-l-l-l-l-lal-lal-lal-l和制程和制程和制程和制程和制程流和制程流