Sampling-based model predictive control (MPC) optimization methods, such as Model Predictive Path Integral (MPPI), have recently shown promising results in various robotic tasks. However, it might produce an infeasible trajectory when the distributions of all sampled trajectories are concentrated within high-cost even infeasible regions. In this study, we propose a new method called log-MPPI equipped with a more effective trajectory sampling distribution policy which significantly improves the trajectory feasibility in terms of satisfying system constraints. The key point is to draw the trajectory samples from the normal log-normal (NLN) mixture distribution, rather than from Gaussian distribution. Furthermore, this work presents a method for collision-free navigation in unknown cluttered environments by incorporating the 2D occupancy grid map into the optimization problem of the sampling-based MPC algorithm. We first validate the efficiency and robustness of our proposed control strategy through extensive simulations of 2D autonomous navigation in different types of cluttered environments as well as the cartpole swing-up task. We further demonstrate, through real-world experiments, the applicability of log-MPPI for performing a 2D grid-based collision-free navigation in an unknown cluttered environment, showing its superiority to be utilized with the local costmap without adding additional complexity to the optimization problem. A video demonstrating the real-world and simulation results is available at https://youtu.be/_uGWQEFJSN0.
翻译:以抽样为基础的模型预测控制(MPC)优化模型模型模型(MPC)的预测控制(MPC)优化方法,例如模型预测路径综合(MPPI),最近显示各种机器人任务有希望的结果,然而,如果所有抽样轨迹的分布集中在高成本甚至不可行的区域,可能会产生不可行的轨迹,因为所有抽样轨迹的分布都集中在高成本甚至不可行的区域。在本研究中,我们提出了一个称为log-MPPI的新方法,配有更有效的轨迹抽样分配政策,该政策大大改进了在满足系统制约方面轨迹的可行性。关键是要从正常的日志-正常(NLN)混合物分布而不是高斯的分布中抽取轨迹样本。此外,这项工作通过将2D占用网格图纳入基于采样的MPC算法的最佳问题,从而在未知的环境下无碰撞导航-MPPI,在不以成本为基础的本地导航-MPPI 上,进一步展示了在不以实际世界为主的轨道-MPPI 和以成本为主的升级变压环境中的可操作。