We present a novel method in this work to address the problem of multi-vehicle conflict resolution in highly constrained spaces. A high-fidelity optimal control problem is formulated to incorporate nonlinear, non-holonomic vehicle dynamics and exact collision avoidance constraints. Despite being high-dimensional and non-convex, we can obtain an optimal solution by learning configuration strategies with reinforcement learning (RL) in a simplified discrete environment and approaching high-quality initial guesses progressively. The simulation results show that our method can explore efficient actions to resolve conflicts in confined space and generate dexterous maneuvers that are both collision-free and kinematically feasible.
翻译:在这项工作中,我们提出了一个解决高度受限空间多车辆冲突解决办法的新方法,制定了高度忠诚的最佳控制问题,以纳入非线性、非超光层车辆动态和精确避免碰撞的制约因素。 尽管我们是高维和非曲线的,但我们可以通过学习配置战略,在简化的离散环境中学习强化学习(RL),并逐步接近高质量的初步猜测,从而获得最佳解决方案。模拟结果表明,我们的方法可以探索有效行动,解决封闭空间的冲突,并产生不碰撞和运动上可行的灵活动作。