Motion planning for autonomous robots and vehicles in presence of uncontrolled agents remains a challenging problem as the reactive behaviors of the uncontrolled agents must be considered. Since the uncontrolled agents usually demonstrate multimodal reactive behavior, the motion planner needs to solve a continuous motion planning problem under multimodal behaviors of the uncontrolled agents, which contains a discrete element. We propose a branch Model Predictive Control (MPC) framework that plans over feedback policies to leverage the reactive behavior of the uncontrolled agent. In particular, a scenario tree is constructed from a finite set of policies of the uncontrolled agent, and the branch MPC solves for a feedback policy in the form of a trajectory tree, which shares the same topology as the scenario tree. Moreover, coherent risk measures such as the Conditional Value at Risk (CVaR) are used as a tuning knob to adjust the tradeoff between performance and robustness. The proposed branch MPC framework is tested on an \textit{overtake and lane change} task and a \textit{merging} task for autonomous vehicles in simulation, and on the motion planning of an autonomous quadruped robot alongside an uncontrolled quadruped in experiments. The result demonstrates interesting human-like behaviors, achieving a balance between safety and performance.
翻译:由于不受控制的剂通常表现出多式反应行为,因此运动计划者需要解决不受控制的剂的多式行为下的持续运动规划问题,其中含有一个离散元素。我们提议了一个分支模型预测控制框架(MPC),用于规划反馈政策,以利用不受控制的剂的被动行为。特别是,一个情景树是根据不受控制的剂的有限政策构建的,而分支MPC则以轨迹树的形式解决反馈政策,而轨迹树与情景树具有相同的地形特征。此外,一致的风险措施,如风险条件值(CVaR),被用作调整性能和稳健性之间的权衡的调控点。拟议的分支预测控制框架是用一个 knextit{ overtake and path change} 任务和自动车辆模拟中的\ textitutit{mining}任务来测试的,以及一个自主四重力机器人动作的动作规划,在不受控制的实验中显示一种令人感兴趣的动作。