To achieve seamless human-robot interactions, robots need to intimately reason about complex interaction dynamics and future human behaviors within their motion planning process. However, there is a disconnect between state-of-the-art neural network-based human behavior models and robot motion planners -- either the behavior models are limited in their consideration of downstream planning or a simplified behavior model is used to ensure tractability of the planning problem. In this work, we present a framework that fuses together the interpretability and flexibility of trajectory optimization (TO) with the predictive power of state-of-the-art human trajectory prediction models. In particular, we leverage gradient information from data-driven prediction models to explicitly reason about human-robot interaction dynamics within a gradient-based TO problem. We demonstrate the efficacy of our approach in a multi-agent scenario whereby a robot is required to safely and efficiently navigate through a crowd of up to ten pedestrians. We compare against a variety of planning methods, and show that by explicitly accounting for interaction dynamics within the planner, our method offers safer and more efficient behaviors, even yielding proactive and nuanced behaviors such as waiting for a pedestrian to pass before moving.
翻译:为了实现无缝的人类机器人互动,机器人需要在其运动规划过程中深入地了解复杂的互动动态和未来人类行为。然而,基于人类行为的最新神经网络模型与机器人运动规划者之间存在脱节 -- -- 要么行为模型对下游规划的考虑有限,要么使用简化行为模型来确保规划问题的可移动性。在这项工作中,我们提出了一个框架,将轨迹优化的可解释性和灵活性与最新人类轨迹预测模型的预测力结合起来。特别是,我们利用数据驱动的预测模型的梯度信息,在基于梯度的问题中明确解释人类机器人互动动态。我们展示了我们多试样情景中的方法的功效,即机器人需要安全而高效地在多达10个行人人群中行驶。我们比较了各种规划方法,并表明,通过明确计算规划者内部的互动动态,我们的方法提供了更安全、更高效的行为,甚至产生了积极和细致的行为,例如等待行人行走前行走。