Predicting the motion of multiple agents is necessary for planning in dynamic environments. This task is challenging for autonomous driving since agents (e.g. vehicles and pedestrians) and their associated behaviors may be diverse and influence one another. Most prior work have focused on predicting independent futures for each agent based on all past motion, and planning against these independent predictions. However, planning against independent predictions can make it challenging to represent the future interaction possibilities between different agents, leading to sub-optimal planning. In this work, we formulate a model for predicting the behavior of all agents jointly, producing consistent futures that account for interactions between agents. Inspired by recent language modeling approaches, we use a masking strategy as the query to our model, enabling one to invoke a single model to predict agent behavior in many ways, such as potentially conditioned on the goal or full future trajectory of the autonomous vehicle or the behavior of other agents in the environment. Our model architecture employs attention to combine features across road elements, agent interactions, and time steps. We evaluate our approach on autonomous driving datasets for both marginal and joint motion prediction, and achieve state of the art performance across two popular datasets. Through combining a scene-centric approach, agent permutation equivariant model, and a sequence masking strategy, we show that our model can unify a variety of motion prediction tasks from joint motion predictions to conditioned prediction.
翻译:预测多个代理商的动作是动态环境中规划的必要条件。 这项任务对于自主驾驶来说具有挑战性,因为代理商(例如车辆和行人)及其相关行为可能各不相同,相互影响。 多数先前的工作侧重于根据过去的所有动作预测每个代理商的独立未来,并对照这些独立预测进行规划。 但是,根据独立预测进行规划会给代表不同代理商之间未来互动可能性带来挑战,从而导致次优化规划。 在这项工作中,我们制定一个模型,共同预测所有代理商的行为,产生考虑到代理商之间互动的一致未来。在近期语言模型方法的启发下,我们使用遮掩战略作为我们模型的查询,使我们能够援引单一模型来预测代理商行为,以多种方式预测,例如可能以自主车辆的目标或未来轨道或环境其他代理商的行为为条件。 我们的模式架构需要注意将不同道路要素、代理商互动和时间步骤结合起来。 我们评估了我们自主驾驶数据集的方法,以进行边际和联合动作预测,并实现艺术表现状态状态,从两个用户级预测策略的模型,我们能够将一个模型化的模型和模型化的顺序显示我们两个用户级的周期的预测。