One of the most crucial yet challenging tasks for autonomous vehicles in urban environments is predicting the future behaviour of nearby pedestrians, especially at points of crossing. Predicting behaviour depends on many social and environmental factors, particularly interactions between road users. Capturing such interactions requires a global view of the scene and dynamics of the road users in three-dimensional space. This information, however, is missing from the current pedestrian behaviour benchmark datasets. Motivated by these challenges, we propose 1) a novel graph-based model for predicting pedestrian crossing action. Our method models pedestrians' interactions with nearby road users through clustering and relative importance weighting of interactions using features obtained from the bird's-eye-view. 2) We introduce a new dataset that provides 3D bounding box and pedestrian behavioural annotations for the existing nuScenes dataset. On the new data, our approach achieves state-of-the-art performance by improving on various metrics by more than 10% in comparison to existing methods. Upon publishing of this paper, our dataset will be made publicly available.
翻译:对于城市环境中的自治车辆来说,最关键但最具有挑战性的任务之一是预测附近行人的未来行为,特别是在过境点。预测行为取决于许多社会和环境因素,特别是道路使用者之间的互动。了解这种互动需要三维空间对道路使用者的场景和动态进行全球观察。然而,目前行人行为基准数据集中缺少这一信息。受这些挑战的驱动,我们提议:(1) 以图表为基础的预测行人过境行动的新模型。我们的方法模型模拟行人与附近道路使用者的互动,通过分组和相对重要性的比重,利用鸟眼观获得的特征进行互动。(2) 我们推出一个新的数据集,为现有的小行星数据集提供3D边框和行人行为说明。在新数据上,我们的方法通过改进各种计量方法,比现有方法提高10%以上,从而达到最新业绩。在公布本文后,我们将公布我们的数据集。