Predicting the behavior of road users, particularly pedestrians, is vital for safe motion planning in the context of autonomous driving systems. Traditionally, pedestrian behavior prediction has been realized in terms of forecasting future trajectories. However, recent evidence suggests that predicting higher-level actions, such as crossing the road, can help improve trajectory forecasting and planning tasks accordingly. There are a number of existing datasets that cater to the development of pedestrian action prediction algorithms, however, they lack certain characteristics, such as bird's eye view semantic map information, 3D locations of objects in the scene, etc., which are crucial in the autonomous driving context. To this end, we propose a new pedestrian action prediction dataset created by adding per-frame 2D/3D bounding box and behavioral annotations to the popular autonomous driving dataset, nuScenes. In addition, we propose a hybrid neural network architecture that incorporates various data modalities for predicting pedestrian crossing action. By evaluating our model on the newly proposed dataset, the contribution of different data modalities to the prediction task is revealed. The dataset is available at https://github.com/huawei-noah/PePScenes.
翻译:预测道路使用者,特别是行人的行为,对于自主驾驶系统的安全运动规划至关重要。传统上,行人行为预测是在预测未来轨迹方面实现的。然而,最近的证据表明,预测更高层次的行动,如跨过道路,有助于相应改进轨迹预测和规划任务。现有一些数据集有利于发展行人行动预测算法,但它们缺乏某些特征,如鸟眼眼视语义地图信息、现场物体的3D位置等,这对自主驾驶至关重要。为此,我们提议建立一个新的行人行动预测数据集,通过在流行的自主驾驶数据集(nuScenes)中添加一个 Perframe 2D/3D 边框和行为说明。此外,我们提议建立一个混合神经网络架构,纳入预测行人过境行动的各种数据模式。通过对新提议的数据集的模型进行评估,可以揭示不同数据模式对预测任务的贡献。数据集可在 https://github.com/hua-noah/PEScen查阅 https://github.com/wai-pes/PESen.