Prediction of pedestrian behavior is critical for fully autonomous vehicles to drive in busy city streets safely and efficiently. The future autonomous cars need to fit into mixed conditions with not only technical but also social capabilities. As more algorithms and datasets have been developed to predict pedestrian behaviors, these efforts lack the benchmark labels and the capability to estimate the temporal-dynamic intent changes of the pedestrians, provide explanations of the interaction scenes, and support algorithms with social intelligence. This paper proposes and shares another benchmark dataset called the IUPUI-CSRC Pedestrian Situated Intent (PSI) data with two innovative labels besides comprehensive computer vision labels. The first novel label is the dynamic intent changes for the pedestrians to cross in front of the ego-vehicle, achieved from 24 drivers with diverse backgrounds. The second one is the text-based explanations of the driver reasoning process when estimating pedestrian intents and predicting their behaviors during the interaction period. These innovative labels can enable several computer vision tasks, including pedestrian intent/behavior prediction, vehicle-pedestrian interaction segmentation, and video-to-language mapping for explainable algorithms. The released dataset can fundamentally improve the development of pedestrian behavior prediction models and develop socially intelligent autonomous cars to interact with pedestrians efficiently. The dataset has been evaluated with different tasks and is released to the public to access.
翻译:对行人行为的预测对于完全自主的车辆安全和高效地在繁忙的城市街道上驾驶至关重要。未来的自主汽车需要适应混合条件,不仅具有技术能力,而且具有社会能力。随着更多的算法和数据集已经开发出来,可以预测行人的行为,这些努力缺乏基准标签和能力来估计行人的时间动力意图变化,解释互动场景,支持社会智能的算法。本文件提议并分享另一个基准数据集,称为IUUUI-CSRC Pedestrian Siteuated Intent(PSI)数据,除了综合计算机视觉标签之外,还有两个创新标签。第一个新标签是行人从具有不同背景的24名司机到自我驾驶车辆前的动态意图变化。第二个是,在估计行人意图和预测其互动期间的行为时,对司机推理过程的文字解释。这些创新标签可以使计算机的视觉任务得以实现,包括行人意图/行为预测、车辆行人互动分化(PSIPSI)数据,以及可解释的视频语言图解图绘制,这是由具有不同背景背景背景的行进和智能的智能的行距分析任务。所发布的数据可以改善数据,可以使行进和行进率改进到具有智能的智能的行距分析,可以提高的行距分析,可以改进到可提高的行距分析。