Pedestrian trajectory prediction in urban scenarios is essential for automated driving. This task is challenging because the behavior of pedestrians is influenced by both their own history paths and the interactions with others. Previous research modeled these interactions with pooling mechanisms or aggregating with hand-crafted attention weights. In this paper, we present the Social Interaction-Weighted Spatio-Temporal Convolutional Neural Network (Social-IWSTCNN), which includes both the spatial and the temporal features. We propose a novel design, namely the Social Interaction Extractor, to learn the spatial and social interaction features of pedestrians. Most previous works used ETH and UCY datasets which include five scenes but do not cover urban traffic scenarios extensively for training and evaluation. In this paper, we use the recently released large-scale Waymo Open Dataset in urban traffic scenarios, which includes 374 urban training scenes and 76 urban testing scenes to analyze the performance of our proposed algorithm in comparison to the state-of-the-art (SOTA) models. The results show that our algorithm outperforms SOTA algorithms such as Social-LSTM, Social-GAN, and Social-STGCNN on both Average Displacement Error (ADE) and Final Displacement Error (FDE). Furthermore, our Social-IWSTCNN is 54.8 times faster in data pre-processing speed, and 4.7 times faster in total test speed than the current best SOTA algorithm Social-STGCNN.
翻译:城市情景中的Pedestrial轨迹预测对于自动驾驶至关重要。 这项任务具有挑战性, 因为行人的行为既受自身历史路径的影响,也受与其他人的互动影响。 先前的研究以这些互动模式为模型,使用集合机制或手工制作的注意力重量来模拟这些互动。 在本文件中,我们介绍了社会互动-智能Spatio-Tempio-Tempal Convolution Convolual Neal网络(社会-IWSTCNN),其中包括空间和时间特点。 我们提议了一个新颖的设计,即社会互动提取器,以学习行人的空间和社会互动特征。 大部分以前的工作使用了包括五个场景的ETH和UCY数据集,但不涵盖广泛的城市交通情况,用于培训和评估。 在本文件中,我们使用最近发布的城市交通情景中的大规模Waymo Open数据集,其中包括374个城市培训场景和76个城市测试场景,用以分析我们提议的算法与最新(SOTA)模式相比的绩效。 我们的算法高于SATA算法,例如社会- LESTMTM、社会- GS-GAN-TER 和SIS- AS- Rest- Risax 时间中, 和SIS- Rislate- Rislate- sal- 和S- dislate- sal- sal- syx- slate- sal- sal- syxxxxxxxxxxxxxxxxxxxxxx 。