Pedestrian behavior prediction is one of the major challenges for intelligent driving systems in urban environments. Pedestrians often exhibit a wide range of behaviors and adequate interpretations of those depend on various sources of information such as pedestrian appearance, states of other road users, the environment layout, etc. To address this problem, we propose a novel multi-modal prediction algorithm that incorporates different sources of information captured from the environment to predict future crossing actions of pedestrians. The proposed model benefits from a hybrid learning architecture consisting of feedforward and recurrent networks for analyzing visual features of the environment and dynamics of the scene. Using the existing 2D pedestrian behavior benchmarks and a newly annotated 3D driving dataset, we show that our proposed model achieves state-of-the-art performance in pedestrian crossing prediction.
翻译:为解决这一问题,我们提议采用新的多模式预测算法,纳入从环境中收集的不同信息来源,以预测行人今后跨行人的行动。提议的模型得益于混合学习结构,由前进和经常性网络组成,用以分析环境和场景的视觉特征。我们利用现有的2D行人行为基准和一个新的3D驾驶数据集,显示我们提议的模型实现了行人跨行人跨行预测的最新性能。