Pedestrian crossing prediction has been a topic of active research, resulting in many new algorithmic solutions. While measuring the overall progress of those solutions over time tends to be more and more established due to the new publicly available benchmark and standardized evaluation procedures, knowing how well existing predictors react to unseen data remains an unanswered question. This evaluation is imperative as serviceable crossing behavior predictors should be set to work in various scenarii without compromising pedestrian safety due to misprediction. To this end, we conduct a study based on direct cross-dataset evaluation. Our experiments show that current state-of-the-art pedestrian behavior predictors generalize poorly in cross-dataset evaluation scenarii, regardless of their robustness during a direct training-test set evaluation setting. In the light of what we observe, we argue that the future of pedestrian crossing prediction, e.g. reliable and generalizable implementations, should not be about tailoring models, trained with very little available data, and tested in a classical train-test scenario with the will to infer anything about their behavior in real life. It should be about evaluating models in a cross-dataset setting while considering their uncertainty estimates under domain shift.
翻译:Pedestrian交叉路口预测是积极研究的一个主题,导致了许多新的算法解决办法。由于新的公开的基准和标准化评价程序,衡量这些解决办法的总体进展往往越来越容易确定,但了解现有预测者对无形数据反应的好程度仍是一个未解的问题。这一评估是绝对必要的,因为可以使用的跨行为预测者应该被安排在各种塞斯那里工作,而不会因为误解而损害行人的安全。为此,我们根据直接的交叉数据集评价进行了一项研究。我们的实验表明,目前最先进的行人行为预测者在交叉数据集评价 " 塞纳里 " 中一般地没有很好地概括地反映交叉数据集评价 " 。根据我们观察的情况,我们主张行人跨行人预测的未来,例如可靠和可普遍适用的实施,不应该是调整模型,用极少的数据培训,在典型的火车测试假设中测试,并愿意推断他们在现实生活中的任何行为。它应该是在考虑其不确定性变化领域估计数的同时,在交叉数据设置中评价模型。