Safe navigation of autonomous agents in human centric environments requires the ability to understand and predict motion of neighboring pedestrians. However, predicting pedestrian intent is a complex problem. Pedestrian motion is governed by complex social navigation norms, is dependent on neighbors' trajectories, and is multimodal in nature. In this work, we propose \textbf{SCAN}, a \textbf{S}patial \textbf{C}ontext \textbf{A}ttentive \textbf{N}etwork that can jointly predict socially-acceptable multiple future trajectories for all pedestrians in a scene. SCAN encodes the influence of spatially close neighbors using a novel spatial attention mechanism in a manner that relies on fewer assumptions, is parameter efficient, and is more interpretable compared to state-of-the-art spatial attention approaches. Through experiments on several datasets we demonstrate that our approach can also quantitatively outperform state of the art trajectory prediction methods in terms of accuracy of predicted intent.
翻译:人类中心环境中自主动力的安全导航要求能够理解和预测相邻行人的运动。 但是,预测行人的意图是一个复杂的问题。 远地点运动受复杂的社会导航规范的制约, 取决于邻居的轨迹, 并且是多式的。 在这项工作中, 我们提出\ textbf{ SSCAN}, 一种\ textbf{S} 空间\ textbf{C} ontext \ textb{A}textb{A}textb{ tentionive\ textbf{N}etwork, 它可以共同预测一个场景中所有行人可被社会接受的未来多轨迹。 SCAN 将空间接近邻居的影响编码成新的空间关注机制, 其方式依赖于较少的假设, 参数有效, 并且与最先进的空间关注方法相比更容易解释。 我们通过对几个数据集的实验, 我们的方法也可以在预测意图的准确性方面从数量上超过艺术轨迹预测方法的状态。