Pedestrian trajectory prediction in dynamic scenes remains a challenging and critical problem in numerous applications, such as self-driving cars and socially aware robots. Challenges concentrate on capturing pedestrians' motion patterns and social interactions, as well as handling the future uncertainties. Recent studies focus on modeling pedestrians' motion patterns with recurrent neural networks, capturing social interactions with pooling-based or graph-based methods, and handling future uncertainties by using random Gaussian noise as the latent variable. However, they do not integrate specific obstacle avoidance experience (OAE) that may improve prediction performance. For example, pedestrians' future trajectories are always influenced by others in front. Here we propose GTPPO (Graph-based Trajectory Predictor with Pseudo Oracle), an encoder-decoder-based method conditioned on pedestrians' future behaviors. Pedestrians' motion patterns are encoded with a long short-term memory unit, which introduces the temporal attention to highlight specific time steps. Their interactions are captured by a graph-based attention mechanism, which draws OAE into the data-driven learning process of graph attention. Future uncertainties are handled by generating multi-modal outputs with an informative latent variable. Such a variable is generated by a novel pseudo oracle predictor, which minimizes the knowledge gap between historical and ground-truth trajectories. Finally, the GTPPO is evaluated on ETH, UCY and Stanford Drone datasets, and the results demonstrate state-of-the-art performance. Besides, the qualitative evaluations show successful cases of handling sudden motion changes in the future. Such findings indicate that GTPPO can peek into the future.
翻译:动态场景中的Pedestrian轨迹预测在动态场景中仍是一个挑战性和关键问题,许多应用软件中,例如自驾汽车和社会认识的机器人,仍然是具有挑战性和关键意义的问题。挑战集中在捕捉行人的运动模式和社会互动,以及处理未来的不确定性。最近的研究侧重于用反复出现的神经网络模拟行人的运动模式,用基于集合或图形的方法捕捉社会互动,并通过使用随机高山噪音作为潜在变量来处理未来的不确定性。然而,它们并没有纳入可能提高预测性能的具体障碍避免经验(OAE),例如行人的未来轨迹总是受到前面其他人的影响。我们在这里建议GTPPO(用Pseudo Oraledo Oracle的 Graphimotor Timoryor Conventionor ), 一种以行人未来行为为主的编码方法, Pedestriorian的动作模式可以用一个长的短期记忆单元来编码,它能吸引时间关注特定的时间步骤。他们的相互作用被基于基于图表的注意机制所捕捉摸,它让OAE 进入数据-PROdealal Moveal Produal Produal Procial Produdeal Produde lade 。通过这种未来数据流的极前端分析过程来显示一个成功。