To accurately predict future positions of different agents in traffic scenarios is crucial for safely deploying intelligent autonomous systems in the real-world environment. However, it remains a challenge due to the behavior of a target agent being affected by other agents dynamically and there being more than one socially possible paths the agent could take. In this paper, we propose a novel framework, named Dynamic Context Encoder Network (DCENet). In our framework, first, the spatial context between agents is explored by using self-attention architectures. Then, the two-stream encoders are trained to learn temporal context between steps by taking the respective observed trajectories and the extracted dynamic spatial context as input. The spatial-temporal context is encoded into a latent space using a Conditional Variational Auto-Encoder (CVAE) module. Finally, a set of future trajectories for each agent is predicted conditioned on the learned spatial-temporal context by sampling from the latent space, repeatedly. DCENet is evaluated on one of the most popular challenging benchmarks for trajectory forecasting Trajnet and reports a new state-of-the-art performance. It also demonstrates superior performance evaluated on the benchmark inD for mixed traffic at intersections. A series of ablation studies is conducted to validate the effectiveness of each proposed module. Our code is available at https://github.com/wtliao/DCENet.
翻译:准确预测交通场景中不同代理商未来的位置,对于在现实世界环境中安全部署智能自主系统至关重要。然而,由于目标代理商的行为受到其他代理商的动态影响,而且该代理商可能走的社会途径不止一种,因此这仍然是一项挑战。在本文件中,我们提议了一个新框架,名为动态环境编码自动编码器网络(DCENet) 。在我们的框架中,首先,利用自我注意结构来探索不同代理商之间的空间环境。然后,双流编码器接受培训,通过采用观察到的轨迹和提取的动态空间环境作为投入,学习不同步骤之间的时间背景。空间时局环境被编码成一个潜在的空间,使用调控变自动编码器自动编码器模块(CVAE) 。最后,每种代理商的未来轨迹的预测以从暗中空间取样所学到的空间时空环境为条件。 DCENet通过对轨迹预测最流行的挑战性基准之一进行了评估,并报告了新的动态轨迹/动态空间环境环境网络运行情况。在每套数据库中进行的一次高端点评估。