Motion prediction for traffic participants is essential for a safe and robust automated driving system, especially in cluttered urban environments. However, it is highly challenging due to the complex road topology as well as the uncertain intentions of the other agents. In this paper, we present a graph-based trajectory prediction network named the Dual Scale Predictor (DSP), which encodes both the static and dynamical driving context in a hierarchical manner. Different from methods based on a rasterized map or sparse lane graph, we consider the driving context as a graph with two layers, focusing on both geometrical and topological features. Graph neural networks (GNNs) are applied to extract features with different levels of granularity, and features are subsequently aggregated with attention-based inter-layer networks, realizing better local-global feature fusion. Following the recent goal-driven trajectory prediction pipeline, goal candidates with high likelihood for the target agent are extracted, and predicted trajectories are generated conditioned on these goals. Thanks to the proposed dual-scale context fusion network, our DSP is able to generate accurate and human-like multi-modal trajectories. We evaluate the proposed method on the large-scale Argoverse motion forecasting benchmark, and it achieves promising results, outperforming the recent state-of-the-art methods.
翻译:交通参与者的动态预测对于安全和稳健的自动化驾驶系统至关重要,特别是在拥挤的城市环境中。然而,由于道路地形复杂,其他物剂的用意不确定,这具有极大的挑战性。在本文件中,我们展示了一个基于图形的轨迹预测网络,名为双比例预测(DSP),以等级方式将静态和动态驱动环境编码。不同于以光化地图或稀疏通道图为基础的方法,我们认为驾驶背景是一个以两层为主的图表,侧重于几何和地形特征。图形神经网络(GNN)被用于提取不同颗粒度的特征,随后与基于关注的跨层网络汇总,实现更好的地方-全球特征融合。根据最近以目标驱动的轨迹预测管道,提取了目标物剂极有可能达到的目标物,并生成了以这些目标为条件的预测轨迹。由于拟议的双比例环境聚合网络,我们的DSP能够产生准确和人相类似的多模式动态预测结果,我们提出的州级预测方法在高尺度的最近模型上取得了前景。