Predicting the plausible future trajectories of nearby agents is a core challenge for the safety of Autonomous Vehicles and it mainly depends on two external cues: the dynamic neighbor agents and static scene context. Recent approaches have made great progress in characterizing the two cues separately. However, they ignore the correlation between the two cues and most of them are difficult to achieve map-adaptive prediction. In this paper, we use lane as scene data and propose a staged network that Jointly learning Agent and Lane information for Multimodal Trajectory Prediction (JAL-MTP). JAL-MTP use a Social to Lane (S2L) module to jointly represent the static lane and the dynamic motion of the neighboring agents as instance-level lane, a Recurrent Lane Attention (RLA) mechanism for utilizing the instance-level lanes to predict the map-adaptive future trajectories and two selectors to identify the typical and reasonable trajectories. The experiments conducted on the public Argoverse dataset demonstrate that JAL-MTP significantly outperforms the existing models in both quantitative and qualitative.
翻译:预测附近物剂的可疑未来轨迹是机动车辆安全的一个核心挑战,主要取决于两个外部线索:动态邻居代理人和静态场景背景。最近的办法在分别说明这两个线索方面大有进展。不过,它们忽略了这两个线索之间的相互关系,而且大多数都难以实现地图适应性预测。在本文中,我们使用车道作为现场数据,并提议一个分阶段网络,共同学习多式轨迹预测(JAL-MTP)的代理人和航道信息。 JAL-MTP使用社会到巷(S2L)模块联合代表静态车道和邻居代理人动态运动,作为例级车道,即常态道注意(RLA)机制,用以利用例级车道预测地图适应性未来轨迹和两个选择器来确定典型和合理的轨迹。在公共Argoverset数据集上进行的实验显示,JAL-MTP大大超越了现有定量和定性模型。