Autonomous driving requires a comprehensive understanding of the surrounding environment for reliable trajectory planning. Previous works rely on dense rasterized scene representation (e.g., agent occupancy and semantic map) to perform planning, which is computationally intensive and misses the instance-level structure information. In this paper, we propose VAD, an end-to-end vectorized paradigm for autonomous driving, which models the driving scene as fully vectorized representation. The proposed vectorized paradigm has two significant advantages. On one hand, VAD exploits the vectorized agent motion and map elements as explicit instance-level planning constraints which effectively improves planning safety. On the other hand, VAD runs much faster than previous end-to-end planning methods by getting rid of computation-intensive rasterized representation and hand-designed post-processing steps. VAD achieves state-of-the-art end-to-end planning performance on the nuScenes dataset, outperforming the previous best method by a large margin (reducing the average collision rate by 48.4%). Besides, VAD greatly improves the inference speed (up to 9.3x), which is critical for the real-world deployment of an autonomous driving system. Code and models will be released for facilitating future research.
翻译:自动驾驶需要全面了解周围环境以进行可靠的轨迹规划。先前的研究依赖于密集的栅格化场景表示(例如,代理人占用和语义地图)来执行规划,这是计算密集型的,并且缺失实例级结构信息。在本文中,我们提出VAD,这是一种用于自动驾驶的端到端向量化范式,它将驾驶场景建模为完全向量化表示。所提出的向量化范式具有两个显著优点。一方面,VAD利用向量化代理运动和地图元素作为明确的实例级规划约束,从而有效提高了规划安全性。另一方面,VAD比以前的端到端规划方法运行更快,通过摆脱计算密集型的栅格化表示和手动设计的后处理步骤。VAD在nuScenes数据集上实现了最先进的端到端规划性能,比以前最佳方法表现得更好(将平均碰撞率降低了48.4%)。此外,VAD极大地提高了推理速度(高达9.3倍),这对于自动驾驶系统的实际部署非常关键。代码和模型将被发布,以促进未来的研究。