Modern autonomous driving systems are typically organized as modular tasks in sequential order, i.e., perception, prediction, and planning. As sensors and hardware improve, there is a growing trend toward systems that can perform a wide variety of tasks to achieve higher-level intelligence. Contemporary approaches either deploy standalone models for individual tasks or design a multi-task paradigm with separate heads. These approaches may suffer from accumulated errors or negative transfer. Instead, we argue that a favorable framework should be devised and optimized in pursuit of the ultimate goal, i.e., planning for the self-driving car. With this goal in mind, we revisit the key components of perception and prediction. We analyze each module and prioritize the tasks hierarchically so that all of them contribute to planning. To this end, we introduce Unified Autonomous Driving (UniAD), the first comprehensive framework to date that incorporates full-stack driving tasks in one network. It is carefully designed to leverage the advantages of each module and to provide complementary feature abstractions for agent interaction from a global perspective. Tasks communicate through a unified query design so that they facilitate one another toward planning. We instantiate UniAD on the challenging nuScenes benchmark. Extensive ablations show that this philosophy is effective, surpassing previous state-of-the-art methods by a large margin in all aspects. The full codebase and models will be made available to facilitate future research in the community.