Methods for 3D lane detection have been recently proposed to address the issue of inaccurate lane layouts in many autonomous driving scenarios (uphill/downhill, bump, etc.). Previous work struggled in complex cases due to their simple designs of the spatial transformation between front view and bird's eye view (BEV) and the lack of a realistic dataset. Towards these issues, we present PersFormer: an end-to-end monocular 3D lane detector with a novel Transformer-based spatial feature transformation module. Our model generates BEV features by attending to related front-view local regions with camera parameters as a reference. PersFormer adopts a unified 2D/3D anchor design and an auxiliary task to detect 2D/3D lanes simultaneously, enhancing the feature consistency and sharing the benefits of multi-task learning. Moreover, we release one of the first large-scale real-world 3D lane datasets: OpenLane, with high-quality annotation and scenario diversity. OpenLane contains 200,000 frames, over 880,000 instance-level lanes, 14 lane categories, along with scene tags and the closed-in-path object annotations to encourage the development of lane detection and more industrial-related autonomous driving methods. We show that PersFormer significantly outperforms competitive baselines in the 3D lane detection task on our new OpenLane dataset as well as Apollo 3D Lane Synthetic dataset, and is also on par with state-of-the-art algorithms in the 2D task on OpenLane. The project page is available at https://github.com/OpenPerceptionX/PersFormer_3DLane and OpenLane dataset is provided at https://github.com/OpenPerceptionX/OpenLane.
翻译:最近提出了3D车道探测方法,以解决许多自主驾驶情景(上坡/下坡、下坡等)中不准确的车道布局问题。由于在前视和鸟眼视图之间空间转换的简单设计(BEV)和缺乏现实的数据集,以往的工作在复杂情况下挣扎。为了解决这些问题,我们推出了PersFormer:端到端单向单向3D车道探测器,配有新型变异器空间特征转换模块。我们的模型通过以摄像参数作为参照,关注相关的前视地方区域,生成了BEV特征。 PersFormer采用了统一的 2D/3D 锚设计和辅助任务,以同时检测2D/3D车道,提高地貌一致性,共享多任务学习的好处。此外,我们发布了第一个大型真实世界3D车道单路路探测器:OpenLane,配有高品质的注释/情景多样性。 OpenLane包含880,000以上平级车道分类,加上屏幕标签和闭路路路路路径,同时进行检测L3L3LD