Nowadays, Light Detection And Ranging (LiDAR) has been widely used in autonomous vehicles for perception and localization. However, the cost of a high-resolution LiDAR is still prohibitively expensive, while its low-resolution counterpart is much more affordable. Therefore, using low-resolution LiDAR for autonomous driving perception tasks instead of high-resolution LiDAR is an economically feasible solution. In this paper, we propose a novel framework for 3D object detection in Bird-Eye View (BEV) using a low-resolution LiDAR and a monocular camera. Taking the low-resolution LiDAR point cloud and the monocular image as input, our depth completion network is able to produce dense point cloud that is subsequently processed by a voxel-based network for 3D object detection. Evaluated with KITTI dataset, the experimental results shows that the proposed approach performs significantly better than directly applying the 16-line LiDAR point cloud for object detection. For both easy and moderate cases, our detection results are comparable to those from 64-line high-resolution LiDAR. The network architecture and performance evaluations are analyzed in detail.
翻译:目前,光探测和测距(LiDAR)已被广泛用于自主车辆,以进行感知和定位。然而,高分辨率激光雷达的成本仍然高得令人望而却步,而低分辨率激光雷达的成本却高得令人望而却步,而低分辨率激光雷达的成本则更低廉。因此,使用低分辨率激光雷达进行自主驾驶感知任务,而不是高分辨率激光雷达是一种经济上可行的解决办法。在本文中,我们提出了一个在Bird-Eye View(BEV)使用低分辨率激光雷达和单视相机进行三维物体探测的新框架。用低分辨率激光雷达点云和单视图像作为投入,我们的深度完成网络能够产生密度点云,随后由基于 voxel 的3D物体探测网络进行处理。用KITTI 数据集评估的实验结果表明,拟议方法比直接应用16线激光雷达点云进行物体探测要好得多。对于简单和中小案例来说,我们的探测结果与64线高分辨率激光雷达的探测结果相似。网络结构和性评估是详细分析。