The task of detecting 3D objects in point cloud has a pivotal role in many real-world applications. However, 3D object detection performance is behind that of 2D object detection due to the lack of powerful 3D feature extraction methods. In order to address this issue, we propose to build a 3D backbone network to learn rich 3D feature maps by using sparse 3D CNN operations for 3D object detection in point cloud. The 3D backbone network can inherently learn 3D features from almost raw data without compressing point cloud into multiple 2D images and generate rich feature maps for object detection. The sparse 3D CNN takes full advantages of the sparsity in the 3D point cloud to accelerate computation and save memory, which makes the 3D backbone network achievable. Empirical experiments are conducted on the KITTI benchmark and results show that the proposed method can achieve state-of-the-art performance for 3D object detection.
翻译:在点云中探测三维天体的任务在许多现实世界应用中具有关键作用。然而,由于缺乏强大的三维特征提取方法,三维天体探测的性能落后于二维天体探测。为了解决这一问题,我们建议建立一个三维主干网,通过使用稀疏的三维CNN操作在点云中探测三维天体物体来学习丰富的三维地貌图。三维主干网可以从几乎原始的数据中学习三维特征,而不必将点云压缩成多个二维图像,并生成丰富的地貌图进行天体探测。稀有的三维CNN利用三维天体云中的广度的优势来加速计算和保存记忆,从而使三维主干网可以实现。在KITTI的基准上进行了经验实验,结果显示拟议的方法可以实现三维天体物体探测的状态性能。