3D object detection is receiving increasing attention from both industry and academia thanks to its wide applications in various fields. In this paper, we propose Point-Voxel Region-based Convolution Neural Networks (PV-RCNNs) for 3D object detection on point clouds. First, we propose a novel 3D detector, PV-RCNN, which boosts the 3D detection performance by deeply integrating the feature learning of both point-based set abstraction and voxel-based sparse convolution through two novel steps, i.e., the voxel-to-keypoint scene encoding and the keypoint-to-grid RoI feature abstraction. Second, we propose an advanced framework, PV-RCNN++, for more efficient and accurate 3D object detection. It consists of two major improvements: sectorized proposal-centric sampling for efficiently producing more representative keypoints, and VectorPool aggregation for better aggregating local point features with much less resource consumption. With these two strategies, our PV-RCNN++ is about $3\times$ faster than PV-RCNN, while also achieving better performance. The experiments demonstrate that our proposed PV-RCNN++ framework achieves state-of-the-art 3D detection performance on the large-scale and highly-competitive Waymo Open Dataset with 10 FPS inference speed on the detection range of 150m * 150m.
翻译:3D物体探测由于在各个领域的广泛应用,正日益受到产业和学术界的注意。在本文件中,我们提议在点云上为3D物体探测建立基于点-Voxel区域变异神经网络(PV-RCNNNs),首先,我们提出一个新的3D探测器(PV-RCNN),即PV-RCNNN,通过两个新的步骤,即Voxel-Key点场景编码和关键点对电网特征抽象(PV-Grid RoNN),深入整合基于点的抽取和基于oxel的稀释特征,从而提升3D探测性能。第二,我们提出一个先进的框架(PV-RCNN++),用于在点云云云云中检测3D物体。它包括两个重大改进:为高效生产更具代表性的键点进行以部门化建议为中心取样,以及用更少的资源更好地汇集当地点特征的Vestor Pool聚合。根据这两个战略,我们的PV-RC-RCNN++值比PV-G-G 和关键点-G-GNNNNP-G-G-G-G-G-G-GNS-CS-CS-CS-CS-CS-CS-CS-CS-CS-CS-C-C-CS-CS-CS-CS-CS-C-CS-CS-CS-CS-CS-CS-C-CS-C-C-C-CS-CS-C-C-CS-CS-CS-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C