Considerable research effort has been devoted to LiDAR-based 3D object detection and empirical performance has been significantly improved. While progress has been encouraging, we observe an overlooked issue: it is not yet common practice to compare different 3D detectors under the same cost, e.g., inference latency. This makes it difficult to quantify the true performance gain brought by recently proposed architecture designs. The goal of this work is to conduct a cost-aware evaluation of LiDAR-based 3D object detectors. Specifically, we focus on SECOND, a simple grid-based one-stage detector, and analyze its performance under different costs by scaling its original architecture. Then we compare the family of scaled SECOND with recent 3D detection methods, such as Voxel R-CNN and PV-RCNN++. The results are surprising. We find that, if allowed to use the same latency, SECOND can match the performance of PV-RCNN++, the current state-of-the-art method on the Waymo Open Dataset. Scaled SECOND also easily outperforms many recent 3D detection methods published during the past year. We recommend future research control the inference cost in their empirical comparison and include the family of scaled SECOND as a strong baseline when presenting novel 3D detection methods.
翻译:虽然进展令人鼓舞,但我们注意到一个被忽视的问题:在相同的费用下比较不同的三维探测器,例如推推理潜伏,尚不常见。这使得难以量化最近提议的建筑设计带来的真正性能收益。这项工作的目标是对基于三维天体探测器进行成本认知评估。具体地说,我们侧重于第二层,一个简单的网基单级探测器,并以不同成本分析其性能,缩小其原有结构的规模。然后,我们比较第二层与最近的三维探测方法,例如Voxel R-CNN和PV-RCNN++。结果令人吃惊。我们发现,如果允许使用同样的延缓度,第二层可以匹配目前以光栅为基础的三维天体探测器的性能。在Waymo OpenD数据集上,目前最先进的方法也很容易超越最近公布的许多三维探测方法。我们建议对二层探测器的类别进行对比,例如Voxel R-CNN和PV-RCNNN+++。结果令人吃惊。我们发现,如果允许使用同样的延缓度,第二层可以与Waymo Open dal Drodustrual roduction roduction roduction roduction roview roduction roduction roduction roduction roduction roduction roduction roducation roduction lablegleglegism roduction sublegation lating lating lating sublement subal subal</s>