Currently, there have been many kinds of voxel-based 3D single stage detectors, while point-based single stage methods are still underexplored. In this paper, we first present a lightweight and effective point-based 3D single stage object detector, named 3DSSD, achieving a good balance between accuracy and efficiency. In this paradigm, all upsampling layers and refinement stage, which are indispensable in all existing point-based methods, are abandoned to reduce the large computation cost. We novelly propose a fusion sampling strategy in downsampling process to make detection on less representative points feasible. A delicate box prediction network including a candidate generation layer, an anchor-free regression head with a 3D center-ness assignment strategy is designed to meet with our demand of accuracy and speed. Our paradigm is an elegant single stage anchor-free framework, showing great superiority to other existing methods. We evaluate 3DSSD on widely used KITTI dataset and more challenging nuScenes dataset. Our method outperforms all state-of-the-art voxel-based single stage methods by a large margin, and has comparable performance to two stage point-based methods as well, with inference speed more than 25 FPS, 2x faster than former state-of-the-art point-based methods.
翻译:目前,有许多基于oxel的基于3D的单一级探测器,而基于点的单一级方法仍未得到充分探讨。在本文中,我们首先提出一个轻量和有效的基于点的3D单一级物体探测器,名为3DSSD,在准确性和效率之间实现良好的平衡。在这个范例中,所有现有所有基于点的方法中不可或缺的高采样层和精细阶段都被废弃,以降低巨大的计算成本。我们新颖地提议在下游取样过程中采用混合取样战略,以便在代表性较低的点上进行探测。一个微妙的盒式预测网络,包括一个候选生成层,一个带有3D中心状态定位定位定位的回归头,旨在满足我们的准确性和速度需求。我们的范例是一个优雅的单一级无锚框架,显示了其他现有方法的优越性。我们对广泛使用的KITTI数据集和更具挑战性的nuScenes数据集进行了3DSDSD的评审。我们的方法比所有基于oxel的州级单级方法都更接近于大边缘,并且比基于FPS-25级的速度要快。