Bird's eye view (BEV) is widely adopted by most of the current point cloud detectors due to the applicability of well-explored 2D detection techniques. However, existing methods obtain BEV features by simply collapsing voxel or point features along the height dimension, which causes the heavy loss of 3D spatial information. To alleviate the information loss, we propose a novel point cloud detection network based on a Multi-level feature dimensionality reduction strategy, called MDRNet. In MDRNet, the Spatial-aware Dimensionality Reduction (SDR) is designed to dynamically focus on the valuable parts of the object during voxel-to-BEV feature transformation. Furthermore, the Multi-level Spatial Residuals (MSR) is proposed to fuse the multi-level spatial information in the BEV feature maps. Extensive experiments on nuScenes show that the proposed method outperforms the state-of-the-art methods. The code will be available upon publication.
翻译:鸟类眼视(BEV)被当前大多数点云探测器广泛采用,原因是2D探测技术的开发效果良好。然而,现有方法通过简单地在高度维度一带折叠 voxel 或点谱功能获得BEV特征,从而导致3D空间信息严重丢失。为了减轻信息损失,我们提议基于多层次特征维度减少战略(称为MDRNet)的新型点云探测网络。在MDRNet, 空间觉识分量减少(SDRNet)旨在动态地聚焦在Voxel-BEV特征变异期间该物体的宝贵部分。此外,多层次空间残留物(MSR)拟将BEV特征地图中的多层次空间信息整合起来。关于核气象的广泛实验显示,拟议的方法超越了最新技术的方法。该代码将在出版时公布。