3D object detection has achieved remarkable progress by taking point clouds as the only input. However, point clouds often suffer from incomplete geometric structures and the lack of semantic information, which makes detectors hard to accurately classify detected objects. In this work, we focus on how to effectively utilize object-level information from images to boost the performance of point-based 3D detector. We present DeMF, a simple yet effective method to fuse image information into point features. Given a set of point features and image feature maps, DeMF adaptively aggregates image features by taking the projected 2D location of the 3D point as reference. We evaluate our method on the challenging SUN RGB-D dataset, improving state-of-the-art results by a large margin (+2.1 mAP@0.25 and +2.3mAP@0.5). Code is available at https://github.com/haoy945/DeMF.
翻译:3D天体探测工作取得了显著进展,将点云作为唯一的输入。然而,点云往往受到不完整的几何结构的影响,缺乏语义信息,使得探测器难以准确分类被探测到的物体。在这项工作中,我们侧重于如何有效利用图像中的物体级信息来提高基于点的3D探测器的性能。我们介绍了DEMF,这是将图像信息结合到点特征的一个简单而有效的方法。鉴于有一套点特征和图像特征图,DEMF适应性综合图像特征,以3D点预测的2D位置作为参照。我们评估了我们关于具有挑战性的 SUN RGB-D 数据集的方法,通过大边缘改进最新结果(+2.1 mAP@0.25和+2.3mAP@0.5)。代码可在https://github.com/haoy945/DeMF中查阅。