Despite the importance of unsupervised object detection, to the best of our knowledge, there is no previous work addressing this problem. One main issue, widely known to the community, is that object boundaries derived only from 2D image appearance are ambiguous and unreliable. To address this, we exploit LiDAR clues to aid unsupervised object detection. By exploiting the 3D scene structure, the issue of localization can be considerably mitigated. We further identify another major issue, seldom noticed by the community, that the long-tailed and open-ended (sub-)category distribution should be accommodated. In this paper, we present the first practical method for unsupervised object detection with the aid of LiDAR clues. In our approach, candidate object segments based on 3D point clouds are firstly generated. Then, an iterative segment labeling process is conducted to assign segment labels and to train a segment labeling network, which is based on features from both 2D images and 3D point clouds. The labeling process is carefully designed so as to mitigate the issue of long-tailed and open-ended distribution. The final segment labels are set as pseudo annotations for object detection network training. Extensive experiments on the large-scale Waymo Open dataset suggest that the derived unsupervised object detection method achieves reasonable accuracy compared with that of strong supervision within the LiDAR visible range. Code shall be released.
翻译:尽管未经监督的天体探测非常重要,但据我们所知,以前没有关于该问题的任何工作。一个人们广泛知道的主要问题是,仅从 2D 图像外观得出的天体边界是模糊和不可靠的。为了解决这个问题,我们利用激光雷达线索协助不受监督的天体探测。通过利用3D 场景结构,定位问题可以大大减轻。我们进一步确定另一个重要问题,社区很少注意到,应当照顾到长尾和开放(子)类别的分布。在本文件中,我们介绍了第一个由激光雷达线索协助的未经监督天体探测实用方法。在我们的方法中,以3D点云为主的候选天体部分首先产生。然后,进行迭代部分标签进程,以指定段标签,并训练一个分区标签网络,以2D 图像和 3D 点云的特征为基础。标签过程经过仔细设计,以便减轻长尾和开放的天体(子)分布问题。最终的天体探测器标签应该以高清晰度的天体探测方法,在大型天体探测网络中进行合理的精确度测试。