Though quite challenging, leveraging large-scale unlabeled or partially labeled images in a cost-effective way has increasingly attracted interests for its great importance to computer vision. To tackle this problem, many Active Learning (AL) methods have been developed. However, these methods mainly define their sample selection criteria within a single image context, leading to the suboptimal robustness and impractical solution for large-scale object detection. In this paper, aiming to remedy the drawbacks of existing AL methods, we present a principled Self-supervised Sample Mining (SSM) process accounting for the real challenges in object detection. Specifically, our SSM process concentrates on automatically discovering and pseudo-labeling reliable region proposals for enhancing the object detector via the introduced cross image validation, i.e., pasting these proposals into different labeled images to comprehensively measure their values under different image contexts. By resorting to the SSM process, we propose a new AL framework for gradually incorporating unlabeled or partially labeled data into the model learning while minimizing the annotating effort of users. Extensive experiments on two public benchmarks clearly demonstrate our proposed framework can achieve the comparable performance to the state-of-the-art methods with significantly fewer annotations.
翻译:尽管具有相当挑战性,但以具有成本效益的方式利用大规模未贴标签或部分贴标签的图像已经日益吸引了对计算机视觉的极大重要性的兴趣。为了解决这一问题,已经制定了许多主动学习方法。但是,这些方法主要是在单一图像背景下界定其样本选择标准,导致大规模物体探测的不优化强性和不切实际的解决方案。在本文件中,为了补救现有AL方法的缺陷,我们提出了一个原则性自我监督采样(SSM)进程,其中说明了物体探测方面的实际挑战。具体地说,我们的SSM进程侧重于自动发现和假贴标签可靠的区域建议,通过引入的交叉图像验证,即将这些建议贴上不同的标签图像,以全面衡量不同图像环境中的价值观。我们通过SSM进程,提出了一个新的AL框架,以逐步将未贴标签或部分贴标签的数据纳入模型学习,同时尽量减少用户的注意努力。在两个公共基准上进行的广泛实验清楚地表明,我们提议的框架能够实现与州度说明方法相近的业绩。