Weakly-supervised object detection (WSOD) aims to train an object detector only requiring the image-level annotations. Recently, some works have managed to select the accurate boxes generated from a well-trained WSOD network to supervise a semi-supervised detection framework for better performance. However, these approaches simply divide the training set into labeled and unlabeled sets according to the image-level criteria, such that sufficient mislabeled or wrongly localized box predictions are chosen as pseudo ground-truths, resulting in a sub-optimal solution of detection performance. To overcome this issue, we propose a novel WSOD framework with a new paradigm that switches from weak supervision to noisy supervision (W2N). Generally, with given pseudo ground-truths generated from the well-trained WSOD network, we propose a two-module iterative training algorithm to refine pseudo labels and supervise better object detector progressively. In the localization adaptation module, we propose a regularization loss to reduce the proportion of discriminative parts in original pseudo ground-truths, obtaining better pseudo ground-truths for further training. In the semi-supervised module, we propose a two tasks instance-level split method to select high-quality labels for training a semi-supervised detector. Experimental results on different benchmarks verify the effectiveness of W2N, and our W2N outperforms all existing pure WSOD methods and transfer learning methods. Our code is publicly available at https://github.com/1170300714/w2n_wsod.
翻译:微弱监督天体探测( WSOD) 的目的是训练只要求图像级别说明的物体探测器( WSOD) 。 最近, 一些工程设法选择了从受过良好训练的 WSOD 网络中生成的准确的盒子, 以监督半监督检测框架, 以便提高性能。 然而, 这些方法只是按照图像级别标准将训练组分为标签和无标签的数据集, 这样就可以选择足够的错误标签或错误的本地化框预测作为假的地面真相, 从而导致检测性能的亚最佳解决方案。 为了克服这一问题, 我们提议了一个新型的SSODD框架, 其新的范例可以将监管薄弱的监管转换为噪音监督( W2N2N)。 一般来说, 由受过良好训练的SWOD网络生成的假地真真真真真真真真真假, 我们的原始地面正正正正正正正正正正的SWOD2 演示方法, 将现有精准地真真正正正的校准地和半正正式的校准的校准的校准方法 。