Weakly supervised object detection (WSOD) has recently attracted much attention. However, the method, performance and speed gaps between WSOD and fully supervised detection prevent WSOD from being applied in real-world tasks. To bridge the gaps, this paper proposes a new framework, Salvage of Supervision (SoS), with the key idea being to harness every potentially useful supervisory signal in WSOD: the weak image-level labels, the pseudo-labels, and the power of semi-supervised object detection. This paper shows that each type of supervisory signal brings in notable improvements, outperforms existing WSOD methods (which mainly use only the weak labels) by large margins. The proposed SoS-WSOD method achieves 64.4 $m\text{AP}_{50}$ on VOC2007, 61.9 $m\text{AP}_{50}$ on VOC2012 and 16.4 $m\text{AP}_{50:95}$ on MS-COCO, and also has fast inference speed. Ablations and visualization further verify the effectiveness of SoS.
翻译:然而,WSOD和完全受监督的检测之间的方法、性能和速度差距使WSOD无法应用于现实世界的任务。为了弥合差距,本文件提出了一个新的框架,即 " 保护监督 " (SOS),其关键思想是利用WSOD中每一个潜在有用的监督信号:微弱的图像标签、伪标签和半监督的检测对象的功率。本文表明,每类监督信号都带来显著的改进,以大幅度的幅度超过现有的WSOD方法(主要使用弱的标签)。拟议的SSOD方法在VOC2007年实现了64.4万美元(text{AP ⁇ 50美元)、VOC2012年达到61.9美元(text{AP ⁇ 50美元)和MS-CO的16.4美元(text{AP ⁇ 50:95美元),并且具有快速的推导速度。