The performance of object detection, to a great extent, depends on the availability of large annotated datasets. To alleviate the annotation cost, the research community has explored a number of ways to exploit unlabeled or weakly labeled data. However, such efforts have met with limited success so far. In this work, we revisit the problem with a pragmatic standpoint, trying to explore a new balance between detection performance and annotation cost by jointly exploiting fully and weakly annotated data. Specifically, we propose a weakly- and semi-supervised object detection framework (WSSOD), which involves a two-stage learning procedure. An agent detector is first trained on a joint dataset and then used to predict pseudo bounding boxes on weakly-annotated images. The underlying assumptions in the current as well as common semi-supervised pipelines are also carefully examined under a unified EM formulation. On top of this framework, weakly-supervised loss (WSL), label attention and random pseudo-label sampling (RPS) strategies are introduced to relax these assumptions, bringing additional improvement on the efficacy of the detection pipeline. The proposed framework demonstrates remarkable performance on PASCAL-VOC and MSCOCO benchmark, achieving a high performance comparable to those obtained in fully-supervised settings, with only one third of the annotations.
翻译:在这项工作中,我们以务实的观点重新审视了这一问题,试图通过充分和微弱的附加说明的数据来探索探测性能与说明性成本之间的新平衡。具体地说,我们提议建立一个薄弱和半监督的物体探测框架(WGRE),这涉及一个两阶段学习程序。首先,对一个代理探测器进行了联合数据集培训,然后用来预测标记性弱的图象上的假装盒。目前和共同的半监督管道的基本假设也在统一的EM的编制之下得到仔细审查。除了这一框架之外,还引入了薄弱的超强损失(WSL)、标签关注和随机假冒标签抽样(RPS)战略,以放松这些假设,从而进一步提高探测性管的效能。拟议的框架仅展示了在标准性能高的PASCO标准下取得的高水平业绩,仅展示了高水平的MSCO标准。