Weakly-supervised semantic segmentation (WSSS) with image-level labels has been widely studied to relieve the annotation burden of the traditional segmentation task. In this paper, we show that existing fully-annotated base categories can help segment objects of novel categories with only image-level labels, even if base and novel categories have no overlap. We refer to this task as weak-shot semantic segmentation, which could also be treated as WSSS with auxiliary fully-annotated categories. Recent advanced WSSS methods usually obtain class activation maps (CAMs) and refine them by affinity propagation. Based on the observation that semantic affinity and boundary are class-agnostic, we propose a method under the WSSS framework to transfer semantic affinity and boundary from base categories to novel ones. As a result, we find that pixel-level annotation of base categories can facilitate affinity learning and propagation, leading to higher-quality CAMs of novel categories. Extensive experiments on PASCAL VOC 2012 dataset demonstrate that our method significantly outperforms WSSS baselines on novel categories.
翻译:在本文中,我们表明,现有的充分注解基类可以帮助新类的分块对象,只有图像级标签,即使基类和新类标签没有重叠。我们将此任务称为微弱发音分块,也可以作为辅助全注分类的辅助性全注分类处理。最近先进的SSS方法通常获得等级激活图(CAMs),并通过亲近性传播加以改进。基于对语义亲近性和边界是阶级不可知性的观察,我们提议了一种在SSS框架下将语义亲近性和边界从基类转为新类的方法。结果,我们发现基类的像素级分解可以促进亲近性学习和传播,从而导致质量更高的CAMs新类。关于PASAL VOC2012年数据设置的广泛实验表明,我们的方法大大超出了我们关于PASAL VOC新类的系统SS基线。