Weakly supervised semantic segmentation produces a pixel-level localization from a classifier, but it is likely to restrict its focus to a small discriminative region of the target object. AdvCAM is an attribution map of an image that is manipulated to increase the classification score. This manipulation is realized in an anti-adversarial manner, which perturbs the images along pixel gradients in the opposite direction from those used in an adversarial attack. It forces regions initially considered not to be discriminative to become involved in subsequent classifications, and produces attribution maps that successively identify more regions of the target object. In addition, we introduce a new regularization procedure that inhibits the incorrect attribution of regions unrelated to the target object and limits the attributions of the regions that already have high scores. On PASCAL VOC 2012 test images, we achieve mIoUs of 68.0 and 76.9 for weakly and semi-supervised semantic segmentation respectively, which represent a new state-of-the-art.
翻译:受微弱监督的语义分解从一个分类器中产生像素级本地化,但有可能将其焦点限制在目标对象的一小块有区别的区域。 AdvCAM是一张被操纵用于提高分类分数的图像归属图。这种操纵是以对抗性方式实现的,它将像素梯度的图像与对抗性攻击中使用的像素梯度的相反方向相扰。它迫使最初被认为没有歧视的区域参与随后的分类,并产生归属图,相继确定目标对象的更多区域。此外,我们引入了新的规范化程序,禁止与目标对象无关的区域的不正确归属,并限制已经得分高的区域的归属。在PACAL VOC 2012 测试图像中,我们为弱小和半超强的语义分解分别实现了68.0和76.9的 mIOU,这代表了一个新的状态。