Current state-of-the-art saliency detection models rely heavily on large datasets of accurate pixel-wise annotations, but manually labeling pixels is time-consuming and labor-intensive. There are some weakly supervised methods developed for alleviating the problem, such as image label, bounding box label, and scribble label, while point label still has not been explored in this field. In this paper, we propose a novel weakly-supervised salient object detection method using point supervision. To infer the saliency map, we first design an adaptive masked flood filling algorithm to generate pseudo labels. Then we develop a transformer-based point-supervised saliency detection model to produce the first round of saliency maps. However, due to the sparseness of the label, the weakly supervised model tends to degenerate into a general foreground detection model. To address this issue, we propose a Non-Salient Suppression (NSS) method to optimize the erroneous saliency maps generated in the first round and leverage them for the second round of training. Moreover, we build a new point-supervised dataset (P-DUTS) by relabeling the DUTS dataset. In P-DUTS, there is only one labeled point for each salient object. Comprehensive experiments on five largest benchmark datasets demonstrate our method outperforms the previous state-of-the-art methods trained with the stronger supervision and even surpass several fully supervised state-of-the-art models. The code is available at: https://github.com/shuyonggao/PSOD.
翻译:目前最先进的显要性检测模型严重依赖精确像素说明的大型数据集,但人工标签像素却耗时费力。有些为缓解问题而开发的监管不力的方法,如图像标签、捆绑框标签和排字标签,而在这方面尚未探索点标签。在本文中,我们建议使用一个新颖的、监督不力的突出对象检测方法。为了推断突出性地图,我们首先设计一个适应性掩蔽式洪水填充算法,以生成假标签。然后我们开发一个基于变压器的点超强显要性检测模型,以制作第一轮显要性地图。然而,由于标签的分散性,受监管的薄弱模型往往会退化为一般表面检测模型。为了解决这一问题,我们建议了一个非高度监控性抑制方法,以优化第一轮生成的错误显要性地图,并利用它们来生成假标签标签标签标签标签标签标签标签。此外,我们用新的点超前几级标本/显要性检测模型来制作第一轮的新的点超强度模型。