Semantic segmentation has been continuously investigated in the last ten years, and majority of the established technologies are based on supervised models. In recent years, image-level weakly supervised semantic segmentation (WSSS), including single- and multi-stage process, has attracted large attention due to data labeling efficiency. In this paper, we propose to embed affinity learning of multi-stage approaches in a single-stage model. To be specific, we introduce an adaptive affinity loss to thoroughly learn the local pairwise affinity. As such, a deep neural network is used to deliver comprehensive semantic information in the training phase, whilst improving the performance of the final prediction module. On the other hand, considering the existence of errors in the pseudo labels, we propose a novel label reassign loss to mitigate over-fitting. Extensive experiments are conducted on the PASCAL VOC 2012 dataset to evaluate the effectiveness of our proposed approach that outperforms other standard single-stage methods and achieves comparable performance against several multi-stage methods.
翻译:在过去10年中,对语义分解进行了持续调查,大多数既定技术都以监督模型为基础。近年来,图像层受监管薄弱的语义分解(WSSS),包括单阶段和多阶段过程,由于数据标签效率的提高而引起极大关注。在本文中,我们提议将多阶段方法的亲和性学习纳入单一阶段模式。具体地说,我们引入适应性亲和性损失,以彻底学习本地双向亲和性。因此,深神经网络被用于在培训阶段提供全面的语义信息,同时改进最终预测模块的性能。另一方面,考虑到伪标签中存在错误,我们建议采用新的标签重划损失以减少过度配置。在PACAL VOC 2012 数据集上进行了广泛的实验,以评价我们拟议方法的效力,该方法超越了其他标准的单阶段方法,并以多种多阶段方法取得类似性能。