具有净化机制和结构相似性损失的突出物体探测 (Salient Object Detection with Purificatory Mechanism and Structural Similarity Loss)

By the aid of attention mechanisms to weight the image features adaptively, recent advanced deep learning-based models encourage the predicted results to approximate the ground-truth masks with as large predictable areas as possible, thus achieving the state-of-the-art performance. However, these methods do not pay enough attention to small areas prone to misprediction. In this way, it is still tough to accurately locate salient objects due to the existence of regions with indistinguishable foreground and background and regions with complex or fine structures. To address these problems, we propose a novel convolutional neural network with purificatory mechanism and structural similarity loss. Specifically, in order to better locate preliminary salient objects, we first introduce the promotion attention, which is based on spatial and channel attention mechanisms to promote attention to salient regions. Subsequently, for the purpose of restoring the indistinguishable regions that can be regarded as error-prone regions of one model, we propose the rectification attention, which is learned from the areas of wrong prediction and guide the network to focus on error-prone regions thus rectifying errors. Through these two attentions, we use the Purificatory Mechanism to impose strict weights with different regions of the whole salient objects and purify results from hard-to-distinguish regions, thus accurately predicting the locations and details of salient objects. In addition to paying different attention to these hard-to-distinguish regions, we also consider the structural constraints on complex regions and propose the Structural Similarity Loss. In experiments, the proposed approach outperforms 19 state-of-the-art methods on six datasets with a notable margin at over 27FPS on a single NVIDIA 1080Ti GPU.

翻译：借助关注机制来调整图像特征的适应性,最近先进的深层次学习模型鼓励了预测结果,将地面真实面罩与尽可能大的可预测区域相近,从而达到最先进的性能。然而,这些方法并没有足够关注容易被误解的小区域。因此,由于存在一些无法区分的地表和背景区域以及结构复杂或结构精密的区域,因此要准确定位突出的物体仍很困难。为了解决这些问题,我们提议建立一个具有净化机制以及结构相似性损失的新型革命性神经网络。具体地说,为了更好地定位初步显著目标,我们首先引入了促销关注,这是基于空间和引导关注机制,以促进对突出区域的注意。因此,为了恢复可被视为一种模式中易误差的偏差区域,我们建议从错误的预测领域吸取纠正性关注,并指导网络关注易误差区域,从而纠正错误。通过这两个关注点,我们用精确的纯度、纯度、纯度、纯度的估算性结果,我们用一个简单、精确的估算性结构模型,我们用一个重的预估测的系统向不同的区域。

相关内容

注意力机制

关注 120

Attention机制最早是在视觉图像领域提出来的，但是真正火起来应该算是google mind团队的这篇论文《Recurrent Models of Visual Attention》[14]，他们在RNN模型上使用了attention机制来进行图像分类。随后，Bahdanau等人在论文《Neural Machine Translation by Jointly Learning to Align and Translate》 [1]中，使用类似attention的机制在机器翻译任务上将翻译和对齐同时进行，他们的工作算是是第一个提出attention机制应用到NLP领域中。接着类似的基于attention机制的RNN模型扩展开始应用到各种NLP任务中。最近，如何在CNN中使用attention机制也成为了大家的研究热点。下图表示了attention研究进展的大概趋势。

图像分割方法综述

专知会员服务

56+阅读 · 2020年11月22日

图像分割二十年，盘点影响力最大的10篇论文

专知会员服务

84+阅读 · 2020年9月27日

2020数据工程师成长路线图

专知会员服务

19+阅读 · 2020年9月6日

【旷视-CVPR2020】领域自适应对象检测的探索类别正则化，Exploring Categorical Regularization for Domain Adaptive Object Detection

专知会员服务

38+阅读 · 2020年3月23日