Pixel-wise predction with deep neural network has become an effective paradigm for salient object detection (SOD) and achieved remakable performance. However, very few SOD models are robust against adversarial attacks which are visually imperceptible for human visual attention. The previous work robust salient object detection against adversarial attacks (ROSA) shuffles the pre-segmented superpixels and then refines the coarse saliency map by the densely connected CRF. Different from ROSA that rely on various pre- and post-processings, this paper proposes a light-weight Learnble Noise (LeNo) to against adversarial attacks for SOD models. LeNo preserves accuracy of SOD models on both adversarial and clean images, as well as inference speed. In general, LeNo consists of a simple shallow noise and noise estimation that embedded in the encoder and decoder of arbitrary SOD networks respectively. Inspired by the center prior of human visual attention mechanism, we initialize the shallow noise with a cross-shaped gaussian distribution for better defense against adversarial attacks. Instead of adding additional network components for post-processing, the proposed noise estimation modifies only one channel of the decoder. With the deeply-supervised noise-decoupled training on state-of-the-art RGB and RGB-D SOD networks, LeNo outperforms previous works not only on adversarial images but also clean images, which contributes stronger robustness for SOD.
翻译:精密神经网络的像素预感与深层神经网络的深层神经网络结合已成为显性物体探测(SOD)并实现可再造性能的有效范例。 然而,很少有甚小的SOD模型能够抵御对抗性攻击,而对抗性攻击的对抗性攻击则在视觉上是看不见的。 先前的工作是强性物体对对抗性攻击(ROSA)的探测(ROSA)将预分解的超级像素冲洗成预分解的超像素,然后根据密密相交的通用报告格式改进粗度显性地图。 与依赖各种前和后处理的ROSA不同,本文建议对SOD模型的对抗性攻击采用轻度可读性图像(LeNo)来对付对抗对抗性攻击性攻击。 LeNo没有在对抗性图像和清洁图像上保持SOD模型的准确性,以及推断性速度。 一般来说, LeNo 包含一个简单的浅度噪音和噪音估计,在人类视觉关注机制之前,我们开始浅度噪音,我们以跨形的Gussian分发,不是用来更好地防御性对抗性攻击性网络,而是在更强的RDAD 上增加更精确的SD。