This work presents Z-Mask, a robust and effective strategy to improve the adversarial robustness of convolutional networks against physically-realizable adversarial attacks. The presented defense relies on specific Z-score analysis performed on the internal network features to detect and mask the pixels corresponding to adversarial objects in the input image. To this end, spatially contiguous activations are examined in shallow and deep layers to suggest potential adversarial regions. Such proposals are then aggregated through a multi-thresholding mechanism. The effectiveness of Z-Mask is evaluated with an extensive set of experiments carried out on models for both semantic segmentation and object detection. The evaluation is performed with both digital patches added to the input images and printed patches positioned in the real world. The obtained results confirm that Z-Mask outperforms the state-of-the-art methods in terms of both detection accuracy and overall performance of the networks under attack. Additional experiments showed that Z-Mask is also robust against possible defense-aware attacks.
翻译:这项工作展示了Z-Mask, 这是一项强大而有效的战略,目的是提高革命网络对抗实际可以实现的对抗性攻击的对抗性强势。 辩护所依据的是对内部网络特征进行特定的Z-级分析,以探测和掩蔽输入图像中与对抗对象相对应的像素。 为此,在浅层和深层对空间毗连激活进行检查,以显示潜在的对抗区域。然后通过多保控机制将这些建议汇总在一起。 Z-Mask的有效性通过一系列广泛的实验来评价,这些实验既针对语义分割模型,又针对物体探测模型。评价是用输入图像和印刷印在真实世界中的补丁进行。获得的结果证实,Z-Mask在探测准确性和攻击网络的总体性方面都超越了最先进的方法。其他实验显示,Z-Mask对于可能的防御性攻击也十分强大。