High-level representation-guided pixel denoising and adversarial training are independent solutions to enhance the robustness of CNNs against adversarial attacks by pre-processing input data and re-training models, respectively. Most recently, adversarial training techniques have been widely studied and improved while the pixel denoising-based method is getting less attractive. However, it is still questionable whether there exists a more advanced pixel denoising-based method and whether the combination of the two solutions benefits each other. To this end, we first comprehensively investigate two kinds of pixel denoising methods for adversarial robustness enhancement (i.e., existing additive-based and unexplored filtering-based methods) under the loss functions of image-level and semantic-level restorations, respectively, showing that pixel-wise filtering can obtain much higher image quality (e.g., higher PSNR) as well as higher robustness (e.g., higher accuracy on adversarial examples) than existing pixel-wise additive-based method. However, we also observe that the robustness results of the filtering-based method rely on the perturbation amplitude of adversarial examples used for training. To address this problem, we propose predictive perturbation-aware pixel-wise filtering, where dual-perturbation filtering and an uncertainty-aware fusion module are designed and employed to automatically perceive the perturbation amplitude during the training and testing process. The proposed method is termed as AdvFilter. Moreover, we combine adversarial pixel denoising methods with three adversarial training-based methods, hinting that considering data and models jointly is able to achieve more robust CNNs. The experiments conduct on NeurIPS-2017DEV, SVHN, and CIFAR10 datasets and show the advantages over enhancing CNNs' robustness, high generalization to different models, and noise levels.
翻译:高级代表制像素导像素分解法和对抗性培训是独立的解决方案,可以分别通过预处理输入数据和再培训模型,加强CNN对对抗性攻击的稳健性。最近,对对抗性培训技术进行了广泛的研究和改进,而以像素分解为基础的方法则越来越不那么有吸引力。然而,仍然令人怀疑的是,是否存在一种更先进的像素分解法,以及两种解决方案的结合是否相互有益。为此,我们首先全面调查两种在图像级别和语义分解法的丧失功能下(即现有基于添加剂的过滤法和未使用过滤法的过滤法)CNNISN对对抗性攻击的稳性拆解性方法。在图像级别和语义分解法的恢复过程中,以像素分解法为方向的过滤法(e.g.,更高的PSNRR)以及较强的稳性模型(e.setrob)为制式的对立比对立法。然而,我们还注意到,在不断升级的Orvical-real view rodual view view view view rodude view roduction rodududustration 方法中,在使用一种我们使用了一种双轨方法。