The wide application of deep neural networks (DNNs) demands an increasing amount of attention to their real-world robustness, i.e., whether a DNN resists black-box adversarial attacks, among them score-based query attacks (SQAs) are the most threatening ones because of their practicalities and effectiveness: the attackers only need dozens of queries on model outputs to seriously hurt a victim network. Defending against SQAs requires a slight but artful variation of outputs due to the service purpose for users, who share the same output information with attackers. In this paper, we propose a real-world defense, called Unifying Gradients (UniG), to unify gradients of different data so that attackers could only probe a much weaker attack direction that is similar for different samples. Since such universal attack perturbations have been validated as less aggressive than the input-specific perturbations, UniG protects real-world DNNs by indicating attackers a twisted and less informative attack direction. To enhance UniG's practical significance in real-world applications, we implement it as a Hadamard product module that is computationally-efficient and readily plugged into any model. According to extensive experiments on 5 SQAs and 4 defense baselines, UniG significantly improves real-world robustness without hurting clean accuracy on CIFAR10 and ImageNet. For instance, UniG maintains a CIFAR-10 model of 77.80% accuracy under 2500-query Square attack while the state-of-the-art adversarially-trained model only has 67.34% on CIFAR10. Simultaneously, UniG greatly surpasses all compared baselines in clean accuracy and the modification degree of outputs. The code would be released.
翻译:67. 深度神经网络(DNNS)的广泛应用要求人们更加关注其真实世界的强力,即DNN是否抵制黑盒对抗性攻击,其中包括基于分数的查询攻击(SQAs)是否因其实用性和有效性而最具有威胁性:攻击者只需对模型输出进行数十次询问,就可严重伤害受害者网络。保护SQAs需要因用户的服务目的而对产出进行略微但奇异的改变,用户与攻击者共享相同的输出信息。在本文中,我们提议对真实世界进行防御,称为United Gradients(Unid-G),以统一不同数据的梯度,以便攻击者只能探测与不同样品相似的更弱攻击方向。由于这种普遍攻击的侵扰被证实比输入特定侵袭网络严重伤害受害者网络。UNG保护真实世界的DNNPs,因为用户的模型是扭曲和不那么信息化的攻击方向。为了提高UG在现实世界应用中的实际意义,我们建议将其作为HAND的修改产品模块,而无需进行精确的精确度和直观的精确度。