Deep neural network based face recognition models have been shown to be vulnerable to adversarial examples. However, many of the past attacks require the adversary to solve an input-dependent optimization problem using gradient descent which makes the attack impractical in real-time. These adversarial examples are also tightly coupled to the attacked model and are not as successful in transferring to different models. In this work, we propose ReFace, a real-time, highly-transferable attack on face recognition models based on Adversarial Transformation Networks (ATNs). ATNs model adversarial example generation as a feed-forward neural network. We find that the white-box attack success rate of a pure U-Net ATN falls substantially short of gradient-based attacks like PGD on large face recognition datasets. We therefore propose a new architecture for ATNs that closes this gap while maintaining a 10000x speedup over PGD. Furthermore, we find that at a given perturbation magnitude, our ATN adversarial perturbations are more effective in transferring to new face recognition models than PGD. ReFace attacks can successfully deceive commercial face recognition services in a transfer attack setting and reduce face identification accuracy from 82% to 16.4% for AWS SearchFaces API and Azure face verification accuracy from 91% to 50.1%.
翻译:以深神经网络为基础的面部识别模型被证明易受对抗性实例的影响。然而,过去许多袭击都要求对手使用梯度下降来解决依赖投入的优化问题,这使得攻击在实时时不切实际。这些对抗性实例也与被攻击的模型紧密结合,在向不同模型转移方面不那么成功。在这项工作中,我们提议对以反向转型网络为基础的面部识别模型进行实时、高度可转移的攻击ReFace。ATN对抗性攻击模型作为向神经网络提供反馈的模型生成。我们发现,纯 U-Net ATN的白箱袭击成功率大大低于大型面部识别数据集PGD等基于梯度的袭击。因此,我们为ATN提出一个新的结构,在缩小这一差距的同时,在PGD上保持了10000x速度。此外,我们发现,在给定的扰动程度上,我们的ATN对抗性攻击模型在向新的面部识别模型转移到新的面部识别模型方面比PGDGD更有效。 ReFace攻击能够成功地将商业识别服务从APD50的准确度从A-1%的A-1%的AST-I的准确度设定为AST As 16的准确度,并减少AVI的准确度为A+%As的准确度。