Deepfakes pose growing challenges to the trust of information on the Internet. Therefore,detecting deepfakes has attracted increasing attentions from both academia and industry. State-of-the-art deepfake detection methods consist of two key components, i.e., face extractor and face classifier, which extract the face region in an image and classify it to be real/fake, respectively. Existing studies mainly focused on improving the detection performance in non-adversarial settings, leaving security of deepfake detection in adversarial settings largely unexplored. In this work, we aim to bridge the gap. In particular, we perform a systematic measurement study to understand the security of the state-of-the-art deepfake detection methods in adversarial settings. We use two large-scale public deepfakes data sources including FaceForensics++ and Facebook Deepfake Detection Challenge, where the deepfakes are fake face images; and we train state-of-the-art deepfake detection methods. These detection methods can achieve 0.94--0.99 accuracies in non-adversarial settings on these datasets. However, our measurement results uncover multiple security limitations of the deepfake detection methods in adversarial settings. First, we find that an attacker can evade a face extractor, i.e., the face extractor fails to extract the correct face regions, via adding small Gaussian noise to its deepfake images. Second, we find that a face classifier trained using deepfakes generated by one method cannot detect deepfakes generated by another method, i.e., an attacker can evade detection via generating deepfakes using a new method. Third, we find that an attacker can leverage backdoor attacks developed by the adversarial machine learning community to evade a face classifier. Our results highlight that deepfake detection should consider the adversarial nature of the problem.
翻译:深假对互联网上信息的信任构成越来越多的挑战。 因此, 发现深假会引起学术界和业界越来越多的关注。 最先进的深假检测方法由两个关键组成部分组成, 即: 面部提取器和面部分类器, 将脸部区域以图像提取出来, 将其分类为真实/ 假的。 现有研究主要侧重于改进非对称环境中的检测性能, 使得对立环境中的深假发现安全性基本得不到探索。 在这项工作中, 我们的目标是弥合差距。 特别是, 我们进行系统化的测量研究, 以了解在对立环境中的状态深假的深假发现方法。 我们用深假的纸部检测法, 无法通过深度的对称检测方法, 我们用深度的面部位检测法, 也可以通过这些对面部的深度检测法, 来生成一种对面部的深度检测方法。 我们用深度对面部检测法, 也可以通过一次对面部的对面部检测方法, 来检测。