The DeepFakes, which are the facial manipulation techniques, is the emerging threat to digital society. Various DeepFake detection methods and datasets are proposed for detecting such data, especially for face-swapping. However, recent researches less consider facial animation, which is also important in the DeepFake attack side. It tries to animate a face image with actions provided by a driving video, which also leads to a concern about the security of recent payment systems that reply on liveness detection to authenticate real users via recognising a sequence of user facial actions. However, our experiments show that the existed datasets are not sufficient to develop reliable detection methods. While the current liveness detector cannot defend such videos as the attack. As a response, we propose a new human face animation dataset, called DeepFake MNIST+, generated by a SOTA image animation generator. It includes 10,000 facial animation videos in ten different actions, which can spoof the recent liveness detectors. A baseline detection method and a comprehensive analysis of the method is also included in this paper. In addition, we analyze the proposed dataset's properties and reveal the difficulty and importance of detecting animation datasets under different types of motion and compression quality.
翻译:DeepFakes是面部操纵技术,是对数字社会的新兴威胁。各种深底假检测方法和数据集被提议用于检测这些数据,特别是面部擦拭。然而,最近的研究较少考虑面部动画,这在DeepFake攻击一侧也很重要。它试图用驱动视频提供的行动来动容图像,这也引起了对最新支付系统安全性的担忧,这些系统通过识别用户面部动作的序列来对活性检测进行验证真实用户。然而,我们的实验显示,现有的数据集不足以开发可靠的检测方法。尽管目前的活性检测器无法为袭击等视频进行辩护。作为回应,我们提议一个新的人面部动画数据集,称为DeepFake MNNIST+,由SOTA图像动画生成。它包括10个不同动作中的10,000个面部动画视频,可以模拟最近的活性检测器。一个基线检测方法和对方法的全面分析也包含在本文中。此外,我们分析了提议的数据集的属性,并揭示了在不同的运动和动画数据类型下的难度和重要性。