WridDeepfake:一个挑战性真实世界数据集,用于深假探测 (WildDeepfake: A Challenging Real-World Dataset for Deepfake Detection)

In recent years, the abuse of a face swap technique called deepfake Deepfake has raised enormous public concerns. So far, a large number of deepfake videos (known as "deepfakes") have been crafted and uploaded to the internet, calling for effective countermeasures. One promising countermeasure against deepfakes is deepfake detection. Several deepfake datasets have been released to support the training and testing of deepfake detectors, such as DeepfakeDetection and FaceForensics++. While this has greatly advanced deepfake detection, most of the real videos in these datasets are filmed with a few volunteer actors in limited scenes, and the fake videos are crafted by researchers using a few popular deepfake softwares. Detectors developed on these datasets may become less effective against real-world deepfakes on the internet. To better support detection against real-world deepfakes, in this paper, we introduce a new dataset WildDeepfake, which consists of 7,314 face sequences extracted from 707 deepfake videos collected completely from the internet. WildDeepfake is a small dataset that can be used, in addition to existing datasets, to develop and test the effectiveness of deepfake detectors against real-world deepfakes. We conduct a systematic evaluation of a set of baseline detection networks on both existing and our WildDeepfake datasets, and show that WildDeepfake is indeed a more challenging dataset, where the detection performance can decrease drastically. We also propose two (eg. 2D and 3D) Attention-based Deepfake Detection Networks (ADDNets) to leverage the attention masks on real/fake faces for improved detection. We empirically verify the effectiveness of ADDNets on both existing datasets and WildDeepfake. The dataset is available at:https://github.com/deepfakeinthewild/deepfake-in-the-wild.

翻译：近些年来, 滥用称为深假深假的面部交换技术引起了公众的极大关注。到目前为止, 大量深假视频( 称为“ 深假” ) 已经制作并上传到互联网上, 要求采取有效的对策。对深假的有希望的反制措施是深假的探测。一些深假的数据集已经发布, 以支持深假探测器的培训和测试, 例如 Deepfake 检测和 Flace Forensiccs++。虽然这已经大大推进了深假检测, 但这些数据集中的大部分真实视频都是在有限的场景中与几个志愿者演员一起拍摄的, 而假视频是由研究人员用一些流行的深假软件制作的。在这些数据集上开发的探测器可能不太有效, 与真实世界深假的注意力探测。为了更好地支持对现实世界深底的探测, 我们用一个新的数据存储, 包括7,314个从深度的深度的深度探测中提取的面片段, 我们从深度的深度探测中获取的深度数据, 正在使用一个小的数据, 正在使用真实的深度的检测中进行。。