Deepfakes are becoming increasingly popular in both good faith applications such as in entertainment and maliciously intended manipulations such as in image and video forgery. Primarily motivated by the latter, a large number of deepfake detectors have been proposed recently in order to identify such content. While the performance of such detectors still need further improvements, they are often assessed in simple if not trivial scenarios. In particular, the impact of benign processing operations such as transcoding, denoising, resizing and enhancement are not sufficiently studied. This paper proposes a more rigorous and systematic framework to assess the performance of deepfake detectors in more realistic situations. It quantitatively measures how and to which extent each benign processing approach impacts a state-of-the-art deepfake detection method. By illustrating it in a popular deepfake detector, our benchmark proposes a framework to assess robustness of detectors and provides valuable insights to design more efficient deepfake detectors.
翻译:深海假象在娱乐和恶意操纵(如图像和视频伪造)等善意应用中越来越受欢迎。主要出于后者的动机,最近提出了大量深假探测器,以确定此类内容。虽然这类探测器的性能仍然需要进一步改进,但往往以简单(甚至微不足道)的假想来评估,特别是,对诸如转码、脱色、重新定级和增强等良性加工操作的影响研究不够充分。本文件提出一个更加严格和系统化的框架,用以评估深假探测器在更现实情况下的性能。它量化地衡量每种良性处理方法如何和在多大程度上影响最先进的深假探测器。通过在流行的深假探测器中加以展示,我们的基准提出了一个框架,用以评估探测器的稳健性,并为设计更高效的深假探测器提供宝贵的见解。