Diffusion models (DMs) have recently emerged as a promising method in image synthesis. They have surpassed generative adversarial networks (GANs) in both diversity and quality, and have achieved impressive results in text-to-image and image-to-image modeling. However, to date, only little attention has been paid to the detection of DM-generated images, which is critical to prevent adverse impacts on our society. Although prior work has shown that GAN-generated images can be reliably detected using automated methods, it is unclear whether the same methods are effective against DMs. In this work, we address this challenge and take a first look at detecting DM-generated images. We approach the problem from two different angles: First, we evaluate the performance of state-of-the-art detectors on a variety of DMs. Second, we analyze DM-generated images in the frequency domain and study different factors that influence the spectral properties of these images. Most importantly, we demonstrate that GANs and DMs produce images with different characteristics, which requires adaptation of existing classifiers to ensure reliable detection. We believe this work provides the foundation and starting point for further research to detect DM deepfakes effectively.
翻译:投影模型(DMs)最近成为图像合成的一个很有希望的方法,在多样性和质量方面超过了基因对抗网络(GANs),在文字到图像和图像到图像的建模方面取得了令人印象深刻的成果,然而,迄今为止,很少注意探测DM产生的图像,这对防止对我们的社会产生不利影响至关重要。虽然先前的工作表明,GAN产生的图像可以使用自动化方法可靠地检测出来,但GAN产生的图像是否对DMs有效尚不清楚。在这项工作中,我们应对这一挑战,首先考察DMS产生的图像。我们从两个不同的角度来处理这个问题:第一,我们评估DMs各种图像的最新探测器的性能。第二,我们分析DM在频率领域产生的图像,研究影响这些图像光谱特性的不同因素。最重要的是,我们证明GAN和DMs产生的图像具有不同的特点,需要对现有分类器进行修改,以确保可靠的探测。我们认为,这项工作提供了基础和起点,以便进一步有效地研究DMS。