Face image synthesis has progressed beyond the point at which humans can effectively distinguish authentic faces from synthetically generated ones. Recently developed synthetic face image detectors boast "better-than-human" discriminative ability, especially those guided by human perceptual intelligence during the model's training process. In this paper, we investigate whether these human-guided synthetic face detectors can assist non-expert human operators in the task of synthetic image detection when compared to models trained without human-guidance. We conducted a large-scale experiment with more than 1,560 subjects classifying whether an image shows an authentic or synthetically-generated face, and annotate regions that supported their decisions. In total, 56,015 annotations across 3,780 unique face images were collected. All subjects first examined samples without any AI support, followed by samples given (a) the AI's decision ("synthetic" or "authentic"), (b) class activation maps illustrating where the model deems salient for its decision, or (c) both the AI's decision and AI's saliency map. Synthetic faces were generated with six modern Generative Adversarial Networks. Interesting observations from this experiment include: (1) models trained with human-guidance offer better support to human examination of face images when compared to models trained traditionally using cross-entropy loss, (2) binary decisions presented to humans offers better support than saliency maps, (3) understanding the AI's accuracy helps humans to increase trust in a given model and thus increase their overall accuracy. This work demonstrates that although humans supported by machines achieve better-than-random accuracy of synthetic face detection, the ways of supplying humans with AI support and of building trust are key factors determining high effectiveness of the human-AI tandem.
翻译:最近开发的合成面部图像探测器具有“优于人”的歧视性能力,特别是在模型培训过程中由人类感知智能指导的。在本文中,我们调查这些人制合成面部探测器能否协助非专家人类操作者完成合成图像检测任务,而与没有人指导培训的模型相比,这些模型是没有人指导的,我们进行了1,560多个主题的大规模实验,对图像显示的真实面部或合成生成的面部以及支持其决定的注释区域进行了分类。总共收集了3,780个独特面部图像的56,015个说明。所有对象在没有任何人工智能支持的情况下首先检查样本,接下来的样本是(a) AI的决定(“合成”或“真实性”);(b) 显示模型决定的显著之处的班级启动地图;或者(c) AI的决定和大赦国际的显著的地图都支持了。 合成面部的面部面部与支持它们的决定。 (1) 6个现代的Adversari的准确性部位显示,因此所有3个传统面部图像都以经过训练的正确性网络展示的方式检查样品,然后进行样品的样本的样本测试, 显示人类测算。