Easy access to audio-visual content on social media, combined with the availability of modern tools such as Tensorflow or Keras, open-source trained models, and economical computing infrastructure, and the rapid evolution of deep-learning (DL) methods, especially Generative Adversarial Networks (GAN), have made it possible to generate deepfakes to disseminate disinformation, revenge porn, financial frauds, hoaxes, and to disrupt government functioning. The existing surveys have mainly focused on the detection of deepfake images and videos. This paper provides a comprehensive review and detailed analysis of existing tools and machine learning (ML) based approaches for deepfake generation and the methodologies used to detect such manipulations for both audio and visual deepfakes. For each category of deepfake, we discuss information related to manipulation approaches, current public datasets, and key standards for the performance evaluation of deepfake detection techniques along with their results. Additionally, we also discuss open challenges and enumerate future directions to guide future researchers on issues that need to be considered to improve the domains of both deepfake generation and detection. This work is expected to assist the readers in understanding the creation and detection mechanisms of deepfakes, along with their current limitations and future direction.
翻译:社会媒体的视听内容容易获取,加上现有现代工具,如Tensorflow或Keras、开放源码培训模型和经济计算基础设施等,以及深学习(DL)方法的迅速演变,特别是General Adversarial Networks(GAN),使得有可能产生深刻的假象,以传播虚假信息、报复色情、金融欺诈、骗骗局、骗局和扰乱政府运作;现有调查主要侧重于发现深假图像和视频;本文件全面审查和详细分析现有的深假生成工具和基于机器学习(ML)的方法,以及用于发现此类视听深假生成的操纵方法的方法;对于每一类深假网络,我们讨论与操纵方法、当前公共数据集和深假探测技术业绩评价的关键标准及其结果;此外,我们还讨论公开的挑战和列举未来方向,以指导未来的研究人员,这些问题需要考虑,以改善深假生成和探测的领域。这项工作可望帮助读者了解当前在深度发现和探测机制方面所面临的限制。