Concern regarding the wide-spread use of fraudulent images/videos in social media necessitates precise detection of such fraud. The importance of facial expressions in communication is widely known, and adversarial attacks often focus on manipulating the expression related features. Thus, it is important to develop methods that can detect manipulations in facial expressions, and localize the manipulated regions. To address this problem, we propose a framework that is able to detect manipulations in facial expression using a close combination of facial expression recognition and image manipulation methods. With the addition of feature maps extracted from the facial expression recognition framework, our manipulation detector is able to localize the manipulated region. We show that, on the Face2Face dataset, where there is abundant expression manipulation, our method achieves over 3% higher accuracy for both classification and localization of manipulations compared to state-of-the-art methods. In addition, results on the NeuralTextures dataset where the facial expressions corresponding to the mouth regions have been modified, show 2% higher accuracy in both classification and localization of manipulation. We demonstrate that the method performs at-par with the state-of-the-art methods in cases where the expression is not manipulated, but rather the identity is changed, thus ensuring generalizability of the approach.
翻译:对社交媒体广泛使用欺诈性图像/视频的担忧要求准确发现此类欺诈。在沟通中面部表达方式的重要性广为人知,对抗性攻击往往侧重于操纵表达方式的相关特征。因此,开发能够检测面部表达方式的操控操作的方法非常重要,并使被操纵区域本地化。为了解决这一问题,我们提出了一个框架,能够利用面部表达识别和图像操控方法的密切结合来检测面部表达方式的操控操作。加上从面部表达识别框架提取的特征地图,我们的操纵检测器能够将被操纵区域本地化。我们在Face2Face数据集中显示,在有大量表达方式操纵的地方,我们的方法在对操纵的分类和本地化方面都达到了3%以上。此外,在与口部区域相对的面部表达方式已经修改的地方化方法方面,我们提出了一个框架,显示在分类和本地化两方面都提高了2%的精准度。我们显示,在Face2Face数据集中,在有大量表达方式操纵的情况下,我们的方法在一般的身份操控情况下,能够确保表达方式的精准性。