Recently, the popularity and wide use of the last-generation video conferencing technologies created an exponential growth in its market size. Such technology allows participants in different geographic regions to have a virtual face-to-face meeting. Additionally, it enables users to employ a virtual background to conceal their own environment due to privacy concerns or to reduce distractions, particularly in professional settings. Nevertheless, in scenarios where the users should not hide their actual locations, they may mislead other participants by claiming their virtual background as a real one. Therefore, it is crucial to develop tools and strategies to detect the authenticity of the considered virtual background. In this paper, we present a detection strategy to distinguish between real and virtual video conferencing user backgrounds. We demonstrate that our detector is robust against two attack scenarios. The first scenario considers the case where the detector is unaware about the attacks and inn the second scenario, we make the detector aware of the adversarial attacks, which we refer to Adversarial Multimedia Forensics (i.e, the forensically-edited frames are included in the training set). Given the lack of publicly available dataset of virtual and real backgrounds for video conferencing, we created our own dataset and made them publicly available [1]. Then, we demonstrate the robustness of our detector against different adversarial attacks that the adversary considers. Ultimately, our detector's performance is significant against the CRSPAM1372 [2] features, and post-processing operations such as geometric transformations with different quality factors that the attacker may choose. Moreover, our performance results shows that we can perfectly identify a real from a virtual background with an accuracy of 99.80%.
翻译:最近,上一代电视会议技术的受欢迎程度和广泛使用上一代电视会议技术导致其市场规模的虚拟背景急剧增长。这种技术使不同地理区域的参与者能够举行虚拟面对面的会议。此外,它使用户能够利用虚拟背景来掩盖自己的环境,因为隐私问题或减少分心,特别是在专业环境中。然而,在用户不应隐藏其实际位置的情况下,它们可能误导其他参与者,声称其虚拟背景是真实的。因此,至关重要的是,开发工具和战略来查明所考虑的虚拟背景的真实性。在本文中,我们提出了一个探测战略,以区分真实和虚拟电视会议用户的背景。我们证明我们的探测器对两种攻击情景是强大的。第一种情景是,检测者对袭击并不知情,而在第二种情景中,我们使探测器意识到对抗攻击的对抗性攻击,我们指Adversarial多媒介法医(即法医框架包含在培训组合中)。鉴于我们缺乏可公开获取的虚拟和真实背景来区分真实和虚拟的视频会议用户背景。我们证明我们的探测器对真实性能进行强力的测试,我们创建了我们自己的最终性评估结果,而我们用真实性评估了我们自己的性能评估这些结果,我们用真实性能来展示了我们自己的性能,我们自己的对结果,我们用真实性评估来展示了我们自己的反反反向的对结果。