This work tackles the problem of temporally coherent face anonymization in natural video streams.We propose JaGAN, a two-stage system starting with detecting and masking out faces with black image patches in all individual frames of the video. The second stage leverages a privacy-preserving Video Generative Adversarial Network designed to inpaint the missing image patches with artificially generated faces. Our initial experiments reveal that image based generative models are not capable of inpainting patches showing temporal coherent appearance across neighboring video frames. To address this issue we introduce a newly curated video collection, which is made publicly available for the research community along with this paper. We also introduce the Identity Invariance Score IdI as a means to quantify temporal coherency between neighboring frames.
翻译:这项工作解决了自然视频流中在时间上一致的面部匿名化问题。 我们提出“ JaGAN”,这是一个两阶段系统,首先在视频的所有单个框中检测和遮盖面部和黑色图像补丁。 第二阶段利用一个隐私保护视频创生反对立网络,目的是用人工生成的面部来绘制丢失的图像补丁。 我们最初的实验显示,基于图像的基因化模型无法在相邻视频框中绘制显示时间一致性外观的补丁。 为了解决这一问题,我们引入了新制作的视频集,与本文一起向研究界公开提供。 我们还引入了“身份差异评分IdI”作为量化相邻框间时间一致性的手段。