Without deploying face anti-spoofing countermeasures, face recognition systems can be spoofed by presenting a printed photo, a video, or a silicon mask of a genuine user. Thus, face presentation attack detection (PAD) plays a vital role in providing secure facial access to digital devices. Most existing video-based PAD countermeasures lack the ability to cope with long-range temporal variations in videos. Moreover, the key-frame sampling prior to the feature extraction step has not been widely studied in the face anti-spoofing domain. To mitigate these issues, this paper provides a data sampling approach by proposing a video processing scheme that models the long-range temporal variations based on Gaussian Weighting Function. Specifically, the proposed scheme encodes the consecutive t frames of video sequences into a single RGB image based on a Gaussian-weighted summation of the t frames. Using simply the data sampling scheme alone, we demonstrate that state-of-the-art performance can be achieved without any bells and whistles in both intra-database and inter-database testing scenarios for the three public benchmark datasets; namely, Replay-Attack, MSU-MFSD, and CASIA-FASD. In particular, the proposed scheme provides a much lower error (from 15.2% to 6.7% on CASIA-FASD and 5.9% to 4.9% on Replay-Attack) compared to baselines in cross-database scenarios.
翻译:在不部署面部防伪反措施的情况下,面部识别系统可以通过展示印刷照片、视频或真实用户的硅面罩来掩盖。 因此, 面部显示攻击检测( PAD) 在提供数字设备的安全面部访问方面发挥着至关重要的作用。 大部分现有的基于视频的 PAD 应对措施缺乏应对视频远程时间变异的能力。 此外, 地段提取步骤之前的关键框架抽样在面对反污射领域没有进行广泛研究。 为缓解这些问题,本文提供了一个数据抽样处理方法,提出了一个视频处理方案,以基于高西亚 Weighting 功能的远程时间变异为模型。 具体地说, 拟议的方案将连续的视频序列的t框架编码成单一的 RGB 图像, 其依据是高斯加权的对图框的对比。 仅仅使用数据取样方案,我们证明, 可以在内部数据库和数据库间数据库测试情景时, 可以实现最新业绩,而不能在三个公共基准数据- IMFA(即, 将特定比例为 ) 和 IMFA 的低级数据分析模型中, 提供快速的 RBA- sB- sDA- sal- sal- sDDA 和 。