Biometric data contains distinctive human traits such as facial features or gait patterns. The use of biometric data permits an individuation so exact that the data is utilized effectively in identification and authentication systems. But for this same reason, privacy protections become indispensably necessary. Privacy protection is extensively afforded by the technique of anonymization. Anonymization techniques obfuscate or remove the sensitive personal data to achieve high levels of anonymity. However, the effectiveness of anonymization relies, in equal parts, on the effectiveness of the methods employed to evaluate anonymization performance. In this paper, we assess the state-of-the-art methods used to evaluate the performance of anonymization techniques for facial images and gait patterns. We demonstrate that the state-of-the-art evaluation methods have serious and frequent shortcomings. In particular, we find that the underlying assumptions of the state-of-the-art are quite unwarranted. When a method evaluating the performance of anonymization assumes a weak adversary or a weak recognition scenario, then the resulting evaluation will very likely be a gross overestimation of the anonymization performance. Therefore, we propose a stronger adversary model which is alert to the recognition scenario as well as to the anonymization scenario. Our adversary model implements an appropriate measure of anonymization performance. We improve the selection process for the evaluation dataset, and we reduce the numbers of identities contained in the dataset while ensuring that these identities remain easily distinguishable from one another. Our novel evaluation methodology surpasses the state-of-the-art because we measure worst-case performance and so deliver a highly reliable evaluation of biometric anonymization techniques.
翻译:生物计量数据包含人类的特殊特征,例如面部特征或步态模式。生物计量数据的使用使得个人的识别和认证系统精准达到了前所未有的程度。但是由于同样的原因,隐私保护变得至关重要。匿名化技术广泛用于隐私保护。匿名化技术以模糊或删除敏感个人数据以实现高度的匿名化。然而,匿名化的有效性同样取决于用于评估匿名化性能的方法的有效性。在本文中,我们评估了用于面部图像和步态模式匿名化技术性能评估的现有方法。我们证明了现有技术性能评估方法经常存在严重缺陷。特别地,我们发现现有技术的潜在假设很不足取。当用于匿名化性能评估的方法假设虚弱的攻击方或虚弱的识别场景时,所得到的评估很可能是匿名化性能的显著过高估计。因此,我们提出了一个更加强大的攻击方模型,对识别场景以及匿名化场景都要警觉。我们的攻击方模型实施了一种适当的匿名化性能度量。我们改进了评估数据集的选择过程,同时减少了数据集中包含的身份数量,并确保这些身份之间仍然容易区分。我们的新颖评估方法超越了现有技术,因为我们测量了最坏情况下的性能,从而提供了高度可靠的生物计量匿名化技术评估。