Biometrics are one of the most privacy-sensitive data. Ubiquitous authentication systems with a focus on privacy favor decentralized approaches as they reduce potential attack vectors, both on a technical and organizational level. The gold standard is to let the user be in control of where their own data is stored, which consequently leads to a high variety of devices used. Moreover, in comparison with a centralized system, designs with higher end-user freedom often incur additional network overhead. Therefore, when using face recognition for biometric authentication, an efficient way to compare faces is important in practical deployments, because it reduces both network and hardware requirements that are essential to encourage device diversity. This paper proposes an efficient way to aggregate embeddings used for face recognition based on an extensive analysis on different datasets and the use of different aggregation strategies. As part of this analysis, a new dataset has been collected, which is available for research purposes. Our proposed method supports the construction of massively scalable, decentralized face recognition systems with a focus on both privacy and long-term usability.
翻译:生物特征是最具隐私敏感性的数据之一。专注于隐私的无处不在的认证系统倾向于采用去中心化方法,因为它们减少了潜在的攻击向量,不仅在技术层面上,而且在组织层面上。黄金标准是让用户控制自己的数据存储位置,这自然导致使用各种设备。而且,与集中式系统相比,设计更注重终端用户自由的系统通常会引起额外的网络开销。因此,在使用人脸识别进行生物识别认证时,高效比对人脸很重要,因为它会减少对网络和硬件需求的依赖,这对于鼓励设备多样性至关重要。本文提出了一种基于不同数据集和不同聚合策略分析的聚合嵌入人脸识别方法,并且采集了一个可用于研究目的的新增数据集。我们提出的方法支持构建具有隐私和长期可用性的大规模、去中心化的人脸识别系统。