This paper addresses the deep face recognition problem under an open-set protocol, where ideal face features are expected to have smaller maximal intra-class distance than minimal inter-class distance under a suitably chosen metric space. To this end, hyperspherical face recognition, as a promising line of research, has attracted increasing attention and gradually become a major focus in face recognition research. As one of the earliest works in hyperspherical face recognition, SphereFace explicitly proposed to learn face embeddings with large inter-class angular margin. However, SphereFace still suffers from severe training instability which limits its application in practice. In order to address this problem, we introduce a unified framework to understand large angular margin in hyperspherical face recognition. Under this framework, we extend the study of SphereFace and propose an improved variant with substantially better training stability -- SphereFace-R. Specifically, we propose two novel ways to implement the multiplicative margin, and study SphereFace-R under three different feature normalization schemes (no feature normalization, hard feature normalization and soft feature normalization). We also propose an implementation strategy -- "characteristic gradient detachment" -- to stabilize training. Extensive experiments on SphereFace-R show that it is consistently better than or competitive with state-of-the-art methods.
翻译:本文探讨开放协议下的深层面部识别问题, 理想面部特征预计将具有比在适当选择的计量空间下最短的顶尖阶级内部距离比最小的阶级间距离小。 为此,超球面识别,作为一种有希望的研究线,吸引了越来越多的关注,并逐渐成为面部识别研究的主要焦点。 作为超球面识别的最早工作之一, Sphere Face 明确提议学习与大型阶级间三角边缘相嵌的面部。 然而, 侧面面面面面貌仍然受到严重培训不稳定的影响,限制了其实际应用。 为了解决这一问题,我们引入了一个统一框架,以理解超球面面度识别的大角边边边。 在这个框架内,我们扩展了对Sphere Face 的研究, 并提出了一个改进的变式, 其培训稳定性大大提高 -- SphereFace-R。 具体地说, 我们提出了两种新的方法来实施多相复制的边距, 以及根据三种不同特征正常化计划( 没有特征正常化、硬特征正常化和软地特征正常化) 。 我们还提议了一个执行战略 -- “ 直观的实验, 以稳定的梯形式的梯层化式的梯层化方法, 以稳定地显示, 渐渐渐渐变式的梯变式的状态。