Deep neural networks have rapidly become the mainstream method for face recognition (FR). However, such models contain an extremely large number of parameters, which limits their deployment on embedded and low-end devices. In this work, we present an extremely lightweight and accurate FR solution, namely PocketNet. We utilize neural architecture search (NAS) to develop a new family of lightweight face-specific architectures. We additionally propose a novel training paradigm based on knowledge distillation (KD), the multi-step KD, where knowledge is distilled from the teacher model to the student model at different stages of training maturity. We conduct a detailed ablation study proving both the sanity of using NAS for the specific task of FR rather than general object classification, and the benefits of our proposed multi-step KD. We present an extensive experimental evaluation and comparisons with state-of-the-art (SOTA) compact FR models on nine different benchmarks, including large-scale evaluation benchmarks such as IJB-B, IJB-C, and MegaFace. PocketNets consistently advance SOTA FR performance on these nine mainstream benchmarks when considering the same level of model compactness. With 0.92M parameters, our smallest network, PocketNetS-128, achieves results that are highly competitive with recent SOTA compact models containing up to 4M parameters.
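To make the multi-step KD paradigm concrete, the following is a minimal PyTorch sketch of one possible realization: the student is trained against a sequence of teacher checkpoints saved at increasing stages of the teacher's training maturity. The checkpoint names, the embedding-level MSE distillation loss, the epoch schedule, and the assumption that the student returns both an embedding and classification logits are illustrative assumptions, not the paper's exact implementation (which trains FR models with a margin-penalty softmax loss; plain cross-entropy stands in here).

```python
# Minimal sketch of multi-step knowledge distillation (KD): the student
# receives knowledge from teacher snapshots taken at different stages of
# the teacher's training maturity. All names below are hypothetical.

import torch
import torch.nn.functional as F

# Hypothetical teacher checkpoints at increasing training maturity.
TEACHER_CKPTS = ["teacher_early.pt", "teacher_mid.pt", "teacher_final.pt"]

def multi_step_kd(student, teacher, loader, optimizer,
                  epochs_per_step=10, kd_weight=1.0, device="cpu"):
    """Train `student` against a sequence of teacher snapshots."""
    for ckpt in TEACHER_CKPTS:
        # Advance the teacher to its next maturity stage.
        teacher.load_state_dict(torch.load(ckpt, map_location=device))
        teacher.eval()
        for _ in range(epochs_per_step):
            for images, labels in loader:
                images, labels = images.to(device), labels.to(device)
                with torch.no_grad():
                    t_emb = F.normalize(teacher(images), dim=1)
                # Assumed student API: returns (embedding, class logits).
                s_emb, logits = student(images)
                # Distill at the embedding level (illustrative choice).
                kd_loss = F.mse_loss(F.normalize(s_emb, dim=1), t_emb)
                # Stand-in for the margin-penalty softmax classification loss.
                cls_loss = F.cross_entropy(logits, labels)
                loss = cls_loss + kd_weight * kd_loss
                optimizer.zero_grad()
                loss.backward()
                optimizer.step()
```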