重新思考强力代表制学习在精美的吵闹脸孔下 (Rethinking Robust Representation Learning Under Fine-grained Noisy Faces)

Learning robust feature representation from large-scale noisy faces stands out as one of the key challenges in high-performance face recognition. Recent attempts have been made to cope with this challenge by alleviating the intra-class conflict and inter-class conflict. However, the unconstrained noise type in each conflict still makes it difficult for these algorithms to perform well. To better understand this, we reformulate the noise type of each class in a more fine-grained manner as N-identities|K^C-clusters. Different types of noisy faces can be generated by adjusting the values of \nkc. Based on this unified formulation, we found that the main barrier behind the noise-robust representation learning is the flexibility of the algorithm under different N, K, and C. For this potential problem, we propose a new method, named Evolving Sub-centers Learning~(ESL), to find optimal hyperplanes to accurately describe the latent space of massive noisy faces. More specifically, we initialize M sub-centers for each class and ESL encourages it to be automatically aligned to N-identities|K^C-clusters faces via producing, merging, and dropping operations. Images belonging to the same identity in noisy faces can effectively converge to the same sub-center and samples with different identities will be pushed away. We inspect its effectiveness with an elaborate ablation study on the synthetic noisy dataset with different N, K, and C. Without any bells and whistles, ESL can achieve significant performance gains over state-of-the-art methods on large-scale noisy faces

翻译：大规模吵闹的面孔具有很强的学习特点,这是高性能所面临的关键挑战之一。最近有人试图通过缓解阶级内部冲突和阶级间冲突来应对这一挑战。然而,每次冲突中不受限制的噪音类型仍然使得这些算法难以很好地发挥作用。为了更好地理解这一点,我们将每个阶级的噪音类型以更细微的方式重新配制成N身份=KQC-集群。通过调整每个阶级的面孔,可以产生不同种类的吵闹面孔。基于这一统一公式,我们发现,噪音-褐色代表性学习背后的主要障碍是不同N、K和C类内部的算法的灵活性。对于这一潜在问题,我们提出了一种名为“流动子中心学习~(ESL)”的新方法,以最精细的方式重新配置每个阶级的噪音类型,以精确地描述大张扬声的面孔。更具体地说,我们为每个阶级的M分点和ESL鼓励它自动与N身份-KQQ-L的面孔相匹配。我们发现,噪音-B类代表的主要障碍是不同的算法,通过制作、合并和不断升级的图像,我们可以有效地推进一个不同的图像。