Although deep face recognition has achieved impressive progress in recent years, controversy has arisen regarding discrimination based on skin tone, questioning their deployment into real-world scenarios. In this paper, we aim to systematically and scientifically study this bias from both data and algorithm aspects. First, using the dermatologist approved Fitzpatrick Skin Type classification system and Individual Typology Angle, we contribute a benchmark called Identity Shades (IDS) database, which effectively quantifies the degree of the bias with respect to skin tone in existing face recognition algorithms and commercial APIs. Further, we provide two skin-tone aware training datasets, called BUPT-Globalface dataset and BUPT-Balancedface dataset, to remove bias in training data. Finally, to mitigate the algorithmic bias, we propose a novel meta-learning algorithm, called Meta Balanced Network (MBN), which learns adaptive margins in large margin loss such that the model optimized by this loss can perform fairly across people with different skin tones. To determine the margins, our method optimizes a meta skewness loss on a clean and unbiased meta set and utilizes backward-on-backward automatic differentiation to perform a second order gradient descent step on the current margins. Extensive experiments show that MBN successfully mitigates bias and learns more balanced performance for people with different skin tones in face recognition. The proposed datasets are available at http://www.whdeng.cn/RFW/index.html.
翻译:尽管近些年来,人们深刻地认识到了这一点,但在基于肤色的歧视问题上出现了争议,因为基于皮肤的语调的歧视已经取得了令人印象深刻的进展,他们被部署到现实世界的情景中。在本文中,我们的目标是系统地和科学地研究数据和算法方面的这种偏差。首先,我们利用皮肤学家批准的菲茨帕特里克·斯金类型分类系统和个人类型字型安格,我们贡献了一个称为身份形状数据库的基准,该数据库有效地量化了现有面貌识别算法和商用API中对肤色的偏差程度。此外,我们还提供了两个有意识的皮肤板骨培训数据集,称为BUPT-Globalface数据集和BUPT-Balancedface数据集,目的是从数据和算算法两个方面来消除偏差。最后,为了减轻算法偏差,我们提出了一个新的元学习算法,称为Meta平衡网络(MMBN),该算法在大范围内学到了适应性的差差差幅,因此,通过这种损失最优化的模型可以在不同的皮肤人之间公平表现。为了确定边际的差,我们的方法,我们的方法在干净和不偏差的代代代的MEBLA上最优的基底的底的底的底的底的底的底的底的底,并显示显示显示,人们可以成功地显示。