As the use of deep learning in high impact domains becomes ubiquitous, it is increasingly important to assess the resilience of models. One such high impact domain is that of face recognition, with real world applications involving images affected by various degradations, such as motion blur or high exposure. Moreover, images captured across different attributes, such as gender and race, can also challenge the robustness of a face recognition algorithm. While traditional summary statistics suggest that the aggregate performance of face recognition models has continued to improve, these metrics do not directly measure the robustness or fairness of the models. Visual Psychophysics Sensitivity Analysis (VPSA) [1] provides a way to pinpoint the individual causes of failure by way of introducing incremental perturbations in the data. However, perturbations may affect subgroups differently. In this paper, we propose a new fairness evaluation based on robustness in the form of a generic framework that extends VPSA. With this framework, we can analyze the ability of a model to perform fairly for different subgroups of a population affected by perturbations, and pinpoint the exact failure modes for a subgroup by measuring targeted robustness. With the increasing focus on the fairness of models, we use face recognition as an example application of our framework and propose to compactly visualize the fairness analysis of a model via AUC matrices. We analyze the performance of common face recognition models and empirically show that certain subgroups are at a disadvantage when images are perturbed, thereby uncovering trends that were not visible using the model's performance on subgroups without perturbations.
翻译:随着在高影响领域的深层次学习的使用变得无处不在,越来越有必要评估模型的弹性。这种高影响领域之一是面部识别,真实世界应用中涉及受各种退化影响的图像,例如运动模糊或高暴露度;此外,从性别和种族等不同属性拍摄的图像,也可能挑战面部识别算法的稳健性。传统汇总统计数据表明,面部识别模型的总体性能在继续改善,但这些指标并不直接衡量模型的稳健性或公平性。视觉心理物理学感应分析(VPSA)[1]提供了一种方法,通过在数据中引入递增扰动或高暴露度等受各种退化影响的图像,来确定单个失败趋势的原因。此外,在本文中,通过扩展 VPSA的通用框架的稳健性形式,我们提出一个新的公平性评价。在这个框架内,我们可以分析模型对受扰动影响人口不同分组的面部位进行公平性表现的能力,并且通过在不使用目标直观的图像分析时,用精确性模型来精确的失败模式来测量失败的模型。我们用一个透视性分析模型来显示,通过直观性分析,我们以直观的直观的直观分析,我们通过直观的直观分析,我们以显示的直观分析显示的直观的基图。