With the growing use of ML in highly consequential domains, quantifying disparity with respect to protected attributes, e.g., gender, race, etc., is important. While quantifying disparity is essential, sometimes the needs of an occupation may require the use of certain features that are critical in a way that any disparity that can be explained by them might need to be exempted. E.g., in hiring a software engineer for a safety-critical application, coding-skills may be weighed strongly, whereas name, zip code, or reference letters may be used only to the extent that they do not add disparity. In this work, we propose an information-theoretic decomposition of the total disparity (a quantification inspired from counterfactual fairness) into two components: a non-exempt component which quantifies the part that cannot be accounted for by the critical features, and an exempt component that quantifies the remaining disparity. This decomposition allows one to check if the disparity arose purely due to the critical features (inspired from the business necessity defense of disparate impact law) and also enables selective removal of the non-exempt component if desired. We arrive at this decomposition through canonical examples that lead to a set of desirable properties (axioms) that a measure of non-exempt disparity should satisfy. Our proposed measure satisfies all of them. Our quantification bridges ideas of causality, Simpson's paradox, and a body of work from information theory called Partial Information Decomposition. We also obtain an impossibility result showing that no observational measure can satisfy all the desirable properties, leading us to relax our goals and examine observational measures that satisfy only some of them. We perform case studies to show how one can audit/train models while reducing non-exempt disparity.
翻译:由于在高度间接的领域中越来越多地使用ML,在受保护的属性(如性别、种族等)方面,量化差异很重要。虽然量化差异很重要,但有时职业的需要可能需要使用某些关键特征,其方式可能是任何可以解释的差异都需要豁免。例如,在为安全关键应用而雇用软件工程师时,编码技能可能会得到严格的权衡,而名称、拉链代码或参考字母可能只用于它们不会增加差异的程度。在这项工作中,我们提议将完全差异(来自反事实公平)的信息理论分解成两个部分:一个非豁免部分,其方式可能要求使用这些差异来解释任何差异。例如,在为安全关键应用而雇用一名软件工程师时,编码技能可能会得到严格的权衡,而名称、拉链代码或参考字母可能仅用于它们不会增加差异的程度。在这项工作中,我们提议将完全的不豁免部分分解成非统计(源于反事实公正的量化),我们在进行这一理论性评估时,将得出一个衡量结果,从而显示我们目前所有差异的统计结果。我们从一个衡量标准到一个衡量结果,从而可以确定一个衡量结果。