As a technology, ML is oblivious to societal good or bad; the field of fair machine learning has thus stepped up to propose multiple mathematical definitions, algorithms, and systems to ensure different notions of fairness in ML applications. Given this multitude of proposals, it has become imperative to formally verify the fairness metrics satisfied by different algorithms on different datasets. In this paper, we propose a \textit{stochastic satisfiability} (SSAT) framework, Justicia, that formally verifies different fairness measures of supervised learning algorithms with respect to the underlying data distribution. We instantiate Justicia on multiple classification and bias-mitigation algorithms and datasets to verify different fairness metrics, such as disparate impact, statistical parity, and equalized odds. Justicia is scalable, accurate, and operates on non-Boolean and compound sensitive attributes, unlike existing distribution-based verifiers such as FairSquare and VeriFair. Being distribution-based by design, Justicia is also more robust than sample-based verifiers, such as AIF360, that operate on specific test samples. We further provide a theoretical bound on the finite-sample error of the verified fairness measure.
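For concreteness, the group-fairness metrics named above admit the following standard textbook formulations, stated here as a hedged sketch (with $\hat{Y}$ the classifier's prediction, $Y$ the true label, $A$ the sensitive attribute taking values $a, b$, and $\epsilon, \tau$ assumed tolerance thresholds; these are the conventional definitions, not necessarily the exact operationalization inside Justicia):
\[
\begin{aligned}
\text{Statistical parity:} &\quad \big|\Pr[\hat{Y}=1 \mid A=a] - \Pr[\hat{Y}=1 \mid A=b]\big| \le \epsilon,\\[2pt]
\text{Disparate impact:} &\quad \frac{\Pr[\hat{Y}=1 \mid A=a]}{\Pr[\hat{Y}=1 \mid A=b]} \ge \tau,\\[2pt]
\text{Equalized odds:} &\quad \big|\Pr[\hat{Y}=1 \mid A=a,\, Y=y] - \Pr[\hat{Y}=1 \mid A=b,\, Y=y]\big| \le \epsilon \quad \text{for } y \in \{0,1\}.
\end{aligned}
\]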