This work provides several fundamental characterizations of the optimal classification function under the demographic parity constraint. In the awareness framework, akin to the classical unconstrained classification case, we show that maximizing accuracy under this fairness constraint is equivalent to solving a corresponding regression problem followed by thresholding at level $1/2$. We extend this result to linear-fractional classification measures (e.g., ${\rm F}$-score, AM measure, balanced accuracy, etc.), highlighting the fundamental role played by the regression problem in this framework. Our results leverage recently developed connection between the demographic parity constraint and the multi-marginal optimal transport formulation. Informally, our result shows that the transition between the unconstrained problems and the fair one is achieved by replacing the conditional expectation of the label by the solution of the fair regression problem. Finally, leveraging our analysis, we demonstrate an equivalence between the awareness and the unawareness setups in the case of two sensitive groups.
翻译:这项工作为人口均等限制下的最佳分类功能提供了几种基本特征。在认识框架方面,类似于传统的不受限制的分类案例,我们表明,在这种公平限制下,最大限度地提高准确性相当于解决相应的回归问题,然后以1/2美元为门槛。我们把这一结果扩大到线性不轨分类措施(例如,$\F}-美元-分数、AM度、平衡准确性等),突出回归问题在这个框架中所起的基本作用。我们的成果最近发展了人口均等限制与多边际最佳运输配方之间的联系。非正式地说,我们的结果表明,在未受限制的问题与公平问题之间的过渡是通过公平回归问题的解决办法取代对标签的有条件期望而取得的。最后,我们利用我们的分析,显示了两个敏感群体的认识和意识的相等性。