Despite the success of large-scale empirical risk minimization (ERM) at achieving high accuracy across a variety of machine learning tasks, fair ERM is hindered by the incompatibility of fairness constraints with stochastic optimization. We consider the problem of fair classification with discrete sensitive attributes and potentially large models and data sets, requiring stochastic solvers. Existing in-processing fairness algorithms are either impractical in the large-scale setting because they require large batches of data at each iteration or they are not guaranteed to converge. In this paper, we develop the first stochastic in-processing fairness algorithm with guaranteed convergence. For demographic parity, equalized odds, and equal opportunity notions of fairness, we provide slight variations of our algorithm--called FERMI--and prove that each of these variations converges in stochastic optimization with any batch size. Empirically, we show that FERMI is amenable to stochastic solvers with multiple (non-binary) sensitive attributes and non-binary targets, performing well even with minibatch size as small as one. Extensive experiments show that FERMI achieves the most favorable tradeoffs between fairness violation and test accuracy across all tested setups compared with state-of-the-art baselines for demographic parity, equalized odds, equal opportunity. These benefits are especially significant with small batch sizes and for non-binary classification with large number of sensitive attributes, making FERMI a practical, scalable fairness algorithm. The code for all of the experiments in this paper is available at: https://github.com/optimization-for-data-driven-science/FERMI.
翻译:尽管大规模的经验风险最小化(ERM)成功地在各种机器学习任务中实现了高精度,但公平的机构风险管理却因公平限制与随机优化不相容而受到阻碍。我们考虑到以离散敏感属性和潜在大模型和数据集进行公平分类的问题,需要随机求解者。在大规模环境下,处理中的公平算法要么是不切实际的,因为每次循环都需要大量数据,或者没有保证它们会趋同。在本文件中,我们开发了第一个具有保证趋同性的处理公平性算法,而有保证的趋同性。关于人口均等、均等概率和机会均等的公平概念,我们提供了我们所谓的FERMI的算法的微变异性,并证明每一种变异性都与任何批量的随机优化相趋近。我们随机地显示,FERMI的分类法则适合多(非二元)敏感属性和非二元目标的随机求解解解解解,即使最小分数也小为一。广泛的实验显示,FERMI的算法性-最易变易易变的算法性数据在规模上比性交易的精准性,比重的基价测试。在公平度测试中,这些公平性交易的基值是公平性测算的大小。