We propose an information-theoretic bias measurement technique through a causal interpretation of spurious correlation, which is effective to identify the feature-level algorithmic bias by taking advantage of conditional mutual information. Although several bias measurement methods have been proposed and widely investigated to achieve algorithmic fairness in various tasks such as face recognition, their accuracy- or logit-based metrics are susceptible to leading to trivial prediction score adjustment rather than fundamental bias reduction. Hence, we design a novel debiasing framework against the algorithmic bias, which incorporates a bias regularization loss derived by the proposed information-theoretic bias measurement approach. In addition, we present a simple yet effective unsupervised debiasing technique based on stochastic label noise, which does not require the explicit supervision of bias information. The proposed bias measurement and debiasing approaches are validated in diverse realistic scenarios through extensive experiments on multiple standard benchmarks.
翻译:我们建议采用信息理论偏差计量方法,通过对虚假相关关系进行因果解释,从而有效地利用有条件的相互信息确定地貌水平算法偏差。虽然已经提出并广泛调查了几种偏差计量方法,以便在诸如面部识别等各种任务中实现算法公平性,但其精确度或日志基指标很容易导致微小的预测得分调整,而不是根本性的偏差减少。因此,我们设计了一个与算法偏差相对的新式的偏差框架,其中包括拟议的信息理论偏差计量方法所产生的偏差正规化损失。此外,我们提出了一种简单而有效的、不受监督的偏差分析技术,基于随机标签的噪音,不需要对偏差信息进行明确的监督。提议的偏差计量和偏差方法通过对多种标准基准的广泛实验,在不同现实情景中得到验证。