Removing bias while keeping all task-relevant information is challenging for fair representation learning methods: when the sensitive attributes correlate with the labels, these methods yield representations that are random or degenerate with respect to the labels. Existing work proposes injecting label information into the learning procedure to overcome this issue. However, the assumption that the observed labels are clean is not always met. In fact, label bias is acknowledged as the primary source of discrimination. In other words, fair pre-processing methods ignore the discrimination encoded in the labels during either the learning procedure or the evaluation stage. This contradiction calls the fairness of the learned representations into question. To circumvent this issue, we explore the following question: \emph{Can we learn fair representations that are predictive of latent ideal fair labels, given access only to unreliable labels?} In this work, we propose a \textbf{D}e-\textbf{B}iased \textbf{R}epresentation Learning for \textbf{F}airness (DBRF) framework, which disentangles the sensitive information from the non-sensitive attributes while keeping the learned representations predictive of the ideal fair labels rather than the observed biased ones. We formulate the de-biased learning framework using information-theoretic concepts, namely mutual information and the information bottleneck. The core idea is that DBRF does not use the unreliable labels for supervision when sensitive information helps predict them. Experimental results on both synthetic and real-world data demonstrate that DBRF effectively learns de-biased representations with respect to the ideal labels.
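For readers unfamiliar with the two information-theoretic concepts named above, their standard textbook definitions are sketched below. The symbols are assumptions introduced here for illustration and do not come from the abstract: $X$ denotes the input, $Y$ a label, $Z$ a learned representation, and $\beta > 0$ a trade-off weight. This is the generic information bottleneck, not the DBRF objective, which the abstract does not state.

```latex
% Mutual information between random variables X and Y (standard definition).
\[
  I(X;Y) \;=\; \mathbb{E}_{p(x,y)}\!\left[\log \frac{p(x,y)}{p(x)\,p(y)}\right]
\]

% Generic information bottleneck: compress X into Z while keeping Z informative about Y.
% The encoder p(z|x) is the optimization variable; beta is an assumed trade-off weight.
\[
  \min_{p(z \mid x)} \; I(X;Z) \;-\; \beta\, I(Z;Y)
\]
```

Read against the abstract, and still only as an informal gloss, the goal would correspond to a representation $Z$ that carries little information about the sensitive attribute while remaining informative about the latent ideal label rather than the observed one.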