Graph embedding techniques are pivotal in real-world machine learning tasks that operate on graph-structured data, such as social recommendation and protein structure modeling. Embeddings are mostly performed on the node level for learning representations of each node. Since the formation of a graph is inevitably affected by certain sensitive node attributes, the node embeddings can inherit such sensitive information and introduce undesirable biases in downstream tasks. Most existing works impose ad-hoc constraints on the node embeddings to restrict their distributions for unbiasedness/fairness, which however compromise the utility of the resulting embeddings. In this paper, we propose a principled new way for unbiased graph embedding by learning node embeddings from an underlying bias-free graph, which is not influenced by sensitive node attributes. Motivated by this new perspective, we propose two complementary methods for uncovering such an underlying graph, with the goal of introducing minimum impact on the utility of the embeddings. Both our theoretical justification and extensive experimental comparisons against state-of-the-art solutions demonstrate the effectiveness of our proposed methods.
翻译:图形嵌入技术在以图表结构数据(如社会建议和蛋白结构建模)操作的实际机器学习任务中至关重要。嵌入技术大多在每个节点的节点层次上进行,以学习每个节点的演示。由于图表的形成不可避免地受到某些敏感节点属性的影响,结点嵌入技术可以继承这种敏感信息,并在下游任务中引入不可取的偏见。大多数现有作品都对节点嵌入施加了特别的制约,以限制其分配,从而实现公正性/公平性,这无论如何损害由此形成的嵌入的效用。在本文件中,我们提出了一种原则性的新方式,即通过学习一个基本的无偏心嵌入图(不受敏感节点属性的影响)来嵌入无偏心图形。受这种新视角的驱动,我们提出了两种补充方法来发现这种深点图,目的是对嵌入的效用产生最小影响。我们的理论理由和对最新解决方案的广泛实验性比较都表明了我们拟议方法的有效性。