Graph representation learning has become a ubiquitous component in many scenarios, ranging from social network analysis to energy forecasting in smart grids. In several applications, ensuring the fairness of the node (or graph) representations with respect to some protected attributes is crucial for their correct deployment. Yet, fairness in graph deep learning remains under-explored, with few solutions available. In particular, the tendency of similar nodes to cluster on several real-world graphs (i.e., homophily) can dramatically worsen the fairness of these procedures. In this paper, we propose a novel biased edge dropout algorithm (FairDrop) to counter-act homophily and improve fairness in graph representation learning. FairDrop can be plugged in easily on many existing algorithms, is efficient, adaptable, and can be combined with other fairness-inducing solutions. After describing the general algorithm, we demonstrate its application on two benchmark tasks, specifically, as a random walk model for producing node embeddings, and to a graph convolutional network for link prediction. We prove that the proposed algorithm can successfully improve the fairness of all models up to a small or negligible drop in accuracy, and compares favourably with existing state-of-the-art solutions. In an ablation study, we demonstrate that our algorithm can flexibly interpolate between biasing towards fairness and an unbiased edge dropout. Furthermore, to better evaluate the gains, we propose a new dyadic group definition to measure the bias of a link prediction task when paired with group-based fairness metrics. In particular, we extend the metric used to measure the bias in the node embeddings to take into account the graph structure.
翻译:从社会网络分析到智能电网的能源预测,许多情景中,图形代表学习已成为一个无处不在的组成部分,从社会网络分析到智能电网的能源预测。 在一些应用中,确保节点(或图形)表达对于某些受保护属性的公平性对于正确部署至关重要。 然而,图形深层学习的公平性仍然未得到充分探讨,而且几乎没有什么解决办法。特别是,类似节点对多个现实世界图表(即同质)的分组的倾向会大大恶化这些程序的公平性。在本文中,我们提议一种新的偏差倾斜退出算法(FairDrop)来对抗同一行为,提高图形代表学习的公平性。 FairDrop可以很容易地插入许多现有的算法,效率高、适应性,并且可以与其他公平性激励性解决方案相结合。在描述一般算法之后,我们将其应用于两个基准任务,特别是一个随机行走模式,用来生成基于节点的嵌嵌嵌式,以及一个图形直径网络的预测。我们证明拟议的算能够成功地改善所有模型的公正性, 向一个小或可忽略的准确性比例的准确性比例,我们用来显示一个特定的计算。