Representation learning on user-item graph for recommendation has evolved from using single ID or interaction history to exploiting higher-order neighbors. This leads to the success of graph convolution networks (GCNs) for recommendation such as PinSage and LightGCN. Despite effectiveness, we argue that they suffer from two limitations: (1) high-degree nodes exert larger impact on the representation learning, deteriorating the recommendations of low-degree (long-tail) items; and (2) representations are vulnerable to noisy interactions, as the neighborhood aggregation scheme further enlarges the impact of observed edges. In this work, we explore self-supervised learning on user-item graph, so as to improve the accuracy and robustness of GCNs for recommendation. The idea is to supplement the classical supervised task of recommendation with an auxiliary self-supervised task, which reinforces node representation learning via self-discrimination. Specifically, we generate multiple views of a node, maximizing the agreement between different views of the same node compared to that of other nodes. We devise three operators to generate the views -- node dropout, edge dropout, and random walk -- that change the graph structure in different manners. We term this new learning paradigm as \textit{Self-supervised Graph Learning} (SGL), implementing it on the state-of-the-art model LightGCN. Through theoretical analyses, we find that SGL has the ability of automatically mining hard negatives. Empirical studies on three benchmark datasets demonstrate the effectiveness of SGL, which improves the recommendation accuracy, especially on long-tail items, and the robustness against interaction noises. Our implementations are available at \url{https://github.com/wujcan/SGL}.
翻译:在用于建议的用户项目图上进行代表学习,从使用单一ID或互动历史演变为利用高端邻居,从使用单一ID或互动历史演变为利用高端邻居。这导致图变网(GCN)成功获得PinSage和LightGCN等建议。尽管效果有效,但我们认为,它们受到两个限制:(1) 高度节点对代表学习产生更大影响,低度(长尾)项目的建议恶化;(2) 演示容易受到吵闹的相互作用,因为邻里汇总计划进一步扩大了所观察到的边缘的影响。在这项工作中,我们探索用户项目图上的自我监督学习,以提高GCN建议的准确性和稳健性。这个想法是用辅助性自我监督任务来补充受监督的经典建议任务,通过自我歧视来强化无偏重代表学习;我们对节点产生多种观点,使同一节点的不同观点与其他节点的不同观点之间取得最大程度的一致。我们设计了三个操作者来生成观点 -- 无偏差、边缘步行,以及随机行走 -- 以便提高GCN建议的准确性,从而以不同的方式改变SG的图表结构结构结构结构结构结构分析。我们学习如何进行。