Although machine learning on hypergraphs has attracted considerable attention, most of the works have focused on (semi-)supervised learning, which may cause heavy labeling costs and poor generalization. Recently, contrastive learning has emerged as a successful unsupervised representation learning method. Despite the prosperous development of contrastive learning in other domains, contrastive learning on hypergraphs remains little explored. In this paper, we propose TriCL (Tri-directional Contrastive Learning), a general framework for contrastive learning on hypergraphs. Its main idea is tri-directional contrast, and specifically, it aims to maximize in two augmented views the agreement (a) between the same node, (b) between the same group of nodes, and (c) between each group and its members. Together with simple but surprisingly effective data augmentation and negative sampling schemes, these three forms of contrast enable TriCL to capture both microscopic and mesoscopic structural information in node embeddings. Our extensive experiments using 13 baseline approaches, five datasets, and two tasks demonstrate the effectiveness of TriCL, and most noticeably, TriCL consistently outperforms not just unsupervised competitors but also (semi-)supervised competitors mostly by significant margins for node classification. The code and datasets are available at https://github.com/wooner49/TriCL.
翻译:虽然高原上的机器学习吸引了相当多的注意力,但大多数作品都集中在(半)监督的学习上,这可能造成很高的标签成本和不良的概括化。最近,对比性学习作为一种成功的无监督的代表性学习方法出现。尽管其他领域的对比性学习发展得相当顺利,但高原上的对比性学习仍然很少探索。在本文中,我们提议TriCL(双向反向学习),这是高原上对比性学习的一般框架。其主要想法是三向对比,具体地说,它的目的是在两种扩大的观点中尽量扩大(a)同一节点之间的协议;(b)同一节点之间的协议;(b)同一节点之间的协议;(c)每个组及其成员之间的协议。与简单但令人惊讶的有效数据扩增和负面抽样计划一起,这三种对比形式使得TriCLL能够捕捉微型和中位结构信息。我们利用13个基线方法进行的广泛实验,5个数据集,以及两项任务显示TriCLLV/MLF的效益,而且多数是远端/跨级竞争者。