Concept relatedness estimation (CRE) aims to determine whether two given concepts are related. Existing methods only consider the pairwise relationship between concepts, while overlooking the higher-order relationship that could be encoded in a concept-level graph structure. We discover that this underlying graph satisfies a set of intrinsic properties of CRE, including reflexivity, commutativity, and transitivity. In this paper, we formalize the CRE properties and introduce a graph structure named ConcreteGraph. To address the data scarcity issue in CRE, we introduce a novel data augmentation approach to sample new concept pairs from the graph. As it is intractable for data augmentation to fully capture the structural information of the ConcreteGraph due to a large amount of potential concept pairs, we further introduce a novel Graph Component Contrastive Learning framework to implicitly learn the complete structure of the ConcreteGraph. Empirical results on three datasets show significant improvement over the state-of-the-art model. Detailed ablation studies demonstrate that our proposed approach can effectively capture the high-order relationship among concepts.
翻译:概念相关估计(CRE)旨在确定两个特定概念是否相关。现有方法只考虑概念之间的对称关系,而忽略概念级图表结构中可以编码的较高顺序关系。我们发现,这一基本图表满足了CRE的一套内在特性,包括反应性、中枢性和中转性。在本文中,我们正式确定了CRE属性,并引入了一个名为Cemple Graph的图形结构结构。为了解决CRE中的数据稀缺问题,我们采用了一种新颖的数据增强方法从图表中抽取新概念配对。由于大量潜在概念配对,数据增强难以完全捕捉到CeuteGraph的结构信息。我们进一步引入了一个新的图形组合差异学习框架,以隐含地学习CeuteGraph的完整结构。三个数据集的实证结果显示,比最新模型有了显著改进。详细的关系研究表明,我们提出的方法能够有效地捕捉到概念之间的高阶关系。