Graph similarity measurement, which computes the distance/similarity between two graphs, arises in various graph-related tasks. Recent learning-based methods lack interpretability, as they directly transform interaction information between two graphs into one hidden vector and then map it to similarity. To cope with this problem, this study proposes a more interpretable end-to-end paradigm for graph similarity learning, named Similarity Computation via Maximum Common Subgraph Inference (INFMCS). Our critical insight into INFMCS is the strong correlation between similarity score and Maximum Common Subgraph (MCS). We implicitly infer MCS to obtain the normalized MCS size, with the supervision information being only the similarity score during training. To capture more global information, we also stack some vanilla transformer encoder layers with graph convolution layers and propose a novel permutation-invariant node Positional Encoding. The entire model is quite simple yet effective. Comprehensive experiments demonstrate that INFMCS consistently outperforms state-of-the-art baselines for graph-graph classification and regression tasks. Ablation experiments verify the effectiveness of the proposed computation paradigm and other components. Also, visualization and statistics of results reveal the interpretability of INFMCS.
翻译:计算两个图形之间的距离/相似度的图形相似度测量,产生于各种与图形有关的任务。最近的学习方法缺乏解释性,因为它们直接将两个图形之间的相互作用信息转换成一个隐藏的矢量,然后将其映射为相似性。为解决这一问题,本研究提出一个更可解释的图相似度学习端到端模式,名为“通过最大共同子值推断的相似度计算”(INFMCS),我们对于国际红外线和红外线中心的重要洞察力是相似度分和最大共同子集(MCS)之间的紧密关联。我们隐含地将MCS推算以获得正常的MCS尺寸,而监督信息只是培训期间的相似度分数。为了获取更多的全球信息,我们还将一些香草变压变压器编码器编码器的层堆积成图变相层,并提议一种新颖的变异性内置位置定位编码。整个模型非常简单,但有效。全面实验表明,国际红外线和红外科系统在图形化分类和回归任务方面始终超越了最新基线。对比实验核查了拟议FMIS统计模型和其他组成部分的有效性。