The Weisfeiler-Lehman (WL) test is a widely used algorithm in graph machine learning, including graph kernels, graph metrics, and graph neural networks. However, it focuses only on the consistency of the graph, which means that it is unable to detect slight structural differences. Consequently, this limits its ability to capture structural information, which also limits the performance of existing models that rely on the WL test. This limitation is particularly severe for traditional metrics defined by the WL test, which cannot precisely capture slight structural differences. In this paper, we propose a novel graph metric called the Wasserstein WL Subtree (WWLS) distance to address this problem. Our approach leverages the WL subtree as structural information for node neighborhoods and defines node metrics using the $L_1$-approximated tree edit distance ($L_1$-TED) between WL subtrees of nodes. Subsequently, we combine the Wasserstein distance and the $L_1$-TED to define the WWLS distance, which can capture slight structural differences that may be difficult to detect using conventional metrics. We demonstrate that the proposed WWLS distance outperforms baselines in both metric validation and graph classification experiments.
翻译:Weisfeiler-Lehman(WL)检验是图机器学习中广泛使用的算法,包括图内核、图度量和图神经网络。然而,它仅关注图的一致性,这意味着它无法检测轻微的结构差异。因此,这限制了它捕捉结构信息的能力,也限制了依赖WL测试的现有模型的性能。这种限制对于传统的由WL测试定义的度量尤其严重,它们无法精确捕捉轻微的结构差异。在本文中,我们提出了一种新的图度量方法,称为基于Wasserstein的Weisfeiler-Lehman子树(WWLS)距离,用于解决这个问题。我们的方法利用WL子树作为节点邻域的结构信息,使用节点WL子树之间的$L_1$近似树编辑距离($L_1$-TED)定义节点度量。随后,我们结合Wasserstein距离和$L_1$-TED定义WWLS距离,可以捕捉可能难以使用常规度量检测到的轻微结构差异。我们证明了所提出的WWLS距离在度量验证和图分类实验中优于基线。