Graph isomorphism is a problem for which there is no known polynomial-time solution. Nevertheless, assessing (dis)similarity between two or more networks is a key task in many areas, such as image recognition, biology, chemistry, computer and social networks. Moreover, questions of similarity are typically more general and their answers more widely applicable than the more restrictive isomorphism question. In this article, we offer a statistical answer to the following questions: a) {\it ``Are networks $G_1$ and $G_2$ similar?''}, b) {\it ``How different are the networks $G_1$ and $G_2$?''} and c) {\it ``Is $G_3$ more similar to $G_1$ or $G_2$?''}. Our comparisons begin with the transformation of each graph into an all-pairs distance matrix. Our node-node distance, Jaccard distance, has been shown to offer a good reflection of the graph's connectivity structure. We then model these distances as probability distributions. Finally, we use well-established statistical tools to gauge the (dis)similarities in terms of probability distribution (dis)similarity. This comparison procedure aims to detect (dis)similarities in connectivity structure, not in easily observable graph characteristics, such as degrees, edge counts or density. We validate our hypothesis that graphs can be meaningfully summarized and compared via their node-node distance distributions, using several synthetic and real-world graphs. Empirical results demonstrate its validity and the accuracy of our comparison technique.
翻译:然而,评估两个或两个以上网络之间的差异(不同)是许多领域的关键任务,例如图像识别、生物学、化学、计算机和社会网络。此外,相似性的问题通常比较一般,其答案比限制性程度较高的异形问题更为广泛适用。在本篇文章中,我们提供了对下列问题的统计答案:a) {(是) 网络$_1美元和易相近的2美元?'},b) {(不同) 两个或两个以上网络之间的差异是许多领域的关键任务,例如图像识别、生物学、化学、计算机和社会网络。此外,相似性的问题通常更为一般,其答案比限制性程度更广泛。我们进行比较的起点是从每张图表转换成全色距离矩阵开始的。我们的结点距离,Jacard距离可以很好地反映图表的连通性结构。我们然后将这些距离作为概率分布的准确性比较,我们用这些精确度的直径比值来测量其直径的直径比值,我们用一些直径的直径直径比值来测量其直径的直径比值,我们用这些直径直径直方的直径比,我们用一些的统计工具来测量的直径比。