We develop a theory to measure the variance and covariance of probability distributions defined on the nodes of a graph, which takes into account the distance between nodes. Our approach generalizes the usual (co)variance to the setting of weighted graphs and retains many of its intuitive and desired properties. Interestingly, we find that a number of famous concepts in graph theory and network science can be reinterpreted in this setting as variances and covariances of particular distributions. As a particular application, we define the maximum variance problem on graphs with respect to the effective resistance distance, and characterize the solutions to this problem both numerically and theoretically. We show how the maximum variance distribution is concentrated on the boundary of the graph, and illustrate this in the case of random geometric graphs. Our theoretical results are supported by a number of experiments on a network of mathematical concepts, where we use the variance and covariance as analytical tools to study the (co-)occurrence of concepts in scientific papers with respect to the (network) relations between these concepts.
翻译:我们开发了一种理论来测量图表节点所定义的概率分布的差异和共变性,该理论考虑到节点之间的距离。 我们的方法概括了加权图表设置的通常(共同)变量,并保留了其许多直观和期望属性。 有趣的是,我们发现图形理论和网络科学中的一些著名概念可以在此环境中重新解释为特定分布的差异和共变。 作为一个特殊应用,我们定义了图表中与有效抵抗距离有关的最大差异问题,并用数字和理论来描述解决这一问题的解决方案。我们展示了最大差异分布如何集中在图形的边界上,并在随机几何图中说明了这一点。我们理论结果得到一系列数学概念网络实验的支持,我们利用这些差异和共变化作为分析工具来研究科学论文中与这些概念(网络)关系有关的概念(共同)的出现。