A powerful framework for studying graphs is to consider them as geometric graphs: nodes are randomly sampled from an underlying metric space, and any pair of nodes is connected if their distance is less than a specified neighborhood radius. Currently, the literature mostly focuses on uniform sampling and constant neighborhood radius. However, real-world graphs are likely to be better represented by a model in which the sampling density and the neighborhood radius can both vary over the latent space. For instance, in a social network communities can be modeled as densely sampled areas, and hubs as nodes with larger neighborhood radius. In this work, we first perform a rigorous mathematical analysis of this (more general) class of models, including derivations of the resulting graph shift operators. The key insight is that graph shift operators should be corrected in order to avoid potential distortions introduced by the non-uniform sampling. Then, we develop methods to estimate the unknown sampling density in a self-supervised fashion. Finally, we present exemplary applications in which the learnt density is used to 1) correct the graph shift operator and improve performance on a variety of tasks, 2) improve pooling, and 3) extract knowledge from networks. Our experimental findings support our theory and provide strong evidence for our model.
翻译:用于研究图表的强大框架是将其视为几何图形:节点是从一个基础的计量空间随机抽样,如果距离低于特定的邻里半径,则任何对节点都可以连接。目前,文献大多侧重于统一的抽样和常态邻里半径。然而,真实世界图可能由取样密度和邻里半径两者在潜伏空间上都不同的模型来更好地代表。例如,在社会网络社区中,可将节点建为密集抽样区,中心建为大邻里半径的节点。在这项工作中,我们首先对这一(更一般的)类模型进行严格的数学分析,包括由此产生的图形转换操作器的衍生。关键洞察力是,图形转换操作器应当加以纠正,以避免非统一取样所引入的潜在扭曲。然后,我们制定方法,以自我监督的方式估计未知的采样密度。最后,我们提出了示范性应用方法,其中学习的密度被用于(1) 校正图形转换操作器,并改进各种任务的性,(2) 改进组合,(3) 从网络中提取知识。我们的实验性发现为我们提供了强有力的理论和证据。