Network inference is the process of learning the properties of complex networks from data. Besides using information about known links in the network, node attributes and other forms of network metadata can help to solve network inference problems. Indeed, several approaches have been proposed to introduce metadata into probabilistic network models and to use them to make better inferences. However, we know little about the effect of such metadata in the inference process. Here, we investigate this issue. We find that, rather than affecting inference gradually, adding metadata causes abrupt transitions in the inference process and in our ability to make accurate predictions, from a situation in which metadata does not play any role to a situation in which metadata completely dominates the inference process. When network data and metadata are partly correlated, metadata optimally contributes to the inference process at the transition between data-dominated and metadata-dominated regimes.
翻译:网络推论是从数据中了解复杂网络特性的过程,除了使用网络已知链接的信息外,节点属性和其他形式的网络元数据还有助于解决网络推论问题,事实上,已提出若干办法将元数据引入概率网络模型,并利用这些模型进行更好的推论。然而,我们对这种元数据在推论过程中的影响知之甚少。在这里,我们调查这一问题。我们发现,增加元数据不是逐渐影响推论,而是在推论过程和我们作出准确预测的能力方面造成突变,从元数据不发挥任何作用的情况到元数据完全主宰推论过程的情况。当网络数据和元数据部分相互关联时,元数据对数据主导和元数据主导制度过渡过程中的推论进程作出了最佳贡献。