Network inference is the process of deciding what is the true unknown graph underlying a set of interactions between nodes. There is a vast literature on the subject, but most known methods have an important drawback: the inferred graph is not guaranteed to explain every interaction from the input trace. We consider this an important issue since such inferred graph cannot be used as input for applications that require a reliable estimate of the true graph. On the other hand, a graph having trace feasibility guarantees can help us better understand the true (hidden) interactions that may have taken place between nodes of interest. The inference of such graph is the goal of this paper. Firstly, given an activity log from a social network, we introduce a set of constraints that take into consideration all the hidden paths that are possible between the nodes of the trace, given their timestamps of interaction. Then, we develop a nontrivial modification of the Expectation-Maximization algorithm by Newman [1], that we call Constrained-EM, which incorporates the constraints and a set of auxiliary variables into the inference process to guide it towards the feasibility of the trace. Experimental results on real-world data from Twitter confirm that Constrained-EM generates a posterior distribution of graphs that explains all the events observed in the trace while presenting the desired properties of a scale-free, small-world graph. Our method also outperforms established methods in terms of feasibility and quality of the inferred graph.
翻译:网络推论是决定什么是真正未知的图形的过程,这是节点之间一系列相互作用的基础。 关于这个主题, 有大量文献, 但最已知的方法有一个重要的缺点: 推断的图表不能保证解释输入跟踪的每一种相互作用。 我们认为这是一个重要问题, 因为这种推断的图表不能用作需要可靠估计真实图的应用程序的输入。 另一方面, 具有追踪可行性保证的图表可以帮助我们更好地了解在节点之间可能发生的真实( 隐藏的) 相互作用。 这种图表的推论是本文的目的。 首先, 鉴于一个社交网络的活动记录, 我们引入了一系列制约, 它将考虑到跟踪节点之间可能存在的所有隐藏路径, 因为它们是互动的时标。 然后, 我们开发了新曼[ 的期待- 最大化算法的非重大修改, 我们称之为 Constracert- EEM, 将制约和一系列辅助变量纳入推论进程, 以指导它实现跟踪。 首先, 我们的图表质量记录, 实验性结果 将真实的图表形式 展示了我们所观察到的图表的图表形式上所观察到的图表的缩略图。