Recent empirical evidence has shown that in many real-world systems, successfully represented as networks, interactions are not limited to dyads, but often involve three or more agents at a time. These data are better described by hypergraphs, where hyperlinks encode higher-order interactions among a group of nodes. In spite of the large number of works on networks, highlighting informative hyperlinks in hypergraphs obtained from real world data is still an open problem. Here we propose an analytic approach to filter hypergraphs by identifying those hyperlinks that are over-expressed with respect to a random null hypothesis, and represent the most relevant higher-order connections. We apply our method to a class of synthetic benchmarks and to several datasets. For all cases, the method highlights hyperlinks that are more informative than those extracted with pairwise approaches. Our method provides a first way to obtain statistically validated hypergraphs, separating informative connections from redundant and noisy ones.
翻译:最近的经验证据表明,在许多以网络形式得到成功代表的真实世界系统中,互动并不局限于短短,而且往往一次涉及三个或三个以上的代理商。这些数据在高音中描述得更好,超文本链接能将一组节点之间的高顺序互动编码起来。尽管在网络上做了大量工作,但从真实世界数据中获取的高频中突出信息超链接仍然是一个尚未解决的问题。我们在这里建议对过滤超高光谱采取分析方法,方法是识别那些在随机的空虚假设中表达过多的超链接,并代表最相关的高顺序连接。我们用我们的方法对一组合成基准和数组数据集进行描述。对于所有案例,该方法都强调比通过对称方法提取的超链接更具有信息性。我们的方法为获得经统计验证的高光谱提供了第一种方法,将信息连接与多余的和吵闹的连接区分开来。