Network analysis has been a powerful tool to unveil relationships and interactions among a large number of objects. Yet its effectiveness in accurately identifying important node-node interactions is challenged by the rapidly growing network size, with data being collected at an unprecedented granularity and scale. Common wisdom to overcome such high dimensionality is collapsing nodes into smaller groups and conducting connectivity analysis on the group level. Dividing efforts into two phases inevitably opens a gap in consistency and drives down efficiency. Consensus learning emerges as a new normal for common knowledge discovery with multiple data sources available. To this end, this paper features developing a unified framework of simultaneous grouping and connectivity analysis by combining multiple data sources. The algorithm also guarantees a statistically optimal estimator.
翻译:网络分析是揭露大量天体之间的关系和相互作用的有力工具;然而,由于网络规模迅速扩大,数据正在以前所未有的颗粒和规模收集,网络分析在准确确定重要的节点互动方面的效力受到网络规模迅速扩大的挑战;克服这种高维度的共同智慧正在将节点分成较小的群体,并在小组一级进行连通性分析;分两个阶段的努力不可避免地在一致性和降低效率方面造成差距;共识学习是利用多种数据源进行共同知识发现的新常态;为此,本文件将结合多种数据源,制定一个同时组合和连接分析的统一框架;算法还保证采用统计上最理想的估算器。