The full range of activity in a temporal network is captured in its edge activity data -- time series encoding the tie strengths or on-off dynamics of each edge in the network. However, in many practical applications, edge-level data are unavailable, and the network analyses must rely instead on node activity data which aggregates the edge-activity data and thus is less informative. This raises the question: Is it possible to use the static network to recover the richer edge activities from the node activities? Here we show that recovery is possible, often with a surprising degree of accuracy given how much information is lost, and that the recovered data are useful for subsequent network analysis tasks. Recovery is more difficult when network density increases, either topologically or dynamically, but exploiting dynamical and topological sparsity enables effective solutions to the recovery problem. We formally characterize the difficulty of the recovery problem both theoretically and empirically, proving the conditions under which recovery errors can be bounded and showing that, even when these conditions are not met, good quality solutions can still be derived. Effective recovery carries both promise and peril, as it enables deeper scientific study of complex systems but in the context of social systems also raises privacy concerns when social information can be aggregated across multiple data sources.
翻译:时间网络的全部活动范围都记录在它的边缘活动数据中 -- -- 时间序列将网络中每个边缘的连接强力或现成动态进行编码。然而,在许多实际应用中,边缘数据是不存在的,而网络分析则必须依赖汇集边缘活动数据并因此较少提供信息的节点活动数据。这提出了这样一个问题:能否利用静态网络从节点活动中恢复较富的边缘活动?我们在这里表明,恢复是可能的,由于信息损失了多少,恢复的准确度往往令人吃惊,而且回收的数据对随后的网络分析任务有用。当网络密度增加时,无论是从结构上还是动态上看,但利用动态和表面的宽度使恢复问题得到有效解决办法时,恢复就更加困难。我们正式地描述复苏问题在理论上和实验上的困难,证明恢复错误可以被捆绑起来的条件,并且表明,即使没有达到这些条件,仍然可以得出高质量的解决办法。有效的恢复既有可能带来希望,也有可能带来危险,因为它有助于对复杂的系统进行更深入的科学研究,但在社会系统的背景下,当社会信息系统的多种数据来源都会引起隐私问题时,因此,恢复问题也会引起各种关注。