Graph summarization is beneficial in a wide range of applications, such as visualization, interactive and exploratory analysis, approximate query processing, reducing the on-disk storage footprint, and graph processing in modern hardware. However, the bulk of the literature on graph summarization surprisingly overlooks the possibility of having edges of different types. In this paper, we study the novel problem of producing summaries of multi-relation networks, i.e., graphs where multiple edges of different types may exist between any pair of nodes. Multi-relation graphs are an expressive model of real-world activities, in which a relation can be a topic in social networks, an interaction type in genetic networks, or a snapshot in temporal graphs. The first approach that we consider for multi-relation graph summarization is a two-step method based on summarizing each relation in isolation, and then aggregating the resulting summaries in some clever way to produce a final unique summary. In doing this, as a side contribution, we provide the first polynomial-time approximation algorithm based on the k-Median clustering for the classic problem of lossless single-relation graph summarization. Then, we demonstrate the shortcomings of these two-step methods, and propose holistic approaches, both approximate and heuristic algorithms, to compute a summary directly for multi-relation graphs. In particular, we prove that the approximation bound of k-Median clustering for the single relation solution can be maintained in a multi-relation graph with proper aggregation operation over adjacency matrices corresponding to its multiple relations. Experimental results and case studies (on co-authorship networks and brain networks) validate the effectiveness and efficiency of the proposed algorithms.
翻译:图形和图的图形和图和图的组合在一系列应用中是有益的,例如视觉化、互动和探索性分析、近似查询处理、减少磁盘存储足迹、以及现代硬件的图形处理。然而,图表和图的文献大都惊人地忽略了不同类型边缘的可能性。在本文中,我们研究了制作多关系网络摘要的新颖问题,即不同类型多种边缘可能存在于任何对结点之间的图表。多关系图表是真实世界活动的直观模型,其中一种关系可以是社交网络的话题、基因网络的交互类型或时间图形的快照。我们考虑的多关系图形和图的首个方法基于对每个关系进行孤立的总结,然后以某种精巧的方式将由此产生的摘要汇编成最后的独特摘要。在做这项工作时,我们提供了基于K- Medial 组合的首个多数字和时间近比值算算算法,在单级关系中进行一个典型的单级关系操作类型和直截面的直截面的直径的直径直径的直径的直径的直径直径的直径直径直径直图和直径直径直径直的直径直径直径直方的直的直的直的直方的直方的直方的直方的直方的直方的直方的直方的直方的直方和直方的直方的直方对比比方。我们方法方法,我们展示法。我们展示法,我们展示的直方和直方的直方的直方和直方的直方的直方和直方的直方的直方的直方的直方的直方的直方的直方的直方的直方和直方的直方的直方的直方的直方的直方的直方的直方的直方的直方的直方的直方和直方的直方的直方的直方的直方的直方的直方的直方的直方的直方的直方的直方的直方的直方的直方的直方的直方的直方的直方的直方的直方的直方的直方的直方的直方的直方的直方的直方的直方的直方的直方的直方的直方