We are faced with data comprised of entities interacting over time: this can be individuals meeting, customers buying products, machines exchanging packets on the IP network, among others. Capturing the dynamics as well as the structure of these interactions is of crucial importance for analysis. These interactions can almost always be labeled with content: group belonging, reviews of products, abstracts, etc. We model these stream of interactions as stream graphs, a recent framework to model interactions over time. Formal Concept Analysis provides a framework for analyzing concepts evolving within a context. Considering graphs as the context, it has recently been applied to perform closed pattern mining on social graphs. In this paper, we are interested in pattern mining in sequences of interactions. After recalling and extending notions from formal concept analysis on graphs to stream graphs, we introduce algorithms to enumerate closed patterns on a labeled stream graph, and introduce a way to select relevant closed patterns. We run experiments on two real-world datasets of interactions among students and citations between authors, and show both the feasibility and the relevance of our method.
翻译:我们面对的是由长期互动的实体构成的数据:可以是个人会议、客户购买产品、IP网络上的交换包机等。掌握这些互动的动态和结构对于分析至关重要。这些互动几乎总是可以标有内容:群体归属、产品审查、摘要等。我们将这些互动流作为流图模型,这是最近一个模拟互动的框架。正式概念分析提供了一个框架,用于分析在某种背景下演变的概念。将图表作为背景来考虑,最近被用于在社会图表上进行封闭模式的挖掘。在本文中,我们有兴趣在互动序列中进行模式的挖掘。在回顾和扩展图表上正式概念分析的概念后,我们引入算法,在标签的流图上列出封闭模式,并引入选择相关封闭模式的方法。我们实验了两个学生之间互动和作者之间引用的真实世界数据集,并展示了我们方法的可行性和相关性。