Many real-world graphs or networks are temporal, e.g., in a social network persons only interact at specific points in time. This information directs dissemination processes on the network, such as the spread of rumors, fake news, or diseases. However, the current state-of-the-art methods for supervised graph classification are designed mainly for static graphs and may not be able to capture temporal information. Hence, they are not powerful enough to distinguish between graphs modeling different dissemination processes. To address this, we introduce a framework to lift standard graph kernels to the temporal domain. Specifically, we explore three different approaches and investigate the trade-offs between loss of temporal information and efficiency. Moreover, to handle large-scale graphs, we propose stochastic variants of our kernels with provable approximation guarantees. We evaluate our methods on a wide range of real-world social networks. Our methods beat static kernels by a large margin in terms of accuracy while still being scalable to large graphs and data sets. Hence, we confirm that taking temporal information into account is crucial for the successful classification of dissemination processes.
翻译:许多真实世界的图表或网络是时间性的,例如,在一个社交网络中,许多真实世界的图表或网络只是时间上的相互作用。这种信息指导着网络的传播过程,例如流言、假新闻或疾病的传播。然而,目前最先进的监督图表分类方法主要是为静态图表设计的,可能无法捕捉时间信息。因此,它们不够强大,不足以区分不同传播过程的模型。为此,我们引入了一个框架,将标准图形内核提升到时间域。具体地说,我们探索三种不同的方法,并调查时间信息损失与效率之间的取舍。此外,为了处理大比例的图表,我们提出了具有可辨别近似保证的我们内核的随机变量。我们用大量真实世界社交网络来评估我们的方法。我们的方法在准确性方面将静态内核打得很大,同时仍然可以对大图表和数据集进行缩放。因此,我们确认,将时间信息考虑在内对于传播过程的成功分类至关重要。