Network representation learning (NRL) methods have received significant attention over the last years thanks to their success in several graph analysis problems, including node classification, link prediction, and clustering. Such methods aim to map each vertex of the network into a low-dimensional space in a way that the structural information of the network is preserved. Of particular interest are methods based on random walks; such methods transform the network into a collection of node sequences, aiming to learn node representations by predicting the context of each node within the sequence. In this paper, we introduce TNE, a generic framework to enhance the embeddings of nodes acquired by means of random walk-based approaches with topic-based information. Similar to the concept of topical word embeddings in Natural Language Processing, the proposed model first assigns each node to a latent community with the favor of various statistical graph models and community detection methods and then learns the enhanced topic-aware representations. We evaluate our methodology in two downstream tasks: node classification and link prediction. The experimental results demonstrate that by incorporating node and community embeddings, we are able to outperform widely-known baseline NRL models.
翻译:过去几年来,由于在包括节点分类、链接预测和集群在内的若干图表分析问题中取得了成功,网络代表学习方法在过去几年中受到极大关注,因为这些方法在包括节点分类、链接预测和集群在内的若干图表分析问题中取得了成功,这些方法旨在以保存网络结构信息的方式将网络的每个顶端映射成一个低维空间。特别令人感兴趣的是随机行走的方法;这些方法将网络转换成一组节点序列,目的是通过预测序列中每个节点的背景来学习节点表达。在本文中,我们引入了TNE,这是一个通用框架,用随机行走方式获得的节点嵌入以专题为基础的信息。与在自然语言处理中嵌入专题词的概念类似,拟议的模型首先将每个节点指派给潜在的社区,并赞同各种统计图表模型和社区检测方法,然后学习强化的专题觉测图。我们评估了我们两个下游任务的方法:节点分类和连接预测。实验结果表明,通过将节点和社区嵌入,我们能够超越众所周知的基线NRRL模型。