Heterogeneous information networks(HINs) become popular in recent years for its strong capability of modelling objects with abundant information using explicit network structure. Network embedding has been proved as an effective method to convert information networks into lower-dimensional space, whereas the core information can be well preserved. However, traditional network embedding algorithms are sub-optimal in capturing rich while potentially incompatible semantics provided by HINs. To address this issue, a novel meta-path-based HIN representation learning framework named mSHINE is designed to simultaneously learn multiple node representations for different meta-paths. More specifically, one representation learning module inspired by the RNN structure is developed and multiple node representations can be learned simultaneously, where each representation is associated with one respective meta-path. By measuring the relevance between nodes with the designed objective function, the learned module can be applied in downstream link prediction tasks. A set of criteria for selecting initial meta-paths is proposed as the other module in mSHINE which is important to reduce the optimal meta-path selection cost when no prior knowledge of suitable meta-paths is available. To corroborate the effectiveness of mSHINE, extensive experimental studies including node classification and link prediction are conducted on five real-world datasets. The results demonstrate that mSHINE outperforms other state-of-the-art HIN embedding methods.
翻译:近些年来,由于利用明确的网络结构建立大量信息的模拟物体的强大能力,异质信息网络(HINs)近年来变得很受欢迎。网络嵌入已被证明是将信息网络转换为低维空间的有效方法,而核心信息则可得到妥善保存。然而,传统的网络嵌入算法在捕捉HINs提供的丰富而可能不兼容的语义时是次优化的。为解决这一问题,一个名为 mSHIN的新颖的基于超正向HIN代表学习框架旨在同时学习不同元路径的多节点表示。更具体地说,开发了一个受 RNN 结构启发的代表学习模块,可以同时学习多个节点表示,其中每个代表与一个相应的元路径相关。通过测量节点与设计目标功能的相关性,学习的模块可以应用于下游链接的预测任务。在 mSHINENE中提出了一套选择初始元路径的标准,作为MSHINENE的其他模块,这对于降低最佳的元路径选择成本非常重要,而以前没有适当的元路径的知识。在不掌握任何适当的元路径的情况下,因此可以同时开发一个代表模块,同时学习多个节点表,可以同时学习,同时学习,同时学习,可以同时学习多节表,同时学习,同时学习,同时学习,可以同时学习,同时学习,每个代表,每个代表,每个代表。通过每个代表,每个代表的每个代表都与一个不同的元路径。通过一个元路径。通过测量,通过测量,通过MSHIN-SHIN-hIN 测试方法,通过一个不同的模式,通过一个不同的模式,通过测算出真实的模型,从而验证算出真实的路径。