Recent studies on Graph Convolutional Networks (GCNs) reveal that the initial node representations (i.e., the node representations before the first-time graph convolution) largely affect the final model performance. However, when learning the initial representation for a node, most existing work linearly combines the embeddings of node features, without considering the interactions among the features (or feature embeddings). We argue that when the node features are categorical, e.g., in many real-world applications like user profiling and recommender system, feature interactions usually carry important signals for predictive analytics. Ignoring them will result in suboptimal initial node representation and thus weaken the effectiveness of the follow-up graph convolution. In this paper, we propose a new GCN model named CatGCN, which is tailored for graph learning when the node features are categorical. Specifically, we integrate two ways of explicit interaction modeling into the learning of initial node representation, i.e., local interaction modeling on each pair of node features and global interaction modeling on an artificial feature graph. We then refine the enhanced initial node representations with the neighborhood aggregation-based graph convolution. We train CatGCN in an end-to-end fashion and demonstrate it on semi-supervised node classification. Extensive experiments on three tasks of user profiling (the prediction of user age, city, and purchase level) from Tencent and Alibaba datasets validate the effectiveness of CatGCN, especially the positive effect of performing feature interaction modeling before graph convolution.
翻译:最近对图表革命网络(GCN)的研究显示,最初的节点表示(即第一次图形演变之前的节点表示)主要影响最后模型性能。然而,当学习节点的初步表示时,大多数现有工作线性地结合了节点特性的嵌入,而没有考虑到各特点(或嵌入特性)之间的相互作用。我们争辩说,当节点特征是绝对的,例如,在许多现实世界应用程序中,如用户剖析和建议系统,特征互动通常含有预测解析的重要信号。忽略它们将导致低于最优化的初步节点表示,从而削弱后续图表演变的效果。在本文中,我们提议一个新的GCN模型名为CatGCN,在结结结结点特征明确时专门用于图形学习。具体地说,我们将两种明确的互动模式纳入最初节点表示模式的学习中,即,在每对节点特征进行地方互动的建模,以及全球互动模拟模型,在人造时,我们先在模型化阶段进行更精确的G值分析,然后在模型前,我们用更精细的图形进行初步的缩缩缩缩略地分析。