Local graph neighborhood sampling is a fundamental computational problem that is at the heart of algorithms for node representation learning. Several works have presented algorithms for learning discrete node embeddings where graph nodes are represented by discrete features such as attributes of neighborhood nodes. Discrete embeddings offer several advantages compared to continuous word2vec-like node embeddings: ease of computation, scalability, and interpretability. We present LoNe Sampler, a suite of algorithms for generating discrete node embeddings by Local Neighborhood Sampling, and address two shortcomings of previous work. First, our algorithms have rigorously understood theoretical properties. Second, we show how to generate approximate explicit vector maps that avoid the expensive computation of a Gram matrix for the training of a kernel model. Experiments on benchmark datasets confirm the theoretical findings and demonstrate the advantages of the proposed methods.
翻译:本地图形邻里取样是一个基本的计算问题,这是节点代表学习算法的核心。 几个作品展示了学习离散节点嵌入的算法, 其中图形节点由相邻节点的属性等离散特性所代表。 分立嵌入与连续的单词2vec类节点嵌入相比具有若干优势: 计算方便、 缩放性和 可解释性。 我们展示了LoNe 样板, 由本地 Neighborhood 取样生成离散节点嵌入的一套算法, 并解决了先前工作的两个缺点。 首先, 我们的算法严格理解了理论属性。 其次, 我们展示了如何生成大致明确的矢量图, 避免为培训内核模型而花费计算Gram矩阵。 基准数据集实验证实了理论发现, 并展示了拟议方法的优点 。