Hypergraphs are a common model for multiway relationships in data, and hypergraph semi-supervised learning is the problem of assigning labels to all nodes in a hypergraph, given labels on just a few nodes. Diffusions and label spreading are classical techniques for semi-supervised learning in the graph setting, and there are some standard ways to extend them to hypergraphs. However, these methods are linear models, and do not offer an obvious way of incorporating node features for making predictions. Here, we develop a nonlinear diffusion process on hypergraphs that spreads both features and labels following the hypergraph structure, which can be interpreted as a hypergraph equilibrium network. Even though the process is nonlinear, we show global convergence to a unique limiting point for a broad class of nonlinearities, which is the global optimum of a interpretable, regularized semi-supervised learning loss function. The limiting point serves as a node embedding from which we make predictions with a linear model. Our approach is much more accurate than several hypergraph neural networks, and also takes less time to train.
翻译:测谎仪是数据中多路关系的常见模型, 高光半监视的学习是将标签分配给高光谱中所有节点的问题, 仅在几个节点上贴上标签。 扩散和标签是图形设置中半监督学习的古典技术, 并且有一些标准的方法可以将其扩展至高光学。 但是, 这些方法都是线性模型, 并不提供将节点特性纳入预测的明显方法。 在这里, 我们开发了一个超光学上的非线性扩散过程, 传播在高光学结构之后的特征和标签, 它可以被解释为高光学平衡网络。 尽管这个过程是非线性, 我们显示全球趋同为非线性大类非线性学习的独特限制点, 这是可解释的、 常规的半监督性学习损失功能的全球最佳之处 。 限制点作为我们用线性模型作出预测的节点。 我们的方法比几个高光线线线网络更准确, 也比培训的时间要少。