Given a graph with partial observations of node features, how can we estimate the missing features accurately? Feature estimation is a crucial problem for analyzing real-world graphs whose features are commonly missing during the data collection process. Accurate estimation not only provides diverse information of nodes but also supports the inference of graph neural networks that require the full observation of node features. However, designing an effective approach for estimating high-dimensional features is challenging, since it requires an estimator to have large representation power, increasing the risk of overfitting. In this work, we propose SVGA (Structured Variational Graph Autoencoder), an accurate method for feature estimation. SVGA applies strong regularization to the distribution of latent variables by structured variational inference, which models the prior of variables as Gaussian Markov random field based on the graph structure. As a result, SVGA combines the advantages of probabilistic inference and graph neural networks, achieving state-of-the-art performance in real datasets.
翻译:根据部分观测节点特征的图表,我们如何准确估计缺失的特征? 地貌估计是分析在数据收集过程中通常缺乏特征的真实世界图形的一个关键问题。准确估计不仅提供不同节点信息,而且支持需要充分观察节点特征的图形神经网络的推论。然而,设计一个有效估计高维特征的方法具有挑战性,因为它要求测量器拥有巨大的代表力,增加过度配置的风险。在这项工作中,我们提议SVGA(结构化变异图解自动编码器),这是地貌估计的精确方法。 SVGA通过结构化变异推理对潜在变量的分布进行严格的规范化,根据图形结构将先前变量作为Gausian Markov随机字段的模型。因此,SVGA将概率推论和图形神经网络的优势结合起来,从而在真实数据集中实现最先进的性能。