Many real-world data can be modeled as heterogeneous graphs that contain multiple types of nodes and edges. Meanwhile, due to excellent performance, heterogeneous graph neural networks (GNNs) have received more and more attention. However, the existing work mainly focuses on the design of novel GNN models, while ignoring another important issue that also has a large impact on the model performance, namely the missing attributes of some node types. The handcrafted attribute completion requires huge expert experience and domain knowledge. Also, considering the differences in semantic characteristics between nodes, the attribute completion should be fine-grained, i.e., the attribute completion operation should be node-specific. Moreover, to improve the performance of the downstream graph learning task, attribute completion and the training of the heterogeneous GNN should be jointly optimized rather than viewed as two separate processes. To address the above challenges, we propose a differentiable attribute completion framework called AutoAC for automated completion operation search in heterogeneous GNNs. We first propose an expressive completion operation search space, including topology-dependent and topology-independent completion operations. Then, we propose a continuous relaxation schema and further propose a differentiable completion algorithm where the completion operation search is formulated as a bi-level joint optimization problem. To improve the search efficiency, we leverage two optimization techniques: discrete constraints and auxiliary unsupervised graph node clustering. Extensive experimental results on real-world datasets reveal that AutoAC outperforms the SOTA handcrafted heterogeneous GNNs and the existing attribute completion method
翻译:许多真实世界的数据可以建模为包含多种节点和边缘的混合图解。 同时,由于性能优异,多元图形神经网络(GNNS)越来越受到越来越多的关注。然而,现有工作主要侧重于设计新型GNN模型,而忽视了对模型性能有重大影响的另一个重要问题,即某些节点类型缺失的属性。手制属性完成需要丰富的专家经验和域知识。此外,考虑到节点之间语义特性的差异,属性完成应精细,即属性完成操作应具有准度。此外,为了改进下游图形学习任务的业绩,将完成和训练混合的GNNNNM模型视为共同优化而不是视为两个不同的进程。为了应对上述挑战,我们建议一个不同的属性完成框架,即AutomaAC用于在不复杂的 GNNNNPS中自动完成操作。我们首先建议一个直观的完成操作空间,包括基于表和基于表层的完成操作。然后,我们建议一个连续的高级智能智能智能智能智能智能智能智能智能智能智能操作,即我们提出一个连续的搜索水平。