As powerful tools for representation learning on graphs, graph neural networks (GNNs) have facilitated various applications from drug discovery to recommender systems. Nevertheless, the effectiveness of GNNs is immensely challenged by issues related to data quality, such as distribution shift, abnormal features and adversarial attacks. Recent efforts have been made on tackling these issues from a modeling perspective which requires additional cost of changing model architectures or re-training model parameters. In this work, we provide a data-centric view to tackle these issues and propose a graph transformation framework named GTrans which adapts and refines graph data at test time to achieve better performance. We provide theoretical analysis on the design of the framework and discuss why adapting graph data works better than adapting the model. Extensive experiments have demonstrated the effectiveness of GTrans on three distinct scenarios for eight benchmark datasets where suboptimal data is presented. Remarkably, GTrans performs the best in most cases with improvements up to 2.8%, 8.2% and 3.8% over the best baselines on three experimental settings. Code is released at https://github.com/ChandlerBang/GTrans.
翻译:图表神经网络(GNNs)作为在图表上进行代表学习的有力工具,促进了从药物发现到建议系统的各种应用,然而,与数据质量有关的问题,如分布转移、异常特征和对抗性攻击等问题,都对GNNs的效力提出了巨大挑战。最近为从模型的角度处理这些问题作出了努力,这需要改变模型结构或再培训模型参数的额外费用。在这项工作中,我们提供了一种以数据为中心的观点来解决这些问题,并提出了一个名为GTR的图形转换框架,该框架在测试时调整和完善图表数据,以取得更好的性能。我们提供了框架的设计理论分析,并讨论了为什么调整图表数据比调整模型效果更好。广泛的实验表明GTrans在8个基准数据集的三种不同情景上的有效性,在提供亚最佳数据的地方。值得注意的是,GTrans在多数情况下,在三个实验环境的最佳基线上改进了2.8%、8.2%和3.8%。代码发布在https://github.com/ChandlerBang/GTransy。</s>