Unsupervised/self-supervised pre-training methods for graph representation learning have recently attracted increasing research interest, and they have been shown to generalize to various downstream applications. Yet, the adversarial robustness of such pre-trained graph learning models remains largely unexplored. More importantly, most existing defense techniques designed for end-to-end graph representation learning methods require pre-specified label definitions and thus cannot be directly applied to pre-training methods. In this paper, we propose an unsupervised defense technique to robustify pre-trained deep graph models, so that perturbations of the input graph can be successfully identified and blocked before the model is applied to different downstream tasks. Specifically, we introduce a mutual information-based measure, \textit{graph representation vulnerability (GRV)}, to quantify the robustness of graph encoders in the representation space. We then formulate an optimization problem to learn the graph representation by carefully balancing the trade-off between the expressive power and the robustness (\emph{i.e.}, GRV) of the graph encoder. The discrete nature of graph topology and the joint space of graph data make the optimization problem intractable to solve. To handle this difficulty and to reduce computational expense, we further relax the problem and thus provide an approximate solution. Additionally, we explore a provable connection between the robustness of the unsupervised graph encoder and that of models on downstream tasks. Extensive experiments demonstrate that even without access to labels and tasks, our method still enhances robustness against adversarial attacks on three downstream tasks (node classification, link prediction, and community detection) by an average of +16.5\% compared with existing methods.
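As a rough sketch of the trade-off described above (an illustrative formulation under assumed notation, not necessarily the exact one developed later in the paper), let $f$ denote the graph encoder, $\mathcal{B}(\mathcal{G})$ an assumed admissible set of perturbed versions of the input graph $\mathcal{G}$, and $I(\cdot\,;\cdot)$ mutual information. GRV can then be read as the drop in mutual information under a worst-case perturbation, and the learning objective as maximizing expressiveness subject to a bound on that drop:
\begin{align*}
\mathrm{GRV}(f) \;&=\; I\!\left(\mathcal{G};\, f(\mathcal{G})\right) \;-\; \inf_{\hat{\mathcal{G}} \in \mathcal{B}(\mathcal{G})} I\!\left(\mathcal{G};\, f(\hat{\mathcal{G}})\right), \\
\max_{f} \;\; & I\!\left(\mathcal{G};\, f(\mathcal{G})\right) \quad \text{s.t.} \quad \mathrm{GRV}(f) \le \epsilon ,
\end{align*}
where $\epsilon$ is an assumed tolerance controlling the trade-off; an equivalent penalized form, $I(\mathcal{G}; f(\mathcal{G})) - \beta\,\mathrm{GRV}(f)$ with assumed weight $\beta$, captures the same balance between expressive power and robustness.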