As a way to implement the "right to be forgotten" in machine learning, \textit{machine unlearning} aims to completely remove the contributions and information of the samples to be deleted from a trained model without affecting the contributions of other samples. Recently, many frameworks for machine unlearning have been proposed, and most of them focus on image and text data. To extend machine unlearning to graph data, \textit{GraphEraser} has been proposed. However, a critical issue is that \textit{GraphEraser} is specifically designed for the transductive graph setting, where the graph is static and attributes and edges of test nodes are visible during training. It is unsuitable for the inductive setting, where the graph could be dynamic and the test graph information is invisible in advance. Such inductive capability is essential for production machine learning systems with evolving graphs like social media and transaction networks. To fill this gap, we propose the \underline{{\bf G}}\underline{{\bf U}}ided \underline{{\bf I}}n\underline{{\bf D}}uctiv\underline{{\bf E}} Graph Unlearning framework (GUIDE). GUIDE consists of three components: guided graph partitioning with fairness and balance, efficient subgraph repair, and similarity-based aggregation. Empirically, we evaluate our method on several inductive benchmarks and evolving transaction graphs. Generally speaking, GUIDE can be efficiently implemented on the inductive graph learning tasks for its low graph partition cost, no matter on computation or structure information. The code will be available here: https://github.com/Happy2Git/GUIDE.
翻译:作为实现机器学习中“被遗忘权”的一种方式,机器去学习旨在完全从训练模型中移除要删除的样本的贡献和信息,而不影响其他样本的贡献。最近,许多机器去学习框架已被提出,其中大多数集中在图像和文本数据方面。为了将机器去学习扩展到图形数据,提出了GraphEraser。然而,一个关键问题是,GraphEraser是专门为推导性图形设置而设计的,其中图形是静态的,并且在训练期间可以看到测试节点的属性和边缘。它不适用于归纳设置,其中图可能是动态的,测试图信息事先不可见。这种归纳能力对于具有不断发展的图形的生产机器学习系统(如社交媒体和交易网络)至关重要。为了填补这一空白,我们提出了GUidED INDUCtivE Graph Unlearning框架(GUIDE)。GUIDE包含三个组件:具有公平性和平衡性的引导图分区、高效的子图修复和基于相似性的聚合。从经验上讲,我们在几个归纳基准和不断发展的交易图上评估了我们的方法。一般来说,由于其低图分区成本(无论是计算还是结构信息),GUIDE可以在归纳图学习任务上高效实现。代码将在此处提供:https://github.com/Happy2Git/GUIDE.