Fine-tuning pre-trained models has recently yielded remarkable performance gains in graph neural networks (GNNs). Beyond pre-training techniques, and inspired by recent work in natural language processing, more recent efforts have shifted toward effective fine-tuning approaches such as parameter-efficient tuning (delta tuning). However, given the substantial differences between GNNs and transformer-based models, applying such approaches directly to GNNs has proved less effective. In this paper, we present a comprehensive comparison of delta tuning techniques for GNNs and propose a novel delta tuning method specifically designed for GNNs, called AdapterGNN. AdapterGNN preserves the knowledge of the large pre-trained model and leverages highly expressive adapters for GNNs, which adapt to downstream tasks effectively with only a few parameters while also improving the model's generalization on those tasks. Extensive experiments show that AdapterGNN achieves higher evaluation performance (outperforming full fine-tuning by 1.4% and 5.5% in the chemistry and biology domains, respectively, with only 5% of the parameters tuned) and lower generalization gaps than full fine-tuning. Moreover, we empirically show that a larger GNN model can have worse generalization ability, which differs from the trend observed in large language models. We also provide a theoretical justification, via generalization bounds, for how delta tuning can improve the generalization ability of GNNs.
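To make the adapter-based delta tuning idea concrete, the following is a minimal sketch in PyTorch of a bottleneck adapter attached to a pre-trained GNN whose backbone weights are frozen. The module names, bottleneck size, and the `add_adapters_and_freeze` helper are illustrative assumptions for exposition, not the paper's exact AdapterGNN architecture.

```python
import torch
import torch.nn as nn


class GNNAdapter(nn.Module):
    """Bottleneck adapter: down-project, nonlinearity, up-project, scaled residual."""

    def __init__(self, hidden_dim: int, bottleneck_dim: int = 16, scale: float = 1.0):
        super().__init__()
        self.down = nn.Linear(hidden_dim, bottleneck_dim)
        self.up = nn.Linear(bottleneck_dim, hidden_dim)
        self.act = nn.ReLU()
        self.scale = scale
        # Near-identity initialization so the pre-trained behavior is preserved at the start.
        nn.init.zeros_(self.up.weight)
        nn.init.zeros_(self.up.bias)

    def forward(self, h: torch.Tensor) -> torch.Tensor:
        # Residual connection keeps the frozen backbone's representation intact.
        return h + self.scale * self.up(self.act(self.down(h)))


def add_adapters_and_freeze(gnn: nn.Module, hidden_dim: int, num_layers: int) -> nn.ModuleList:
    """Freeze the pre-trained GNN and return one trainable adapter per message-passing layer.

    Hypothetical helper: assumes the caller applies adapters[i] to the hidden
    representation produced by the i-th GNN layer during the forward pass.
    """
    for p in gnn.parameters():
        p.requires_grad = False  # keep pre-trained knowledge fixed
    return nn.ModuleList(GNNAdapter(hidden_dim) for _ in range(num_layers))
```

In such a setup, only the adapters (and typically a small task head) receive gradients during downstream fine-tuning, so the tuned parameter count stays at a few percent of the backbone, which mirrors the parameter-efficiency claim in the abstract.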