Diffusion models have proven to be highly effective in generating high-quality images. However, adapting large pre-trained diffusion models to new domains remains an open challenge, which is critical for real-world applications. This paper proposes DiffFit, a parameter-efficient strategy to fine-tune large pre-trained diffusion models that enables fast adaptation to new domains. DiffFit is embarrassingly simple: it fine-tunes only the bias terms and newly-added scaling factors in specific layers, yet yields significant training speed-ups and reduced model storage costs. Compared with full fine-tuning, DiffFit achieves a 2$\times$ training speed-up and only needs to store approximately 0.12\% of the total model parameters. We provide an intuitive theoretical analysis to justify the efficacy of the scaling factors for fast adaptation. On 8 downstream datasets, DiffFit achieves superior or competitive performance compared to full fine-tuning while being more efficient. Remarkably, we show that DiffFit can adapt a pre-trained low-resolution generative model to a high-resolution one at minimal cost. Among diffusion-based methods, DiffFit sets a new state-of-the-art FID of 3.02 on the ImageNet 512$\times$512 benchmark by fine-tuning for only 25 epochs from a public pre-trained ImageNet 256$\times$256 checkpoint, while being 30$\times$ more training-efficient than the closest competitor.
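The following is a minimal PyTorch sketch of the fine-tuning recipe described above: freeze all pre-trained weights, re-enable gradients only for bias terms, and wrap selected blocks with a newly-added learnable scaling factor initialized to 1. The names `ScaledBlock` and `apply_difffit`, the toy backbone, and the choice of wrapping every top-level block are illustrative assumptions, not the authors' released implementation.

```python
import torch
import torch.nn as nn

class ScaledBlock(nn.Module):
    """Wraps a frozen pre-trained block with a newly-added learnable scale factor."""
    def __init__(self, block: nn.Module):
        super().__init__()
        self.block = block
        self.gamma = nn.Parameter(torch.ones(1))  # scaling factor, initialized to 1

    def forward(self, x):
        return self.gamma * self.block(x)

def apply_difffit(model: nn.Module) -> nn.Module:
    # Freeze every pre-trained weight, then re-enable gradients for biases only.
    for name, param in model.named_parameters():
        param.requires_grad = "bias" in name
    # Wrap each top-level child with a trainable scaling factor; in the paper
    # this is done only for specific layers of the diffusion backbone.
    for name, child in list(model.named_children()):
        setattr(model, name, ScaledBlock(child))
    return model

# Toy stand-in for a pre-trained diffusion backbone (illustrative only).
backbone = nn.Sequential(nn.Linear(16, 16), nn.GELU(), nn.Linear(16, 16))
model = apply_difffit(backbone)

# Only bias terms and the new gamma factors receive gradients, which is what
# keeps the stored per-task parameters to a small fraction of the model.
trainable = [p for p in model.parameters() if p.requires_grad]
optimizer = torch.optim.AdamW(trainable, lr=1e-4)
```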