DiffFit：通过简单和参数高效的微调解锁大型扩散模型的可转移性 (DiffFit: Unlocking Transferability of Large Diffusion Models via Simple Parameter-Efficient Fine-Tuning) - 专知论文

会员服务 ·

0

微调 · 参数高效 · 扩散模型 · 预训练 · 因子 ·

2023 年 4 月 20 日

DiffFit: Unlocking Transferability of Large Diffusion Models via Simple Parameter-Efficient Fine-Tuning

翻译：DiffFit：通过简单和参数高效的微调解锁大型扩散模型的可转移性

Enze Xie,Lewei Yao,Han Shi,Zhili Liu,Daquan Zhou,Zhaoqiang Liu,Jiawei Li,Zhenguo Li

from arxiv, Tech Report

Diffusion models have proven to be highly effective in generating high-quality images. However, adapting large pre-trained diffusion models to new domains remains an open challenge, which is critical for real-world applications. This paper proposes DiffFit, a parameter-efficient strategy to fine-tune large pre-trained diffusion models that enable fast adaptation to new domains. DiffFit is embarrassingly simple that only fine-tunes the bias term and newly-added scaling factors in specific layers, yet resulting in significant training speed-up and reduced model storage costs. Compared with full fine-tuning, DiffFit achieves 2$\times$ training speed-up and only needs to store approximately 0.12\% of the total model parameters. Intuitive theoretical analysis has been provided to justify the efficacy of scaling factors on fast adaptation. On 8 downstream datasets, DiffFit achieves superior or competitive performances compared to the full fine-tuning while being more efficient. Remarkably, we show that DiffFit can adapt a pre-trained low-resolution generative model to a high-resolution one by adding minimal cost. Among diffusion-based methods, DiffFit sets a new state-of-the-art FID of 3.02 on ImageNet 512$\times$512 benchmark by fine-tuning only 25 epochs from a public pre-trained ImageNet 256$\times$256 checkpoint while being 30$\times$ more training efficient than the closest competitor.

翻译：扩散模型已被证明能够生成高质量的图像。然而，将大型预训练的扩散模型适应到新领域仍然是个未解决的难题，这对于实际应用非常关键。本文提出DiffFit，一种参数高效的策略，可以微调大型预训练的扩散模型，从而实现快速适应新领域。DiffFit 的实现非常简单，只需微调特定层中的偏置项和新添加的缩放因子，却能大大提高训练速度和减少模型存储成本。与完整的微调相比，DiffFit 能够实现2倍的训练加速，并且只需要存储大约0.12％的总模型参数。我们提供了直观的理论分析，以证明缩放因子在快速适应中的有效性。在8个下游数据集上，DiffFit 在效率更高的情况下获得了优越或具有竞争力的性能。值得注意的是，我们展示了DiffFit可以通过最小成本的添加，将预训练的低分辨率生成模型调整为高分辨率模型。在基于扩散的方法中，DiffFit借助只在公共预训练的 ImageNet 256×256 检查点上进行25轮微调就能获得3.02 的 ImageNet 512×512 基准 FID 值，而且比最接近的竞争者高出30倍的训练效率。

0

相关内容

百篇论文纵览大型语言模型最新研究进展

百篇论文纵览大型语言模型最新研究进展

专知会员服务

70+阅读 · 2023年3月31日

【NeurIPS2022】不用微调的加速大规模视觉Transformer的密集预测

【NeurIPS2022】不用微调的加速大规模视觉Transformer的密集预测

专知会员服务

14+阅读 · 2022年10月5日

【2022新书】高效深度学习，Efficient Deep Learning Book

【2022新书】高效深度学习，Efficient Deep Learning Book

专知会员服务

125+阅读 · 2022年4月21日

清华49页长文全方位分析参数高效微调方案Delta Tuning，揭秘大模型背后的机理

清华49页长文全方位分析参数高效微调方案Delta Tuning，揭秘大模型背后的机理

专知会员服务

49+阅读 · 2022年4月8日

【CVPR 2022】视觉提示调整（VPT），Vision Prompt Tuning

【CVPR 2022】视觉提示调整（VPT），Vision Prompt Tuning

专知会员服务

32+阅读 · 2022年3月12日

【ICML2020】统一预训练伪掩码语言模型

【ICML2020】统一预训练伪掩码语言模型

专知会员服务

27+阅读 · 2020年7月23日

【微软】大型神经语言模型的对抗性训练，Adversarial Training for Large Neural Language Models

【微软】大型神经语言模型的对抗性训练，Adversarial Training for Large Neural Language Models

专知会员服务

51+阅读 · 2020年5月3日

【ICML2020投稿论文】用于半监督图像分类的CowMask，Milking CowMask for Semi-Supervised Image Classification

【ICML2020投稿论文】用于半监督图像分类的CowMask，Milking CowMask for Semi-Supervised Image Classification

专知会员服务

29+阅读 · 2020年3月27日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

165+阅读 · 2020年3月18日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

直播 | Interpretable and Trustworthy Graph Geometric Deep Learning

直播 | Interpretable and Trustworthy Graph Geometric Deep Learning

图与推荐

2+阅读 · 2022年11月2日

GNN 新基准！Long Range Graph Benchmark

GNN 新基准！Long Range Graph Benchmark

图与推荐

0+阅读 · 2022年10月18日

强化学习三篇论文避免遗忘等

强化学习三篇论文避免遗忘等

CreateAMind

20+阅读 · 2019年5月24日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

无监督元学习表示学习

无监督元学习表示学习

CreateAMind

27+阅读 · 2019年1月4日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

【论文推荐】最新七篇知识图谱相关论文—嵌入式知识、Zero-shot识别、知识图谱嵌入、网络库、变分推理、解释、弱监督

【论文推荐】最新七篇知识图谱相关论文—嵌入式知识、Zero-shot识别、知识图谱嵌入、网络库、变分推理、解释、弱监督

专知

19+阅读 · 2018年3月26日

【论文推荐】最新5篇图像分割相关论文—条件随机场和深度特征学习、移动端网络、长期视觉定位、主动学习、主动轮廓模型、生成对抗性网络

【论文推荐】最新5篇图像分割相关论文—条件随机场和深度特征学习、移动端网络、长期视觉定位、主动学习、主动轮廓模型、生成对抗性网络

专知

13+阅读 · 2018年1月23日

最新5篇生成对抗网络相关论文推荐—FusedGAN、DeblurGAN、AdvGAN、CipherGAN、MMD GANS

最新5篇生成对抗网络相关论文推荐—FusedGAN、DeblurGAN、AdvGAN、CipherGAN、MMD GANS

专知

23+阅读 · 2018年1月18日

基于深度学习的复杂退化模糊图像恢复

国家自然科学基金

5+阅读 · 2015年12月31日

激光等离子体诊断用串列式十六通道X射线KB系统研究

国家自然科学基金

0+阅读 · 2013年12月31日

应用于太阳能热光伏系统的新型平面吸波-辐射元件

国家自然科学基金

0+阅读 · 2013年12月31日

经济周期、融资约束与营运资本的动态协同选择

国家自然科学基金

0+阅读 · 2013年12月31日

基于数据挖掘的大型燃煤发电机组节能诊断理论与方法研究

国家自然科学基金

0+阅读 · 2011年12月31日

Levy过程驱动的随机Fast-Diffusion方程的Harnack不等式及其应用

国家自然科学基金

0+阅读 · 2011年12月31日

混合动态网络的增益稳定性与混合分布控制研究

国家自然科学基金

0+阅读 · 2011年12月31日

Navier-Stokes方程稳定化有限元方法后验误差估计

国家自然科学基金

0+阅读 · 2011年12月31日

管理干预与企业内部决策权有效配置—#8212;基于行为决策视角的研究

国家自然科学基金

0+阅读 · 2009年12月31日

基于系统动力学的闭环供应链库存管理研究

国家自然科学基金

0+阅读 · 2008年12月31日

AutoPEFT: Automatic Configuration Search for Parameter-Efficient Fine-Tuning

Arxiv

0+阅读 · 2023年6月6日

Explanation-based Finetuning Makes Models More Robust to Spurious Cues

Explanation-based Finetuning Makes Models More Robust to Spurious Cues

Arxiv

0+阅读 · 2023年6月6日

A Data-Efficient Approach for Long-Term Human Motion Prediction Using Maps of Dynamics

Arxiv

0+阅读 · 2023年6月6日

A Universal Discriminator for Zero-Shot Generalization

Arxiv

0+阅读 · 2023年6月6日

MotionDiffuser: Controllable Multi-Agent Motion Prediction using Diffusion

MotionDiffuser: Controllable Multi-Agent Motion Prediction using Diffusion

Arxiv

0+阅读 · 2023年6月5日

When Federated Learning Meets Pre-trained Language Models' Parameter-Efficient Tuning Methods

Arxiv

0+阅读 · 2023年6月2日

Leveraging Auxiliary Domain Parallel Data in Intermediate Task Fine-tuning for Low-resource Translation

Arxiv

0+阅读 · 2023年6月2日

Active Code Learning: Benchmarking Sample-Efficient Training of Code Models

Arxiv

0+阅读 · 2023年6月2日

Train Large, Then Compress: Rethinking Model Size for Efficient Training and Inference of Transformers

Arxiv

12+阅读 · 2020年6月23日

Meta-Transfer Learning for Zero-Shot Super-Resolution

Meta-Transfer Learning for Zero-Shot Super-Resolution

Arxiv

43+阅读 · 2020年2月27日

VIP会员

文章信息

相关主题

相关VIP内容

百篇论文纵览大型语言模型最新研究进展

百篇论文纵览大型语言模型最新研究进展

专知会员服务

70+阅读 · 2023年3月31日

【NeurIPS2022】不用微调的加速大规模视觉Transformer的密集预测

【NeurIPS2022】不用微调的加速大规模视觉Transformer的密集预测

专知会员服务

14+阅读 · 2022年10月5日

【2022新书】高效深度学习，Efficient Deep Learning Book

【2022新书】高效深度学习，Efficient Deep Learning Book

专知会员服务

125+阅读 · 2022年4月21日

清华49页长文全方位分析参数高效微调方案Delta Tuning，揭秘大模型背后的机理

清华49页长文全方位分析参数高效微调方案Delta Tuning，揭秘大模型背后的机理

专知会员服务

49+阅读 · 2022年4月8日

【CVPR 2022】视觉提示调整（VPT），Vision Prompt Tuning

【CVPR 2022】视觉提示调整（VPT），Vision Prompt Tuning

专知会员服务

32+阅读 · 2022年3月12日

【ICML2020】统一预训练伪掩码语言模型

【ICML2020】统一预训练伪掩码语言模型

专知会员服务

27+阅读 · 2020年7月23日

【微软】大型神经语言模型的对抗性训练，Adversarial Training for Large Neural Language Models

【微软】大型神经语言模型的对抗性训练，Adversarial Training for Large Neural Language Models

专知会员服务

51+阅读 · 2020年5月3日

【ICML2020投稿论文】用于半监督图像分类的CowMask，Milking CowMask for Semi-Supervised Image Classification

【ICML2020投稿论文】用于半监督图像分类的CowMask，Milking CowMask for Semi-Supervised Image Classification

专知会员服务

29+阅读 · 2020年3月27日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

165+阅读 · 2020年3月18日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

人工智能治理的未来

模态感知的特征匹配：单一模态与跨模态技术的全面综述

无监督行人重识别研究综述

【牛津博士论文】面向神经影像应用的可扩展且可解释的空间模型

相关资讯

直播 | Interpretable and Trustworthy Graph Geometric Deep Learning

直播 | Interpretable and Trustworthy Graph Geometric Deep Learning

图与推荐

2+阅读 · 2022年11月2日

GNN 新基准！Long Range Graph Benchmark

GNN 新基准！Long Range Graph Benchmark

图与推荐

0+阅读 · 2022年10月18日

强化学习三篇论文避免遗忘等

强化学习三篇论文避免遗忘等

CreateAMind

20+阅读 · 2019年5月24日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

无监督元学习表示学习

无监督元学习表示学习

CreateAMind

27+阅读 · 2019年1月4日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

【论文推荐】最新七篇知识图谱相关论文—嵌入式知识、Zero-shot识别、知识图谱嵌入、网络库、变分推理、解释、弱监督

【论文推荐】最新七篇知识图谱相关论文—嵌入式知识、Zero-shot识别、知识图谱嵌入、网络库、变分推理、解释、弱监督

专知

19+阅读 · 2018年3月26日

【论文推荐】最新5篇图像分割相关论文—条件随机场和深度特征学习、移动端网络、长期视觉定位、主动学习、主动轮廓模型、生成对抗性网络

【论文推荐】最新5篇图像分割相关论文—条件随机场和深度特征学习、移动端网络、长期视觉定位、主动学习、主动轮廓模型、生成对抗性网络

专知

13+阅读 · 2018年1月23日

最新5篇生成对抗网络相关论文推荐—FusedGAN、DeblurGAN、AdvGAN、CipherGAN、MMD GANS

最新5篇生成对抗网络相关论文推荐—FusedGAN、DeblurGAN、AdvGAN、CipherGAN、MMD GANS

专知

23+阅读 · 2018年1月18日

相关论文

AutoPEFT: Automatic Configuration Search for Parameter-Efficient Fine-Tuning

Arxiv

0+阅读 · 2023年6月6日

Explanation-based Finetuning Makes Models More Robust to Spurious Cues

Explanation-based Finetuning Makes Models More Robust to Spurious Cues

Arxiv

0+阅读 · 2023年6月6日

A Data-Efficient Approach for Long-Term Human Motion Prediction Using Maps of Dynamics

Arxiv

0+阅读 · 2023年6月6日

A Universal Discriminator for Zero-Shot Generalization

Arxiv

0+阅读 · 2023年6月6日

MotionDiffuser: Controllable Multi-Agent Motion Prediction using Diffusion

MotionDiffuser: Controllable Multi-Agent Motion Prediction using Diffusion

Arxiv

0+阅读 · 2023年6月5日

When Federated Learning Meets Pre-trained Language Models' Parameter-Efficient Tuning Methods

Arxiv

0+阅读 · 2023年6月2日

Leveraging Auxiliary Domain Parallel Data in Intermediate Task Fine-tuning for Low-resource Translation

Arxiv

0+阅读 · 2023年6月2日

Active Code Learning: Benchmarking Sample-Efficient Training of Code Models

Arxiv

0+阅读 · 2023年6月2日

Train Large, Then Compress: Rethinking Model Size for Efficient Training and Inference of Transformers

Arxiv

12+阅读 · 2020年6月23日

Meta-Transfer Learning for Zero-Shot Super-Resolution

Meta-Transfer Learning for Zero-Shot Super-Resolution

Arxiv

43+阅读 · 2020年2月27日

相关基金

基于深度学习的复杂退化模糊图像恢复

国家自然科学基金

5+阅读 · 2015年12月31日

激光等离子体诊断用串列式十六通道X射线KB系统研究

国家自然科学基金

0+阅读 · 2013年12月31日

应用于太阳能热光伏系统的新型平面吸波-辐射元件

国家自然科学基金

0+阅读 · 2013年12月31日

经济周期、融资约束与营运资本的动态协同选择

国家自然科学基金

0+阅读 · 2013年12月31日

基于数据挖掘的大型燃煤发电机组节能诊断理论与方法研究

国家自然科学基金

0+阅读 · 2011年12月31日

Levy过程驱动的随机Fast-Diffusion方程的Harnack不等式及其应用

国家自然科学基金

0+阅读 · 2011年12月31日

混合动态网络的增益稳定性与混合分布控制研究

国家自然科学基金

0+阅读 · 2011年12月31日

Navier-Stokes方程稳定化有限元方法后验误差估计

国家自然科学基金

0+阅读 · 2011年12月31日

管理干预与企业内部决策权有效配置—#8212;基于行为决策视角的研究

国家自然科学基金

0+阅读 · 2009年12月31日

基于系统动力学的闭环供应链库存管理研究

国家自然科学基金

0+阅读 · 2008年12月31日

微信扫码咨询专知VIP会员