Recent work has explored the potential to adapt a pre-trained vision transformer (ViT) by updating only a few parameters so as to improve storage efficiency, called parameter-efficient transfer learning (PETL). Current PETL methods have shown that by tuning only 0.5% of the parameters, ViT can be adapted to downstream tasks with even better performance than full fine-tuning. In this paper, we aim to further promote the parameter efficiency of PETL to meet the extreme storage constraint in real-world applications. To this end, we propose a tensorization-decomposition framework to store the weight increments, in which the weights of each ViT are tensorized into a single 3D tensor, and their increments are then decomposed into lightweight factors. In the fine-tuning process, only the factors need to be updated and stored, termed Factor-Tuning (FacT). On the VTAB-1K benchmark, our method performs on par with NOAH, the state-of-the-art PETL method, while being 5x more parameter-efficient. We also present a tiny version that uses only 8K trainable parameters (0.01% of ViT's parameters) but outperforms full fine-tuning and many other PETL methods such as VPT and BitFit. In few-shot settings, FacT also beats all PETL baselines using the fewest parameters, demonstrating its strong capability in the low-data regime.
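To make the factorization idea concrete, below is a minimal PyTorch sketch of one adapted linear layer under illustrative assumptions: the pre-trained weight is frozen, its increment is built as ΔW = s · U Σ V^T from factors U and V shared across layers plus a small per-layer core Σ, and only these factors are trained. The class name FacTLinear, the rank r, the scale s, and the initialization are hypothetical choices for illustration, not the paper's exact formulation of the 3D-tensor decomposition.

```python
import torch
import torch.nn as nn

class FacTLinear(nn.Module):
    """Illustrative sketch: a frozen linear layer whose weight increment
    delta_W = scale * U @ core @ V.T is built from factors U, V shared
    across layers and a small per-layer core. Only the factors are trained."""
    def __init__(self, base: nn.Linear, U: nn.Parameter, V: nn.Parameter,
                 rank: int = 8, scale: float = 1.0):
        super().__init__()
        self.base = base
        for p in self.base.parameters():
            p.requires_grad_(False)            # freeze the pre-trained weight
        self.U, self.V = U, V                  # shared factors, each d x r
        self.core = nn.Parameter(torch.zeros(rank, rank))  # per-layer core
        self.scale = scale

    def forward(self, x):
        delta_w = self.scale * self.U @ self.core @ self.V.T  # d x d increment
        return self.base(x) + x @ delta_w.T

# Hypothetical usage with ViT-B-like dimensions (hidden size 768, 197 tokens).
d, r = 768, 8
U = nn.Parameter(torch.randn(d, r) * 0.02)     # shared across adapted layers
V = nn.Parameter(torch.randn(d, r) * 0.02)
layer = FacTLinear(nn.Linear(d, d), U, V, rank=r)
out = layer(torch.randn(4, 197, d))            # (batch, tokens, dim)
trainable = sum(p.numel() for p in layer.parameters() if p.requires_grad)
print(out.shape, trainable)                    # factors are the only trainables
```

Because U and V are reused by every adapted layer while each layer adds only a tiny core, the number of stored parameters grows very slowly with depth, which is the source of the storage savings the abstract describes.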