In this paper, we compare the performance of Model Soups on three different models (ResNet, ViT, and EfficientNet) using three soup recipes (Greedy Soup Sorted, Greedy Soup Random, and Uniform Soup) from arXiv:2203.05482, and reproduce the authors' results. We then introduce a new soup recipe called Pruned Soup. The soups outperformed the best individual model for the pre-trained vision transformer, but performed much worse for the ResNet and the EfficientNet. Our Pruned Soup outperformed the Uniform and Greedy Soups presented in the original paper. We also discuss the limitations of weight averaging that we found during our experiments. The code for our model soup library and the experiments with the different models can be found here: https://github.com/milo-sobral/ModelSoup
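To make the weight-averaging idea behind the soup recipes concrete, the sketch below shows a uniform soup under the assumption of PyTorch-style state dicts from fine-tuned models sharing one architecture; the function and variable names are illustrative and are not the API of the ModelSoup library linked above.

```python
import torch


def uniform_soup(state_dicts):
    """Element-wise average of the parameters of several fine-tuned models.

    Assumes every state dict comes from the same architecture, so all
    keys and tensor shapes match across models.
    """
    soup = {}
    for key in state_dicts[0]:
        # Stack the corresponding tensor from every model and take the mean.
        soup[key] = torch.stack(
            [sd[key].float() for sd in state_dicts]
        ).mean(dim=0)
    return soup


# Hypothetical usage: load each fine-tuned checkpoint, average the weights,
# then load the averaged state dict into a model of the same architecture.
# soup_weights = uniform_soup([torch.load(p) for p in checkpoint_paths])
# model.load_state_dict(soup_weights)
```

Greedy and pruned recipes differ only in which models enter the average: a greedy soup adds candidates one at a time and keeps them only if held-out accuracy improves, while a pruned soup starts from the full average and removes models instead.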