Measuring the Instability of Fine-Tuning - Zhuanzhi Papers

February 15, 2023

Measuring the Instability of Fine-Tuning

Yupei Du, Dong Nguyen

From arXiv, 20 pages, 26 figures

Fine-tuning pre-trained language models on downstream tasks with varying random seeds has been shown to be unstable, especially on small datasets. Many previous studies have investigated this instability and proposed methods to mitigate it. However, most studies only used the standard deviation of performance scores (SD) as their measure, which is a narrow characterization of instability. In this paper, we analyze SD and six other measures quantifying instability at different levels of granularity. Moreover, we propose a systematic framework to evaluate the validity of these measures. Finally, we analyze the consistency and difference between different measures by reassessing existing instability mitigation methods. We hope our results will inform the development of better measurements of fine-tuning instability.
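To illustrate why the standard deviation of performance scores can be a narrow characterization, here is a minimal sketch on toy data. It computes SD across three hypothetical fine-tuning runs (one per random seed) and compares it with a prediction-level pairwise disagreement rate. The disagreement measure, the function names, and the toy predictions are assumptions chosen for illustration; the abstract does not name the six other measures the paper analyzes, so this is not necessarily one of them.

```python
from itertools import combinations
from statistics import stdev


def accuracy(preds, labels):
    """Fraction of examples where the prediction matches the gold label."""
    return sum(p == y for p, y in zip(preds, labels)) / len(labels)


def score_sd(scores):
    """Sample standard deviation of per-seed performance scores (SD)."""
    return stdev(scores)


def pairwise_disagreement(runs):
    """Mean fraction of examples on which two runs predict different labels.

    Hypothetical finer-grained measure, used here only for illustration.
    """
    pairs = list(combinations(runs, 2))
    total = 0.0
    for a, b in pairs:
        total += sum(x != y for x, y in zip(a, b)) / len(a)
    return total / len(pairs)


# Toy data (made up): gold labels and predictions from three seeds.
labels = [1, 0, 1, 0]
runs = [
    [1, 0, 0, 0],  # seed 0
    [1, 1, 1, 0],  # seed 1
    [0, 0, 1, 0],  # seed 2
]

scores = [accuracy(r, labels) for r in runs]
sd = score_sd(scores)
dis = pairwise_disagreement(runs)

print("accuracies:", scores)
print("SD of accuracy:", sd)
print("pairwise disagreement:", dis)
```

In this toy case every run reaches the same accuracy, so SD is zero, yet each pair of runs disagrees on half of the examples. A score-level measure alone would report the runs as perfectly stable.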
