对 " 拥抱面深深学习模式登记册 " 中培训前模式再利用经验研究</s> (An Empirical Study of Pre-Trained Model Reuse in the Hugging Face Deep Learning Model Registry) - 专知论文

会员服务 ·

0

PTM · MoDELS · Hugging Face · Learning · 可辨认的 ·

2023 年 3 月 5 日

An Empirical Study of Pre-Trained Model Reuse in the Hugging Face Deep Learning Model Registry

翻译：对 " 拥抱面深深学习模式登记册 " 中培训前模式再利用经验研究

Wenxin Jiang,Nicholas Synovic,Matt Hyatt,Taylor R. Schorlemmer,Rohan Sethi,Yung-Hsiang Lu,George K. Thiruvathukal,James C. Davis

from arxiv, Proceedings of the ACM/IEEE 45th International Conference on Software Engineering (ICSE) 2023

Deep Neural Networks (DNNs) are being adopted as components in software systems. Creating and specializing DNNs from scratch has grown increasingly difficult as state-of-the-art architectures grow more complex. Following the path of traditional software engineering, machine learning engineers have begun to reuse large-scale pre-trained models (PTMs) and fine-tune these models for downstream tasks. Prior works have studied reuse practices for traditional software packages to guide software engineers towards better package maintenance and dependency management. We lack a similar foundation of knowledge to guide behaviors in pre-trained model ecosystems. In this work, we present the first empirical investigation of PTM reuse. We interviewed 12 practitioners from the most popular PTM ecosystem, Hugging Face, to learn the practices and challenges of PTM reuse. From this data, we model the decision-making process for PTM reuse. Based on the identified practices, we describe useful attributes for model reuse, including provenance, reproducibility, and portability. Three challenges for PTM reuse are missing attributes, discrepancies between claimed and actual performance, and model risks. We substantiate these identified challenges with systematic measurements in the Hugging Face ecosystem. Our work informs future directions on optimizing deep learning ecosystems by automated measuring useful attributes and potential attacks, and envision future research on infrastructure and standardization for model registries.

翻译：深心网络(DNN)正在被采纳为软件系统的组成部分。从零开始创建和专门设计DNN越来越困难,因为最先进的建筑结构越来越复杂。根据传统软件工程学的路径,机器学习工程师开始重新使用大规模预培训模型,并对这些模型进行微调,用于下游任务。以前的工作研究过传统软件包的再利用做法,以指导软件工程师改进软件包的维护和依赖性管理。我们缺乏类似的知识基础来指导经过培训的模范生态系统的行为。我们在此工作中介绍了对PTM再利用的第一次实证调查。我们采访了来自最受欢迎的PTM生态系统、Huging Face的12名从业者,以学习PTM再利用的做法和挑战。我们从这些数据中为PTM再利用的决策进程建模。我们根据已查明的做法,介绍了模式再利用的有用属性,包括证明、可复制性和可移植性。PTM再利用的三种挑战都缺乏属性,声称的和实际的绩效之间的差异,以及模型风险。我们用系统测量了这些挑战,用系统测量了对Hugg 的生态系统进行标准化研究,并改进了我们未来生态系统研究的标志。</s>

0

相关内容

PTM

【2022新书】高效深度学习，Efficient Deep Learning Book

【2022新书】高效深度学习，Efficient Deep Learning Book

专知会员服务

125+阅读 · 2022年4月21日

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

专知会员服务

78+阅读 · 2022年3月15日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

166+阅读 · 2020年3月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

160+阅读 · 2019年10月12日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

直播 | Interpretable and Trustworthy Graph Geometric Deep Learning

直播 | Interpretable and Trustworthy Graph Geometric Deep Learning

图与推荐

2+阅读 · 2022年11月2日

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

深度自进化聚类：Deep Self-Evolution Clustering

深度自进化聚类：Deep Self-Evolution Clustering

我爱读PAMI

15+阅读 · 2019年4月13日

无监督元学习表示学习

无监督元学习表示学习

CreateAMind

27+阅读 · 2019年1月4日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

【推荐】用TensorFlow实现LSTM社交对话股市情感分析

【推荐】用TensorFlow实现LSTM社交对话股市情感分析

机器学习研究会

11+阅读 · 2018年1月14日

【推荐】SVM实例教程

【推荐】SVM实例教程

机器学习研究会

17+阅读 · 2017年8月26日

Calderon问题和边界刚性问题

国家自然科学基金

0+阅读 · 2013年12月31日

S100A4-miR155在肝癌组织间充质干细胞调控肝癌增殖及转移中的作用及机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

白光LED用三种典型氮掺杂铝、硅酸盐基荧光粉的制备及其发光机理研究

国家自然科学基金

0+阅读 · 2012年12月31日

基于两相界面势垒控制的高导电性贵金属/LaNiO3复合薄膜的制备与机理研究

国家自然科学基金

0+阅读 · 2012年12月31日

La1-xSrxMnO3/In-MgZnO全氧化物外延异质结器件的制备与性能研究

国家自然科学基金

0+阅读 · 2012年12月31日

Ghrelin对胰岛β细胞分泌胰岛素和增殖的影响及分子机制

国家自然科学基金

0+阅读 · 2012年12月31日

水泥沥青砂浆的热行为及变形机理

国家自然科学基金

0+阅读 · 2009年12月31日

IFNγ22312;异基因造血干细胞移植中aGVHD致肺损伤的作用及机理研究

国家自然科学基金

0+阅读 · 2009年12月31日

半金属磁性薄膜的重离子辐照效应研究

国家自然科学基金

0+阅读 · 2009年12月31日

微小RNA（miRNA）对实验性关节炎CD4阳性T细胞增殖和分化的调控机制研究

国家自然科学基金

0+阅读 · 2008年12月31日

Practices and Challenges of Using GitHub Copilot: An Empirical Study

Arxiv

1+阅读 · 2023年4月25日

User-Centric Federated Learning: Trading off Wireless Resources for Personalization

Arxiv

0+阅读 · 2023年4月25日

Optimizing Deep Learning Models For Raspberry Pi

Arxiv

0+阅读 · 2023年4月25日

Enriching Source Code with Contextual Data for Code Completion Models: An Empirical Study

Arxiv

0+阅读 · 2023年4月24日

LDPTrace: Locally Differentially Private Trajectory Synthesis

Arxiv

0+阅读 · 2023年4月24日

An Empirical Study on Using Large Language Models for Multi-Intent Comment Generation

Arxiv

0+阅读 · 2023年4月22日

The Life Cycle of Knowledge in Big Language Models: A Survey

Arxiv

28+阅读 · 2023年3月14日

A Survey of Deep Causal Model

Arxiv

45+阅读 · 2022年9月19日

Pre-Trained Models: Past, Present and Future

Arxiv

19+阅读 · 2021年6月15日

Recent Advances in Deep Learning-based Dialogue Systems

Arxiv

18+阅读 · 2021年5月10日

VIP会员

文章信息

相关主题

相关VIP内容

【2022新书】高效深度学习，Efficient Deep Learning Book

【2022新书】高效深度学习，Efficient Deep Learning Book

专知会员服务

125+阅读 · 2022年4月21日

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

专知会员服务

78+阅读 · 2022年3月15日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

166+阅读 · 2020年3月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

160+阅读 · 2019年10月12日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

俄乌战争启示：坦克战与不断演变的战斗形态

《大规模作战行动中与无人机集成的C5ISR系统》

《主观概率约束下寻找可行系统及其军事应用》69页

《美政府问责局：多种挑战影响地面战车任务出勤率》2025最新130页

相关资讯

直播 | Interpretable and Trustworthy Graph Geometric Deep Learning

直播 | Interpretable and Trustworthy Graph Geometric Deep Learning

图与推荐

2+阅读 · 2022年11月2日

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

深度自进化聚类：Deep Self-Evolution Clustering

深度自进化聚类：Deep Self-Evolution Clustering

我爱读PAMI

15+阅读 · 2019年4月13日

无监督元学习表示学习

无监督元学习表示学习

CreateAMind

27+阅读 · 2019年1月4日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

【推荐】用TensorFlow实现LSTM社交对话股市情感分析

【推荐】用TensorFlow实现LSTM社交对话股市情感分析

机器学习研究会

11+阅读 · 2018年1月14日

【推荐】SVM实例教程

【推荐】SVM实例教程

机器学习研究会

17+阅读 · 2017年8月26日

相关论文

Practices and Challenges of Using GitHub Copilot: An Empirical Study

Arxiv

1+阅读 · 2023年4月25日

User-Centric Federated Learning: Trading off Wireless Resources for Personalization

Arxiv

0+阅读 · 2023年4月25日

Optimizing Deep Learning Models For Raspberry Pi

Arxiv

0+阅读 · 2023年4月25日

Enriching Source Code with Contextual Data for Code Completion Models: An Empirical Study

Arxiv

0+阅读 · 2023年4月24日

LDPTrace: Locally Differentially Private Trajectory Synthesis

Arxiv

0+阅读 · 2023年4月24日

An Empirical Study on Using Large Language Models for Multi-Intent Comment Generation

Arxiv

0+阅读 · 2023年4月22日

The Life Cycle of Knowledge in Big Language Models: A Survey

Arxiv

28+阅读 · 2023年3月14日

A Survey of Deep Causal Model

Arxiv

45+阅读 · 2022年9月19日

Pre-Trained Models: Past, Present and Future

Arxiv

19+阅读 · 2021年6月15日

Recent Advances in Deep Learning-based Dialogue Systems

Arxiv

18+阅读 · 2021年5月10日

相关基金

Calderon问题和边界刚性问题

国家自然科学基金

0+阅读 · 2013年12月31日

S100A4-miR155在肝癌组织间充质干细胞调控肝癌增殖及转移中的作用及机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

白光LED用三种典型氮掺杂铝、硅酸盐基荧光粉的制备及其发光机理研究

国家自然科学基金

0+阅读 · 2012年12月31日

基于两相界面势垒控制的高导电性贵金属/LaNiO3复合薄膜的制备与机理研究

国家自然科学基金

0+阅读 · 2012年12月31日

La1-xSrxMnO3/In-MgZnO全氧化物外延异质结器件的制备与性能研究

国家自然科学基金

0+阅读 · 2012年12月31日

Ghrelin对胰岛β细胞分泌胰岛素和增殖的影响及分子机制

国家自然科学基金

0+阅读 · 2012年12月31日

水泥沥青砂浆的热行为及变形机理

国家自然科学基金

0+阅读 · 2009年12月31日

IFNγ22312;异基因造血干细胞移植中aGVHD致肺损伤的作用及机理研究

国家自然科学基金

0+阅读 · 2009年12月31日

半金属磁性薄膜的重离子辐照效应研究

国家自然科学基金

0+阅读 · 2009年12月31日

微小RNA（miRNA）对实验性关节炎CD4阳性T细胞增殖和分化的调控机制研究

国家自然科学基金

0+阅读 · 2008年12月31日

微信扫码咨询专知VIP会员