Faculty Distillation with Optimal Transport - 专知 (Zhuanzhi) Papers


Knowledge · Distillation · Optimizer · Label space · MoDELS

June 16, 2022

Faculty Distillation with Optimal Transport


Su Lu,Han-Jia Ye,De-Chuan Zhan

The outpouring of various pre-trained models empowers knowledge distillation (KD) by providing abundant teacher resources. Meanwhile, exploring the massive model repository to select a suitable teacher and further extracting its knowledge become daunting challenges. Standard KD fails to surmount two obstacles when training a student with the availability of plentiful pre-trained teachers, i.e., the "faculty". First, we need to seek out the most contributive teacher in the faculty efficiently rather than enumerating all of them for a student. Second, since the teacher may be pre-trained on different tasks w.r.t. the student, we must distill the knowledge from a more general label space. This paper studies this "faculty distillation" where a student performs teacher assessment and generalized knowledge reuse. We take advantage of optimal transport to construct a unifying objective for both problems, which bridges the semantic gap and measures the relatedness between a pair of models. This objective can select the most relevant teacher, and we minimize the same objective over student parameters to transfer the knowledge from the selected teacher subsequently. Experiments in various settings demonstrate the succinctness and versatility of our proposed method.
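The abstract does not give the objective itself, so the following is only a rough sketch of the general idea and not the authors' formulation. It assumes each teacher outputs class probabilities over its own label space, that a cost matrix between teacher classes and student classes is available (how such a cost is obtained is left open here), and that relatedness is scored by an entropic-regularized optimal transport cost computed with Sinkhorn iterations. The names `sinkhorn_cost`, `faculty_score`, and `select_teacher` are hypothetical.

```python
import numpy as np


def sinkhorn_cost(p, q, cost, reg=0.1, n_iters=200):
    """Entropic-regularized OT cost between histograms p (m,) and q (n,).

    `cost` is an (m, n) matrix of ground costs between teacher classes
    and student classes (an assumption of this sketch).
    """
    p = np.clip(np.asarray(p, dtype=float), 1e-12, None)
    q = np.clip(np.asarray(q, dtype=float), 1e-12, None)
    p, q = p / p.sum(), q / q.sum()
    kernel = np.exp(-cost / reg)          # Gibbs kernel
    u = np.ones_like(p)
    for _ in range(n_iters):              # alternate marginal scalings
        v = q / (kernel.T @ u)
        u = p / (kernel @ v)
    plan = u[:, None] * kernel * v[None, :]
    return float(np.sum(plan * cost))


def faculty_score(teacher_probs, student_probs, cost):
    """Mean OT cost over samples; lower means more related (assumed)."""
    return float(np.mean([
        sinkhorn_cost(p, q, cost)
        for p, q in zip(teacher_probs, student_probs)
    ]))


def select_teacher(faculty, student_probs):
    """Pick the teacher with the lowest score.

    `faculty` maps a teacher name to (teacher_probs, cost_matrix).
    """
    scores = {
        name: faculty_score(probs, student_probs, cost)
        for name, (probs, cost) in faculty.items()
    }
    return min(scores, key=scores.get), scores


# Toy data: teacher "A" agrees with the student, teacher "B" does not.
student = np.array([[0.8, 0.1, 0.1]])
cost = 1.0 - np.eye(3)                    # 0 for matching classes, 1 otherwise
faculty = {
    "A": (np.array([[0.8, 0.1, 0.1]]), cost),
    "B": (np.array([[0.1, 0.1, 0.8]]), cost),
}
best, scores = select_teacher(faculty, student)
print(best, scores)
```

In this toy setup the teacher whose predictions align with the student's under the given cost receives the lower score. Minimizing the same quantity over student parameters, as the abstract describes, would additionally require a differentiable implementation, which this NumPy sketch does not provide.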

