The proliferation of pre-trained models empowers knowledge distillation by providing abundant teacher resources, but a well-developed mechanism for utilizing these teachers adequately is still lacking. With a massive model repository composed of teachers pre-trained on diverse tasks, two obstacles must be surmounted when using knowledge distillation to learn a new task. First, given a fixed computing budget, it is not affordable to try every teacher and train the student repeatedly, so the most contributive teacher must be identified precisely and efficiently. Second, semantic gaps exist between the teachers and the target student because they are trained on different tasks, so knowledge must be extracted from a general label space that may differ from the student's. Faced with these two challenges, we study a new setting named selective cross-task distillation, which includes teacher assessment and generalized knowledge reuse. We bridge the teacher's label space and the student's label space through optimal transport: the transportation cost from the teacher's prediction to the student's prediction measures the relatedness between the two tasks and acts as an objective for distillation. Our method reuses cross-task knowledge from a distinct label space and efficiently assesses teachers without enumerating the entire model repository. Experiments demonstrate the effectiveness of the proposed method.
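To make the optimal-transport idea concrete, the sketch below shows one way a transport cost between a teacher prediction (over the teacher's label space) and a student prediction (over a different label space) could be computed with entropic regularization. This is a minimal illustration, not the paper's released implementation: the Sinkhorn solver, the ground-cost matrix, and all names (`sinkhorn_ot_cost`, `ground_cost`, the hyperparameters `eps` and `n_iters`) are assumptions introduced here for clarity.

```python
# Minimal sketch of an entropic optimal-transport cost between a teacher's
# prediction and a student's prediction defined over different label spaces.
# The ground cost between classes (e.g. distances between class embeddings)
# is an assumption of this illustration.
import torch
import torch.nn.functional as F


def sinkhorn_ot_cost(p_teacher, p_student, cost, eps=0.05, n_iters=50):
    """Approximate OT cost via Sinkhorn iterations.

    p_teacher: (Kt,) probabilities over the teacher's label space
    p_student: (Ks,) probabilities over the student's label space
    cost:      (Kt, Ks) ground cost between teacher and student classes
    """
    K = torch.exp(-cost / eps)                  # Gibbs kernel
    u = torch.ones_like(p_teacher)
    for _ in range(n_iters):                    # Sinkhorn fixed-point updates
        v = p_student / (K.t() @ u + 1e-9)
        u = p_teacher / (K @ v + 1e-9)
    plan = torch.diag(u) @ K @ torch.diag(v)    # approximate transport plan
    return (plan * cost).sum()                  # transport cost <plan, cost>


# Toy usage: a 5-class teacher, a 3-class student, random ground cost.
torch.manual_seed(0)
teacher_logits = torch.randn(5)
student_logits = torch.randn(3)
ground_cost = torch.rand(5, 3)

loss = sinkhorn_ot_cost(F.softmax(teacher_logits, dim=0),
                        F.softmax(student_logits, dim=0),
                        ground_cost)
print(float(loss))  # lower cost = the two predictions are more easily aligned
```

Under this reading, a small transport cost indicates that the teacher's task is closely related to the student's, which is what allows the same quantity to serve both as a teacher-assessment score and as a distillation objective.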