可视化嵌入空间以解释知识蒸馏的影响 (Visualizing the embedding space to explain the effect of knowledge distillation) - 专知论文

会员服务 ·

0

蒸馏 · Networking · Extensibility · 可约的 · 层 ·

2021 年 10 月 9 日

Visualizing the embedding space to explain the effect of knowledge distillation

翻译：可视化嵌入空间以解释知识蒸馏的影响

Hyun Seung Lee,Christian Wallraven

Recent research has found that knowledge distillation can be effective in reducing the size of a network and in increasing generalization. A pre-trained, large teacher network, for example, was shown to be able to bootstrap a student model that eventually outperforms the teacher in a limited label environment. Despite these advances, it still is relatively unclear \emph{why} this method works, that is, what the resulting student model does 'better'. To address this issue, here, we utilize two non-linear, low-dimensional embedding methods (t-SNE and IVIS) to visualize representation spaces of different layers in a network. We perform a set of extensive experiments with different architecture parameters and distillation methods. The resulting visualizations and metrics clearly show that distillation guides the network to find a more compact representation space for higher accuracy already in earlier layers compared to its non-distilled version.

翻译：最近的研究发现,知识蒸馏在缩小网络规模和增加普及方面是有效的。例如,经过预先训练的大型教师网络被证明能够将最终在有限标签环境中优于教师的学生模型套牢。尽管取得了这些进步,但这种方法仍然相对不清楚,即由此产生的学生模型“做得更好”是什么样的。为了解决这个问题,我们使用两种非线性、低维嵌入法(t-SNE和IVIS)将网络不同层的演示空间(t-SNE和IVIS)可视化。我们用不同的建筑参数和蒸馏方法进行了一系列广泛的实验。由此产生的可视化和测量清楚显示,蒸馏法引导网络找到比非蒸馏版更精准的更紧凑的展示空间。

0

相关内容

【2021新书】机器学习模型生产部署实践，161页pdf，

【2021新书】机器学习模型生产部署实践，161页pdf，

专知会员服务

113+阅读 · 2021年6月11日

【知识图谱@EMNLP2020】Knowledge Graphs in NLP @ EMNLP 2020

【知识图谱@EMNLP2020】Knowledge Graphs in NLP @ EMNLP 2020

专知会员服务

43+阅读 · 2020年11月22日

【知识图谱@ACL2020】Knowledge Graphs in Natural Language Processing

【知识图谱@ACL2020】Knowledge Graphs in Natural Language Processing

专知会员服务

66+阅读 · 2020年7月12日

知识图嵌入和可解释人工智能 Knowledge Graph Embeddings and Explainable AI

知识图嵌入和可解释人工智能 Knowledge Graph Embeddings and Explainable AI

专知会员服务

135+阅读 · 2020年5月1日

【知识图谱嵌入补全综述论文】embedding models for knowledge base completion

【知识图谱嵌入补全综述论文】embedding models for knowledge base completion

专知会员服务

102+阅读 · 2020年4月25日

图卷积神经网络蒸馏知识，Distillating Knowledge from GCN

图卷积神经网络蒸馏知识，Distillating Knowledge from GCN

专知会员服务

96+阅读 · 2020年3月25日

【Google可解释人工智能白皮书】27页pdf，AI Explainability Whitepaper ，Introduction to AI Explanations for AI Platform

【Google可解释人工智能白皮书】27页pdf，AI Explainability Whitepaper ，Introduction to AI Explanations for AI Platform

专知会员服务

127+阅读 · 2019年12月13日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

181+阅读 · 2019年10月11日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

ACL2020 | 基于Knowledge Embedding的多跳知识图谱问答

ACL2020 | 基于Knowledge Embedding的多跳知识图谱问答

AI科技评论

18+阅读 · 2020年6月29日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

meta learning 17年：MAML SNAIL

meta learning 17年：MAML SNAIL

CreateAMind

11+阅读 · 2019年1月2日

Disentangled的假设的探讨

Disentangled的假设的探讨

CreateAMind

9+阅读 · 2018年12月10日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

已删除

将门创投

4+阅读 · 2018年7月31日

Hierarchical Disentangled Representations

Hierarchical Disentangled Representations

CreateAMind

4+阅读 · 2018年4月15日

Mask-invariant Face Recognition through Template-level Knowledge Distillation

Arxiv

0+阅读 · 2021年12月10日

DisCo: Effective Knowledge Distillation For Contrastive Learning of Sentence Embeddings

DisCo: Effective Knowledge Distillation For Contrastive Learning of Sentence Embeddings

Arxiv

0+阅读 · 2021年12月10日

KGE-CL: Contrastive Learning of Knowledge Graph Embeddings

Arxiv

0+阅读 · 2021年12月9日

Does Knowledge Distillation Really Work?

Arxiv

0+阅读 · 2021年12月6日

Topology Distillation for Recommender System

Arxiv

9+阅读 · 2021年6月16日

Knowledge Distillation in Wide Neural Networks: Risk Bound, Data Efficiency and Imperfect Teacher

Arxiv

4+阅读 · 2020年10月20日

已删除

Arxiv

32+阅读 · 2020年3月23日

Knowledge Distillation from Internal Representations

Knowledge Distillation from Internal Representations

Arxiv

4+阅读 · 2019年10月8日

Knowledge Flow: Improve Upon Your Teachers

Knowledge Flow: Improve Upon Your Teachers

Arxiv

5+阅读 · 2019年4月11日

Explainable Reasoning over Knowledge Graphs for Recommendation

Arxiv

11+阅读 · 2018年11月12日

VIP会员

文章信息

相关主题

相关VIP内容

【2021新书】机器学习模型生产部署实践，161页pdf，

【2021新书】机器学习模型生产部署实践，161页pdf，

专知会员服务

113+阅读 · 2021年6月11日

【知识图谱@EMNLP2020】Knowledge Graphs in NLP @ EMNLP 2020

【知识图谱@EMNLP2020】Knowledge Graphs in NLP @ EMNLP 2020

专知会员服务

43+阅读 · 2020年11月22日

【知识图谱@ACL2020】Knowledge Graphs in Natural Language Processing

【知识图谱@ACL2020】Knowledge Graphs in Natural Language Processing

专知会员服务

66+阅读 · 2020年7月12日

知识图嵌入和可解释人工智能 Knowledge Graph Embeddings and Explainable AI

知识图嵌入和可解释人工智能 Knowledge Graph Embeddings and Explainable AI

专知会员服务

135+阅读 · 2020年5月1日

【知识图谱嵌入补全综述论文】embedding models for knowledge base completion

【知识图谱嵌入补全综述论文】embedding models for knowledge base completion

专知会员服务

102+阅读 · 2020年4月25日

图卷积神经网络蒸馏知识，Distillating Knowledge from GCN

图卷积神经网络蒸馏知识，Distillating Knowledge from GCN

专知会员服务

96+阅读 · 2020年3月25日

【Google可解释人工智能白皮书】27页pdf，AI Explainability Whitepaper ，Introduction to AI Explanations for AI Platform

【Google可解释人工智能白皮书】27页pdf，AI Explainability Whitepaper ，Introduction to AI Explanations for AI Platform

专知会员服务

127+阅读 · 2019年12月13日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

181+阅读 · 2019年10月11日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

《生成式人工智能与大/小语言模型在供应链管理决策优化与可持续性提升中的作用评估》最新51页

白宫发布《赢得AI竞赛：美国人工智能行动计划》最新28页

地下战：地下空间的战略博弈

《美地下作战条令手册》228页

相关资讯

ACL2020 | 基于Knowledge Embedding的多跳知识图谱问答

ACL2020 | 基于Knowledge Embedding的多跳知识图谱问答

AI科技评论

18+阅读 · 2020年6月29日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

meta learning 17年：MAML SNAIL

meta learning 17年：MAML SNAIL

CreateAMind

11+阅读 · 2019年1月2日

Disentangled的假设的探讨

Disentangled的假设的探讨

CreateAMind

9+阅读 · 2018年12月10日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

已删除

将门创投

4+阅读 · 2018年7月31日

Hierarchical Disentangled Representations

Hierarchical Disentangled Representations

CreateAMind

4+阅读 · 2018年4月15日

相关论文

Mask-invariant Face Recognition through Template-level Knowledge Distillation

Arxiv

0+阅读 · 2021年12月10日

DisCo: Effective Knowledge Distillation For Contrastive Learning of Sentence Embeddings

DisCo: Effective Knowledge Distillation For Contrastive Learning of Sentence Embeddings

Arxiv

0+阅读 · 2021年12月10日

KGE-CL: Contrastive Learning of Knowledge Graph Embeddings

Arxiv

0+阅读 · 2021年12月9日

Does Knowledge Distillation Really Work?

Arxiv

0+阅读 · 2021年12月6日

Topology Distillation for Recommender System

Arxiv

9+阅读 · 2021年6月16日

Knowledge Distillation in Wide Neural Networks: Risk Bound, Data Efficiency and Imperfect Teacher

Arxiv

4+阅读 · 2020年10月20日

已删除

Arxiv

32+阅读 · 2020年3月23日

Knowledge Distillation from Internal Representations

Knowledge Distillation from Internal Representations

Arxiv

4+阅读 · 2019年10月8日

Knowledge Flow: Improve Upon Your Teachers

Knowledge Flow: Improve Upon Your Teachers

Arxiv

5+阅读 · 2019年4月11日

Explainable Reasoning over Knowledge Graphs for Recommendation

Arxiv

11+阅读 · 2018年11月12日

微信扫码咨询专知VIP会员