With the development of deep learning, advanced dialogue generation methods usually demand substantial computational resources. One promising approach to obtaining a high-performance yet lightweight model is knowledge distillation, which relies heavily on a powerful pre-trained teacher model. Collaborative learning, also known as online knowledge distillation, is an effective way to conduct one-stage group distillation when a well-trained large teacher model is unavailable. However, previous work suffers from a severe branch homogeneity problem, since all branches share the same training objective and are trained on independent and identically distributed training sets. To alleviate this problem, we incorporate dialogue attributes into the training of the network branches, so that each branch learns attribute-related features from its selected data subset. Furthermore, we propose a dual group-based knowledge distillation method, consisting of positive distillation and negative distillation, that further diversifies the features of different branches in a stable and interpretable way. The proposed approach significantly improves branch heterogeneity and outperforms state-of-the-art collaborative learning methods on two widely used open-domain dialogue datasets.
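The abstract describes the dual group-based distillation only at a high level. As a rough illustration of how positive and negative group distillation could be combined for a single branch, the following PyTorch-style snippet is a minimal sketch, assuming KL-divergence-based distillation over token distributions; the function and parameter names (`dual_group_distillation_loss`, `positive_group`, `negative_group`, `tau`, `neg_weight`) are hypothetical and not taken from the paper.

```python
# Illustrative sketch only: the exact loss formulation is not given in the abstract.
# Assumed setup: each branch is a dialogue generation model producing token logits;
# "positive distillation" pulls the current branch toward branches trained on
# related attribute subsets, while "negative distillation" pushes it away from
# unrelated ones.
import torch
import torch.nn.functional as F

def dual_group_distillation_loss(student_logits, positive_group, negative_group,
                                 tau=2.0, neg_weight=0.1):
    """KL-based group distillation loss for one branch.

    student_logits: [batch, seq, vocab] logits of the current branch.
    positive_group / negative_group: lists of logits from the other branches,
    detached so gradients only flow into the current branch.
    """
    log_p = F.log_softmax(student_logits / tau, dim=-1)
    loss = student_logits.new_zeros(())

    # Positive distillation: match the averaged distribution of related branches.
    if positive_group:
        pos_probs = torch.stack(
            [F.softmax(t.detach() / tau, dim=-1) for t in positive_group]).mean(0)
        loss = loss + F.kl_div(log_p, pos_probs, reduction="batchmean") * tau ** 2

    # Negative distillation: penalize similarity to unrelated branches.
    if negative_group:
        neg_probs = torch.stack(
            [F.softmax(t.detach() / tau, dim=-1) for t in negative_group]).mean(0)
        loss = loss - neg_weight * F.kl_div(log_p, neg_probs, reduction="batchmean") * tau ** 2

    return loss
```

In this sketch the negative term simply subtracts a down-weighted KL divergence, which is one plausible way to encourage branch heterogeneity; the actual method in the paper may differ.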