DialogZoo: Large-Scale Dialog-Oriented Task Learning - Zhuanzhi Papers



May 25, 2022

DialogZoo: Large-Scale Dialog-Oriented Task Learning


Zhi Chen, Jijia Bao, Lu Chen, Yuncong Liu, Da Ma, Bei Chen, Mengyue Wu, Su Zhu, Jian-Guang Lou, Kai Yu

From arXiv. Work in progress.

Building unified conversational agents has been a long-standing goal of the dialogue research community. Most previous work focuses on only a subset of dialogue tasks. In this work, we aim to build a unified foundation model that can solve a large and diverse set of dialogue tasks. To achieve this goal, we first collect a large-scale, well-labeled dialogue dataset from 73 publicly available datasets. In addition to this dataset, we propose two dialogue-oriented self-supervised tasks, and finally use the mixture of supervised and self-supervised datasets to train our foundation model. The supervised examples make the model learn task-specific skills, while the self-supervised examples make the model learn more general skills. We evaluate our model on various downstream dialogue tasks. The experimental results show that our method improves not only the ability of dialogue generation and knowledge distillation, but also the representation ability of models.
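
The abstract does not say how the supervised and self-supervised datasets are mixed. As a rough illustration only, the sketch below draws training examples from several sources by weighted sampling. The function name `mix_examples`, the source names, the weights, and the toy examples are all hypothetical and are not taken from the paper.

```python
import random
from typing import Dict, List, Tuple

# An example is an (input text, target text) pair; this format is an assumption.
Example = Tuple[str, str]


def mix_examples(
    sources: Dict[str, List[Example]],
    weights: Dict[str, float],
    n: int,
    seed: int = 0,
) -> List[Tuple[str, Example]]:
    """Draw n examples, picking a source by weight and then an example uniformly.

    This is a generic weighted-mixture sampler, not the paper's procedure.
    """
    rng = random.Random(seed)
    names = sorted(sources)
    probs = [weights[name] for name in names]
    mixed = []
    for _ in range(n):
        # Choose which dataset to sample from, proportional to its weight.
        name = rng.choices(names, weights=probs, k=1)[0]
        # Choose one example from that dataset uniformly at random.
        mixed.append((name, rng.choice(sources[name])))
    return mixed


if __name__ == "__main__":
    # Toy, made-up data standing in for the two kinds of training examples.
    toy_sources = {
        "supervised": [("user: book a table for two", "intent: restaurant_booking")],
        "self_supervised": [("user: hello [MASK] bot: hi there", "[MASK] = <turn separator>")],
    }
    toy_weights = {"supervised": 0.7, "self_supervised": 0.3}
    for name, (inp, tgt) in mix_examples(toy_sources, toy_weights, n=5):
        print(name, "|", inp, "->", tgt)
```

Fixing the seed keeps the sampled stream reproducible. In practice the weights would be tuned, and the abstract gives no values for them.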


