TOD-DA: Towards Boosting the Robustness of Task-oriented Dialogue Modeling on Spoken Conversations


Task-oriented dialogue systems · Robustness · Boosting · MoDELS · Data augmentation

December 23, 2021


Xin Tian, Xinxian Huang, Dongfeng He, Yingzhan Lin, Siqi Bao, Huang He, Liankai Huang, Qiang Ju, Xiyuan Zhang, Jian Xie, Shuqi Sun, Fan Wang, Hua Wu, Haifeng Wang

From arXiv. Accepted to the AAAI-22 DSTC10 Workshop. The first three authors contributed equally to this work.

Task-oriented dialogue systems have been plagued by the difficulties of obtaining large-scale and high-quality annotated conversations. Furthermore, most of the publicly available datasets only include written conversations, which are insufficient to reflect actual human behaviors in practical spoken dialogue systems. In this paper, we propose Task-oriented Dialogue Data Augmentation (TOD-DA), a novel model-agnostic data augmentation paradigm to boost the robustness of task-oriented dialogue modeling on spoken conversations. The TOD-DA consists of two modules: 1) Dialogue Enrichment to expand training data on task-oriented conversations for easing data sparsity and 2) Spoken Conversation Simulator to imitate oral style expressions and speech recognition errors in diverse granularities for bridging the gap between written and spoken conversations. With such designs, our approach ranked first in both tasks of DSTC10 Track2, a benchmark for task-oriented dialogue modeling on spoken conversations, demonstrating the superiority and effectiveness of our proposed TOD-DA.
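To make the idea of simulating spoken conversations "in diverse granularities" more concrete, here is a minimal Python sketch of text-level noise injection. It is not the authors' implementation: the abstract gives no algorithmic details, so the filler list, the homophone table, the noise rate, and all function names below are hypothetical choices made purely for illustration. The sketch applies word-level substitutions and character-level deletions to mimic speech recognition errors, inserts filler words to mimic oral style, and then adds the noisy variants to the original written utterances.

```python
import random

# Hypothetical lookup tables, invented for illustration only.
FILLERS = ["uh", "um", "you know"]
HOMOPHONES = {"two": "to", "for": "four", "there": "their", "right": "write"}


def add_fillers(tokens, rate, rng):
    """Oral-style noise: insert a filler before some tokens."""
    out = []
    for tok in tokens:
        if rng.random() < rate:
            out.append(rng.choice(FILLERS))
        out.append(tok)
    return out


def swap_homophones(tokens, rate, rng):
    """Word-level ASR-like noise: swap a word for a similar-sounding one."""
    return [
        HOMOPHONES[tok] if tok in HOMOPHONES and rng.random() < rate else tok
        for tok in tokens
    ]


def drop_characters(tokens, rate, rng):
    """Character-level ASR-like noise: delete one character from some longer words."""
    out = []
    for tok in tokens:
        if len(tok) > 3 and rng.random() < rate:
            i = rng.randrange(len(tok))
            tok = tok[:i] + tok[i + 1:]
        out.append(tok)
    return out


def simulate_spoken(utterance, rate=0.3, seed=0):
    """Turn a written utterance into a noisy, spoken-style variant."""
    rng = random.Random(seed)  # seeded so the output is reproducible
    tokens = utterance.lower().split()
    tokens = swap_homophones(tokens, rate, rng)
    tokens = drop_characters(tokens, rate, rng)
    tokens = add_fillers(tokens, rate, rng)
    return " ".join(tokens)


def augment(utterances, copies=2, rate=0.3):
    """Keep each written utterance and add `copies` noisy variants of it."""
    augmented = []
    for idx, utt in enumerate(utterances):
        augmented.append(utt)
        for k in range(copies):
            augmented.append(simulate_spoken(utt, rate, seed=idx * copies + k))
    return augmented


if __name__ == "__main__":
    for line in augment(["Book a table for two near the station"]):
        print(line)
```

Because the noise is applied to the training text rather than inside any particular network, a scheme like this stays independent of the downstream model, which is the sense in which the abstract calls the approach model-agnostic.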

