Dialogue state tracking (DST) aims to convert the dialogue history into dialogue states, which consist of slot-value pairs. Since the dialogue state condenses the full history into structured information, DST models typically take the state predicted at the previous turn as input when predicting the current state. However, these models tend to keep the predicted slot values unchanged, a phenomenon we define as state momentum in this paper. Specifically, the models struggle both to update slot values that need to change and to correct slot values wrongly predicted at the previous turn. To this end, we propose MoNET, which tackles state momentum via noise-enhanced training. First, the previous state of each turn in the training data is noised by replacing some of its slot values. Then, the noised previous state is used as input for learning to predict the current state, improving the model's ability to update and correct slot values. Furthermore, a contrastive context matching framework is designed to narrow the representation distance between a state and its corresponding noised variant, which reduces the impact of the noise and helps the model better understand the dialogue history. Experimental results on MultiWOZ datasets show that MoNET outperforms previous DST methods. Ablations and further analysis verify the effectiveness of MoNET in alleviating state momentum and improving robustness to noise.
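To make the noising step concrete, here is a minimal sketch (not the authors' implementation; the function name, the dict-based state representation, and the per-slot ontology are assumptions for illustration) of replacing a fraction of slot values in a previous dialogue state with other candidate values:

```python
import random

def noise_state(state, ontology, noise_ratio=0.3, seed=None):
    """Return a noised copy of a dialogue state (slot -> value dict).

    A randomly chosen fraction of slots has its value replaced by a
    different candidate value from that slot's ontology, mimicking a
    stale or wrongly predicted previous-turn state.
    """
    rng = random.Random(seed)
    noised = dict(state)
    # Only slots with at least one alternative value can be noised.
    slots = [s for s in state if len(ontology.get(s, [])) > 1]
    k = max(1, int(len(slots) * noise_ratio)) if slots else 0
    for slot in rng.sample(slots, k):
        candidates = [v for v in ontology[slot] if v != state[slot]]
        noised[slot] = rng.choice(candidates)
    return noised
```

During training, the model would receive `noise_state(prev_state, ontology)` instead of the gold previous state, forcing it to learn to overwrite incorrect values rather than copy them forward.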