Controllable Dialogue Simulation with In-Context Learning - 专知 Papers


Task-oriented dialogue systems · Controllers · Learning · Context · Automator

October 25, 2022

Controllable Dialogue Simulation with In-Context Learning


Zekun Li,Wenhu Chen,Shiyang Li,Hong Wang,Jing Qian,Xifeng Yan

From arXiv; EMNLP 2022 Findings. Code and data are available at https://github.com/Leezekun/dialogic

Building dialogue systems requires a large corpus of annotated dialogues. Such datasets are usually created via crowdsourcing, which is expensive and time-consuming. In this paper, we propose Dialogic, a novel dialogue simulation method based on large language model in-context learning to automate dataset creation. Seeded with a few annotated dialogues, Dialogic automatically selects in-context examples for demonstration and prompts GPT-3 to generate new dialogues and annotations in a controllable way. Our method can rapidly expand a small set of dialogue data with minimal or zero human involvement and parameter updates, and is thus much more cost-efficient and time-saving than crowdsourcing. Experimental results on the MultiWOZ dataset demonstrate that training a model on the simulated dialogues leads to even better performance than using the same amount of human-generated dialogues under challenging low-resource settings, with as few as 85 dialogues as a seed. When the full training set is given, our method can still serve as an effective data augmentation method to further improve performance. Human evaluation results show that our simulated dialogues have near-human fluency and annotation accuracy. The code and data are available at https://github.com/Leezekun/dialogic.
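The abstract's core loop (pick in-context examples from a small seed set, then prompt a language model to continue a new dialogue) can be illustrated with a minimal sketch. Everything named here is an assumption made for illustration and is not taken from the Dialogic code base: the `SeedDialogue` type, the slot-name Jaccard overlap used to rank examples, and the `Goal:` / `User:` prompt layout. The sketch only builds the prompt string; sending it to a model is left out.

```python
from dataclasses import dataclass


@dataclass
class SeedDialogue:
    """Hypothetical container for one annotated seed dialogue."""
    goal: dict[str, str]            # slot name -> value, e.g. {"hotel-area": "north"}
    turns: list[tuple[str, str]]    # (speaker, utterance) pairs


def slot_overlap(a: dict[str, str], b: dict[str, str]) -> float:
    """Jaccard similarity between the slot names of two goals (assumed metric)."""
    names_a, names_b = set(a), set(b)
    if not names_a and not names_b:
        return 0.0
    return len(names_a & names_b) / len(names_a | names_b)


def select_examples(seeds: list[SeedDialogue], target_goal: dict[str, str],
                    k: int = 2) -> list[SeedDialogue]:
    """Return the k seed dialogues whose goals best overlap the target goal."""
    ranked = sorted(seeds, key=lambda s: slot_overlap(s.goal, target_goal),
                    reverse=True)
    return ranked[:k]


def format_goal(goal: dict[str, str]) -> str:
    """Render a goal as 'slot=value' pairs in a stable (sorted) order."""
    return ", ".join(f"{slot}={value}" for slot, value in sorted(goal.items()))


def build_prompt(seeds: list[SeedDialogue], target_goal: dict[str, str],
                 k: int = 2) -> str:
    """Concatenate k demonstrations, then open a new dialogue for the target goal."""
    blocks = []
    for example in select_examples(seeds, target_goal, k):
        lines = [f"Goal: {format_goal(example.goal)}"]
        lines += [f"{speaker}: {utterance}" for speaker, utterance in example.turns]
        blocks.append("\n".join(lines))
    # The trailing "User:" invites the model to continue with a new dialogue.
    blocks.append(f"Goal: {format_goal(target_goal)}\nUser:")
    return "\n\n".join(blocks)
```

Calling `build_prompt(seeds, {"hotel-area": "south", "hotel-stars": "4"}, k=1)` with a hotel seed and a train seed would place the hotel dialogue in the prompt, since its goal shares a slot name with the target. Controlling the target goal is what makes the generated dialogue steerable in this sketch.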

