Conditional set generation learns a mapping from an input sequence of tokens to a set. Several NLP tasks, such as entity typing and dialogue emotion tagging, are instances of set generation. Sequence-to-sequence~(Seq2seq) models are a popular choice for modeling set generation, but they treat a set as a sequence and do not fully leverage its key properties, namely order-invariance and cardinality. We propose a novel algorithm for effectively sampling informative orders over the combinatorial space of label orders. Further, we jointly model the set cardinality and output by prepending the set size as the first element of the target sequence, taking advantage of the autoregressive factorization used by Seq2seq models. Our method is a model-independent data augmentation approach that endows any Seq2seq model with the signals of order-invariance and cardinality. Training a Seq2seq model on this augmented data~(without any additional annotations) yields an average relative improvement of 20% on four benchmark datasets across models spanning BART-base, T5-xxl, and GPT-3.
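To make the augmentation concrete, below is a minimal sketch of how a label set could be turned into Seq2seq targets that carry both signals: the cardinality is emitted as the first token, and the labels follow in several sampled orders. The function name `augment_set_targets` is hypothetical, and plain random permutations stand in for the paper's informative-order sampling, which the abstract does not specify in detail.

```python
import random

def augment_set_targets(labels, num_orders=3, seed=0):
    """Turn a label set into multiple Seq2seq target strings.

    Each target starts with the set cardinality, so an autoregressive
    Seq2seq model jointly predicts the set size and its elements; the
    multiple orderings expose the model to order-invariance.

    NOTE: random permutations are a stand-in for the paper's
    informative-order sampling algorithm (not detailed in the abstract).
    """
    rng = random.Random(seed)
    labels = sorted(labels)  # fix a base order for reproducibility
    targets = []
    for _ in range(num_orders):
        order = labels[:]
        rng.shuffle(order)  # hypothetical stand-in for order sampling
        # Prepend the cardinality as the first output token.
        targets.append(f"{len(order)} " + " ".join(order))
    return targets

# Example: an entity-typing instance whose gold label set has 3 elements.
print(augment_set_targets({"person", "artist", "musician"}))
# e.g. ['3 musician artist person', '3 person musician artist', ...]
```

Since the augmentation only rewrites target strings, it requires no extra annotations and can be applied in front of any Seq2seq model, consistent with the model-independence claim above.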