转换序列拖到 A Seq2SeqSeq 任务 (Transforming Sequence Tagging Into A Seq2Seq Task) - 专知论文

会员服务 ·

0

seq2seq · 可理解性 · MoDELS · 输出 · 变换 ·

2022 年 10 月 25 日

Transforming Sequence Tagging Into A Seq2Seq Task

翻译：转换序列拖到 A Seq2SeqSeq 任务

Karthik Raman,Iftekhar Naim,Jiecao Chen,Kazuma Hashimoto,Kiran Yalasangi,Krishna Srinivasan

from arxiv, Accepted at EMNLP 2022

Pretrained, large, generative language models (LMs) have had great success in a wide range of sequence tagging and structured prediction tasks. Casting a sequence tagging task as a Seq2Seq one requires deciding the formats of the input and output sequences. However, we lack a principled understanding of the trade-offs associated with these formats (such as the effect on model accuracy, sequence length, multilingual generalization, hallucination). In this paper, we rigorously study different formats one could use for casting input text sentences and their output labels into the input and target (i.e., output) of a Seq2Seq model. Along the way, we introduce a new format, which we show to to be both simpler and more effective. Additionally the new format demonstrates significant gains in the multilingual settings -- both zero-shot transfer learning and joint training. Lastly, we find that the new format is more robust and almost completely devoid of hallucination -- an issue we find common in existing formats. With well over a 1000 experiments studying 14 different formats, over 7 diverse public benchmarks -- including 3 multilingual datasets spanning 7 languages -- we believe our findings provide a strong empirical basis in understanding how we should tackle sequence tagging tasks.

翻译：在一系列广泛的序列标记和结构化的预测任务中,先入为主的、大型的、具有基因特征的语言模型(LMS)取得了巨大成功。作为Seq2Seq1, 将序列标记任务作为Seq2Seq1, 需要决定输入和输出序列的格式。然而,我们对这些格式的权衡缺乏原则性理解(例如,对模型精确度、序列长度、多语种概括、幻觉的影响等)。在本文中,我们严格地研究一种不同的格式,一种可以用来将输入文本句及其输出标签输入Seq2Seq模型的投入和目标(即产出)。在前进的道路上,我们引入一种新的格式,我们表明这种格式既简单又有效。此外,新格式展示了多语种环境中的重大收益 -- -- 零光转学和联合培训。最后,我们发现,新格式更加稳健,几乎完全没有幻觉 -- -- 我们在现有格式中发现一个共同的问题。超过1000个实验研究了14种不同格式,超过7种不同的公共基准 -- -- 包括3种跨7种多语言的多语种多语种数据集 -- -- -- -- 我们相信我们的调查结果提供了坚实的经验基础,我们理解了我们如何解决的序列任务。

0

相关内容

seq2seq

seq2seq 是一个Encoder–Decoder 结构的网络，它的输入是一个序列，输出也是一个序列， Encoder 中将一个可变长度的信号序列变为固定长度的向量表达，Decoder 将这个固定长度的向量变成可变长度的目标的信号序列

NeurlPS 2022 | 自然语言处理相关论文分类整理

NeurlPS 2022 | 自然语言处理相关论文分类整理

专知会员服务

51+阅读 · 2022年10月2日

NLP必读经典文献100篇

专知会员服务

124+阅读 · 2020年9月8日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

95+阅读 · 2020年3月12日

【跨语言BERT模型大集合】Transfer learning is increasingly going multilingual with language-specific BERT models

专知会员服务

54+阅读 · 2020年1月30日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium8

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium8

中国图象图形学学会CSIG

0+阅读 · 2021年11月16日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium6

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium6

中国图象图形学学会CSIG

2+阅读 · 2021年11月12日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium2

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium2

中国图象图形学学会CSIG

0+阅读 · 2021年11月8日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

无监督元学习表示学习

无监督元学习表示学习

CreateAMind

27+阅读 · 2019年1月4日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

ResNet, AlexNet, VGG, Inception：各种卷积网络架构的理解

ResNet, AlexNet, VGG, Inception：各种卷积网络架构的理解

全球人工智能

20+阅读 · 2017年12月17日

Capsule Networks解析

Capsule Networks解析

机器学习研究会

11+阅读 · 2017年11月12日

【推荐】用Tensorflow理解LSTM

【推荐】用Tensorflow理解LSTM

机器学习研究会

36+阅读 · 2017年9月11日

矮牵牛DUF620蛋白家族基因PhADR1的功能及调控机理解析

国家自然科学基金

0+阅读 · 2015年12月31日

PPR蛋白OsPPR920参与调控水稻花粉发育的分子机制研究

国家自然科学基金

0+阅读 · 2015年12月31日

PPAR β/δ基因在结直肠癌血管生成调控中的作用及分子机理

国家自然科学基金

2+阅读 · 2014年12月31日

泛素化在棉花曲叶病毒侵染中的作用

国家自然科学基金

0+阅读 · 2014年12月31日

板结性退化草地（羊草）土壤-根系复合体结构与耕作部件作用关系研究

国家自然科学基金

0+阅读 · 2013年12月31日

退化红壤区人工林林下植物根系生长与凋落物分解的互作机制

国家自然科学基金

0+阅读 · 2012年12月31日

整合寄主因子（IHF）对水稻基腐病细菌致病性调控机理研究

国家自然科学基金

0+阅读 · 2012年12月31日

5羟甲基胞嘧啶及Tet蛋白在早期胚胎DNA去甲基化过程中的作用机制研究

国家自然科学基金

0+阅读 · 2011年12月31日

杨树种絮发育的分子调控机理研究

国家自然科学基金

0+阅读 · 2011年12月31日

调控家蚕发育非编码RNA（non-coding RNA, ncRNA）的功能解析

国家自然科学基金

0+阅读 · 2011年12月31日

Generic Tagging for RISC-V Binaries

Arxiv

0+阅读 · 2022年12月11日

Leveraging Unlabeled Data to Track Memorization

Arxiv

0+阅读 · 2022年12月8日

Whose Emotion Matters? Speaker Detection without Prior Knowledge

Arxiv

0+阅读 · 2022年12月8日

Successive Prompting for Decomposing Complex Questions

Arxiv

0+阅读 · 2022年12月8日

Multimodal Personality Recognition using Cross-Attention Transformer and Behaviour Encoding

Arxiv

0+阅读 · 2022年12月7日

Sequence Level Contrastive Learning for Text Summarization

Sequence Level Contrastive Learning for Text Summarization

Arxiv

14+阅读 · 2021年9月24日

Updating Embeddings for Dynamic Knowledge Graphs

Arxiv

20+阅读 · 2021年9月22日

Learning from History: Modeling Temporal Knowledge Graphs with Sequential Copy-Generation Networks

Arxiv

11+阅读 · 2020年12月15日

Reasoning in Dialog: Improving Response Generation by Context Reading Comprehension

Arxiv

12+阅读 · 2020年12月14日

Semi-supervised Medical Image Segmentation through Dual-task Consistency

Arxiv

14+阅读 · 2020年9月9日

VIP会员

文章信息

相关主题

相关VIP内容

NeurlPS 2022 | 自然语言处理相关论文分类整理

NeurlPS 2022 | 自然语言处理相关论文分类整理

专知会员服务

51+阅读 · 2022年10月2日

NLP必读经典文献100篇

专知会员服务

124+阅读 · 2020年9月8日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

95+阅读 · 2020年3月12日

【跨语言BERT模型大集合】Transfer learning is increasingly going multilingual with language-specific BERT models

专知会员服务

54+阅读 · 2020年1月30日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

《小型无人机系统侦测追踪技术：声学、计算机视觉与深度学习融合方案》最新98页

《"牧羊人网格"拦截策略：实现无人机集群可靠拦截的新范式》

光纤无人机：反无人机系统的重大挑战

《作战建模与仿真实证研究》

相关资讯

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium8

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium8

中国图象图形学学会CSIG

0+阅读 · 2021年11月16日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium6

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium6

中国图象图形学学会CSIG

2+阅读 · 2021年11月12日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium2

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium2

中国图象图形学学会CSIG

0+阅读 · 2021年11月8日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

无监督元学习表示学习

无监督元学习表示学习

CreateAMind

27+阅读 · 2019年1月4日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

ResNet, AlexNet, VGG, Inception：各种卷积网络架构的理解

ResNet, AlexNet, VGG, Inception：各种卷积网络架构的理解

全球人工智能

20+阅读 · 2017年12月17日

Capsule Networks解析

Capsule Networks解析

机器学习研究会

11+阅读 · 2017年11月12日

【推荐】用Tensorflow理解LSTM

【推荐】用Tensorflow理解LSTM

机器学习研究会

36+阅读 · 2017年9月11日

相关论文

Generic Tagging for RISC-V Binaries

Arxiv

0+阅读 · 2022年12月11日

Leveraging Unlabeled Data to Track Memorization

Arxiv

0+阅读 · 2022年12月8日

Whose Emotion Matters? Speaker Detection without Prior Knowledge

Arxiv

0+阅读 · 2022年12月8日

Successive Prompting for Decomposing Complex Questions

Arxiv

0+阅读 · 2022年12月8日

Multimodal Personality Recognition using Cross-Attention Transformer and Behaviour Encoding

Arxiv

0+阅读 · 2022年12月7日

Sequence Level Contrastive Learning for Text Summarization

Sequence Level Contrastive Learning for Text Summarization

Arxiv

14+阅读 · 2021年9月24日

Updating Embeddings for Dynamic Knowledge Graphs

Arxiv

20+阅读 · 2021年9月22日

Learning from History: Modeling Temporal Knowledge Graphs with Sequential Copy-Generation Networks

Arxiv

11+阅读 · 2020年12月15日

Reasoning in Dialog: Improving Response Generation by Context Reading Comprehension

Arxiv

12+阅读 · 2020年12月14日

Semi-supervised Medical Image Segmentation through Dual-task Consistency

Arxiv

14+阅读 · 2020年9月9日

相关基金

矮牵牛DUF620蛋白家族基因PhADR1的功能及调控机理解析

国家自然科学基金

0+阅读 · 2015年12月31日

PPR蛋白OsPPR920参与调控水稻花粉发育的分子机制研究

国家自然科学基金

0+阅读 · 2015年12月31日

PPAR β/δ基因在结直肠癌血管生成调控中的作用及分子机理

国家自然科学基金

2+阅读 · 2014年12月31日

泛素化在棉花曲叶病毒侵染中的作用

国家自然科学基金

0+阅读 · 2014年12月31日

板结性退化草地（羊草）土壤-根系复合体结构与耕作部件作用关系研究

国家自然科学基金

0+阅读 · 2013年12月31日

退化红壤区人工林林下植物根系生长与凋落物分解的互作机制

国家自然科学基金

0+阅读 · 2012年12月31日

整合寄主因子（IHF）对水稻基腐病细菌致病性调控机理研究

国家自然科学基金

0+阅读 · 2012年12月31日

5羟甲基胞嘧啶及Tet蛋白在早期胚胎DNA去甲基化过程中的作用机制研究

国家自然科学基金

0+阅读 · 2011年12月31日

杨树种絮发育的分子调控机理研究

国家自然科学基金

0+阅读 · 2011年12月31日

调控家蚕发育非编码RNA（non-coding RNA, ncRNA）的功能解析

国家自然科学基金

0+阅读 · 2011年12月31日

微信扫码咨询专知VIP会员