Being able to parse code-switched (CS) utterances, such as Spanish+English or Hindi+English, is essential to democratizing task-oriented semantic parsing systems for certain locales. In this work, we focus on Spanglish (Spanish+English) and release CSTOP, a dataset of 5,800 CS utterances with their semantic parses. We examine the CS generalizability of various cross-lingual (XL) models and demonstrate the advantage of pre-trained XL language models when data for only one language is available. We therefore focus on improving pre-trained models for the case where only an English corpus, alongside either zero or a few CS training instances, is available. We propose two data augmentation methods for the zero-shot and the few-shot settings, respectively: fine-tuning with translate-and-align, and augmenting via a generation model followed by match-and-filter. Combining the few-shot setting with these improvements closes two thirds of the initial 30-point accuracy gap between the zero-shot and full-data settings.
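For concreteness, below is a minimal sketch of the translate-and-align idea: translate the English utterance, word-align it with the translation, and project the slot spans across the alignment. The translate() and align() helpers are hypothetical stubs (e.g., backed by an MT system and a word aligner), and swapping the projected spans back into the English sentence to form a synthetic CS utterance is one plausible use of the alignment, not necessarily the paper's exact recipe.

```python
# A minimal sketch of translate-and-align augmentation, assuming
# hypothetical translate() and align() helpers; this is an illustration,
# not the authors' implementation.
from typing import Dict, List, Tuple

Span = Tuple[int, int]  # [start, end) token span

def translate(tokens: List[str]) -> List[str]:
    """Hypothetical stub: machine-translate English tokens into Spanish."""
    raise NotImplementedError

def align(src: List[str], tgt: List[str]) -> List[Tuple[int, int]]:
    """Hypothetical stub: return (src_idx, tgt_idx) word-alignment pairs."""
    raise NotImplementedError

def project_slots(slots: Dict[str, Span],
                  alignment: List[Tuple[int, int]]) -> Dict[str, Span]:
    """Project each slot span onto the translation via the word alignment,
    keeping the min/max aligned target indices as the projected span."""
    projected = {}
    for name, (start, end) in slots.items():
        tgt_idxs = [t for s, t in alignment if start <= s < end]
        if tgt_idxs:  # drop slots whose tokens have no aligned target words
            projected[name] = (min(tgt_idxs), max(tgt_idxs) + 1)
    return projected

def make_cs_utterance(tokens: List[str], slots: Dict[str, Span]) -> List[str]:
    """Build one synthetic code-switched utterance by swapping the
    translated slot spans into the English sentence."""
    tgt = translate(tokens)
    tgt_slots = project_slots(slots, align(tokens, tgt))
    mixed = list(tokens)
    # Replace spans right-to-left so earlier indices stay valid when a
    # replacement span has a different length than the original.
    for name, (s, e) in sorted(slots.items(), key=lambda kv: -kv[1][0]):
        if name in tgt_slots:
            ts, te = tgt_slots[name]
            mixed[s:e] = tgt[ts:te]
    return mixed
```

Projecting spans through the alignment (rather than re-annotating the translation) is what lets the original semantic parse be reused unchanged for fine-tuning on the synthetic data.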