Token-free approaches have been successfully applied to a series of word- and span-level tasks. In this work, we compare a byte-level (ByT5) and a wordpiece-based (mT5) sequence-to-sequence model on the 51 languages of the MASSIVE multilingual semantic parsing dataset. We examine multiple experimental settings: (i) zero-shot, (ii) full gold data, and (iii) zero-shot with synthetic data. By leveraging a state-of-the-art label projection method for machine-translated examples, we reduce the gap in exact-match accuracy to only 5 points with respect to a model trained on gold data from all the languages. We additionally provide insights into the cross-lingual transfer of ByT5 and show how the model compares with mT5 across all parameter sizes.