Logical approaches to representing language have developed and evaluated computational models of quantifier words since the 19th century, but today's NLU models still struggle to capture their semantics. We rely on Generalized Quantifier Theory for language-independent representations of the semantics of quantifier words, in order to quantify their contribution to the errors of NLU models. We find that quantifiers are pervasive in NLU benchmarks, and that their occurrence at test time is associated with performance drops. Multilingual models also exhibit unsatisfactory quantifier reasoning abilities, though not necessarily worse for non-English languages. To facilitate directly targeted probing, we present an adversarial generalized quantifier NLI task (GQNLI) and show that pre-trained language models clearly lack robustness in generalized quantifier reasoning.
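Generalized Quantifier Theory treats a quantifier as a relation between sets, which is what makes its representations language-independent. A minimal sketch of the standard set-theoretic truth conditions (an illustration using textbook definitions, not code from the paper):

```python
# Quantifiers as relations between a restrictor set A and a scope set B,
# following the standard Generalized Quantifier Theory definitions.

def all_(A: set, B: set) -> bool:
    """'all A are B' holds iff A is a subset of B."""
    return A <= B

def some(A: set, B: set) -> bool:
    """'some A are B' holds iff A and B overlap."""
    return bool(A & B)

def no(A: set, B: set) -> bool:
    """'no A are B' holds iff A and B are disjoint."""
    return not (A & B)

def most(A: set, B: set) -> bool:
    """'most A are B' holds iff |A ∩ B| > |A \\ B|."""
    return len(A & B) > len(A - B)

# Example: three dogs, two of which bark.
dogs = {"rex", "fido", "spot"}
barkers = {"rex", "fido"}
print(all_(dogs, barkers))  # False: spot does not bark
print(most(dogs, barkers))  # True: 2 of 3 dogs bark
```

The same truth conditions apply regardless of which language the quantifier word comes from, which is why such representations can be used to analyze errors across multilingual benchmarks.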