评估输入扰动神经语言模型的强力性 (Evaluating the Robustness of Neural Language Models to Input Perturbations) - 专知论文

会员服务 ·

0

语言模型化 · 神经语言模型 · 稳健性 · MoDELS · NLP ·

2021 年 8 月 27 日

Evaluating the Robustness of Neural Language Models to Input Perturbations

翻译：评估输入扰动神经语言模型的强力性

Milad Moradi,Matthias Samwald

from arxiv, Accepted by EMNLP 2021

High-performance neural language models have obtained state-of-the-art results on a wide range of Natural Language Processing (NLP) tasks. However, results for common benchmark datasets often do not reflect model reliability and robustness when applied to noisy, real-world data. In this study, we design and implement various types of character-level and word-level perturbation methods to simulate realistic scenarios in which input texts may be slightly noisy or different from the data distribution on which NLP systems were trained. Conducting comprehensive experiments on different NLP tasks, we investigate the ability of high-performance language models such as BERT, XLNet, RoBERTa, and ELMo in handling different types of input perturbations. The results suggest that language models are sensitive to input perturbations and their performance can decrease even when small changes are introduced. We highlight that models need to be further improved and that current benchmarks are not reflecting model robustness well. We argue that evaluations on perturbed inputs should routinely complement widely-used benchmarks in order to yield a more realistic understanding of NLP systems robustness.

翻译：高性能神经语言模型在一系列广泛的自然语言处理(NLP)任务中取得了最先进的结果,然而,通用基准数据集的结果往往不能反映模型的可靠性和稳健性,而应用于吵闹的、真实世界的数据。在本研究中,我们设计和执行了各种类型的品格水平和字级扰动方法,以模拟现实情景,在这些情景中,输入文本可能略为吵动或与NLP系统所培训的数据分布不同。对不同的自然语言处理任务进行全面试验,我们调查高性能语言模型(如BERT、XLNet、ROBERTA和ELMO)在处理不同类型输入扰动时的能力。结果显示,语言模型对输入扰动很敏感,即使在引入小的改动时,其性能也会降低。我们强调,模型需要进一步改进,目前的基准并不反映模型稳健。我们主张,关于渗透性投入的评价应经常补充广泛使用的基准,以便更现实地了解NLP系统是否稳健。

0

相关内容

语言模型化

语言模型化

深度概率图模型，Deep Probabilistic Models

专知会员服务

29+阅读 · 2021年8月2日

【斯坦福CS224N硬核课】自然语言生成NLG，79页ppt

专知会员服务

37+阅读 · 2021年2月22日

【Google】深度学习对抗鲁棒性，43页ppt

专知会员服务

45+阅读 · 2020年10月31日

【CIKM2020】神经逻辑推理，Neural Logic Reasoning

【CIKM2020】神经逻辑推理，Neural Logic Reasoning

专知会员服务

51+阅读 · 2020年8月25日

【ST2020硬核课】深度学习即统计学习，50页ppt

【ST2020硬核课】深度学习即统计学习，50页ppt

专知会员服务

67+阅读 · 2020年8月17日

神经网络与形式语言综述，12页pdf，A Survey of Neural Networks and Formal Languages

神经网络与形式语言综述，12页pdf，A Survey of Neural Networks and Formal Languages

专知会员服务

21+阅读 · 2020年6月4日

【微软】大型神经语言模型的对抗性训练，Adversarial Training for Large Neural Language Models

【微软】大型神经语言模型的对抗性训练，Adversarial Training for Large Neural Language Models

专知会员服务

51+阅读 · 2020年5月3日

【理解计算机视觉损失函数】《Understanding Loss Functions in Computer Vision!》by Sowmya Yellapragad

【理解计算机视觉损失函数】《Understanding Loss Functions in Computer Vision!》by Sowmya Yellapragad

专知会员服务

44+阅读 · 2020年3月4日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

Yoshua Bengio，使算法知道“为什么”

Yoshua Bengio，使算法知道“为什么”

专知会员服务

8+阅读 · 2019年10月10日

GPT3 论文详解 | GPT-3: Language Models are Few-Shot Learners

GPT3 论文详解 | GPT-3: Language Models are Few-Shot Learners

AINLP

8+阅读 · 2020年6月3日

LibRec 精选：EfficientNet、XLNet 论文及代码实现

LibRec 精选：EfficientNet、XLNet 论文及代码实现

LibRec智能推荐

5+阅读 · 2019年7月9日

ICLR2019最佳论文出炉

ICLR2019最佳论文出炉

专知

12+阅读 · 2019年5月6日

逆强化学习-学习人先验的动机

逆强化学习-学习人先验的动机

CreateAMind

16+阅读 · 2019年1月18日

清华大学NLP组整理的机器翻译论文阅读清单

清华大学NLP组整理的机器翻译论文阅读清单

AINLP

5+阅读 · 2018年12月29日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

Hierarchical Disentangled Representations

Hierarchical Disentangled Representations

CreateAMind

4+阅读 · 2018年4月15日

Capsule Networks解析

Capsule Networks解析

机器学习研究会

11+阅读 · 2017年11月12日

【学习】Hierarchical Softmax

【学习】Hierarchical Softmax

机器学习研究会

4+阅读 · 2017年8月6日

Auto-Encoding GAN

Auto-Encoding GAN

CreateAMind

7+阅读 · 2017年8月4日

Exploring Generalization Ability of Pretrained Language Models on Arithmetic and Logical Reasoning

Arxiv

0+阅读 · 2021年10月19日

Schrödinger's Tree -- On Syntax and Neural Language Models

Arxiv

0+阅读 · 2021年10月17日

On the Robustness of Reading Comprehension Models to Entity Renaming

Arxiv

0+阅读 · 2021年10月16日

An Empirical Survey of the Effectiveness of Debiasing Techniques for Pre-Trained Language Models

Arxiv

0+阅读 · 2021年10月16日

RAP: Robustness-Aware Perturbations for Defending against Backdoor Attacks on NLP Models

Arxiv

0+阅读 · 2021年10月15日

Identifying and Mitigating Spurious Correlations for Improving Robustness in NLP Models

Arxiv

2+阅读 · 2021年10月14日

Learning Neural Models for Natural Language Processing in the Face of Distributional Shift

Arxiv

11+阅读 · 2021年9月3日

Attribute-Guided Adversarial Training for Robustness to Natural Perturbations

Arxiv

15+阅读 · 2020年12月3日

Adversarial NLI: A New Benchmark for Natural Language Understanding

Arxiv

4+阅读 · 2019年10月31日

Language Models as Knowledge Bases?

Arxiv

6+阅读 · 2019年9月4日

VIP会员

文章信息

相关主题

语言模型化

神经语言模型

相关VIP内容

深度概率图模型，Deep Probabilistic Models

专知会员服务

29+阅读 · 2021年8月2日

【斯坦福CS224N硬核课】自然语言生成NLG，79页ppt

专知会员服务

37+阅读 · 2021年2月22日

【Google】深度学习对抗鲁棒性，43页ppt

专知会员服务

45+阅读 · 2020年10月31日

【CIKM2020】神经逻辑推理，Neural Logic Reasoning

【CIKM2020】神经逻辑推理，Neural Logic Reasoning

专知会员服务

51+阅读 · 2020年8月25日

【ST2020硬核课】深度学习即统计学习，50页ppt

【ST2020硬核课】深度学习即统计学习，50页ppt

专知会员服务

67+阅读 · 2020年8月17日

神经网络与形式语言综述，12页pdf，A Survey of Neural Networks and Formal Languages

神经网络与形式语言综述，12页pdf，A Survey of Neural Networks and Formal Languages

专知会员服务

21+阅读 · 2020年6月4日

【微软】大型神经语言模型的对抗性训练，Adversarial Training for Large Neural Language Models

【微软】大型神经语言模型的对抗性训练，Adversarial Training for Large Neural Language Models

专知会员服务

51+阅读 · 2020年5月3日

【理解计算机视觉损失函数】《Understanding Loss Functions in Computer Vision!》by Sowmya Yellapragad

【理解计算机视觉损失函数】《Understanding Loss Functions in Computer Vision!》by Sowmya Yellapragad

专知会员服务

44+阅读 · 2020年3月4日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

Yoshua Bengio，使算法知道“为什么”

Yoshua Bengio，使算法知道“为什么”

专知会员服务

8+阅读 · 2019年10月10日

热门VIP内容

开通专知VIP会员享更多权益服务

【牛津博士论文】零样本强化学习综述

《美军条令：陆军指挥官与规划人员地理空间指南》60页

战术边缘指挥控制：防务面临的核心挑战

迈向开放世界检测：综述

相关资讯

GPT3 论文详解 | GPT-3: Language Models are Few-Shot Learners

GPT3 论文详解 | GPT-3: Language Models are Few-Shot Learners

AINLP

8+阅读 · 2020年6月3日

LibRec 精选：EfficientNet、XLNet 论文及代码实现

LibRec 精选：EfficientNet、XLNet 论文及代码实现

LibRec智能推荐

5+阅读 · 2019年7月9日

ICLR2019最佳论文出炉

ICLR2019最佳论文出炉

专知

12+阅读 · 2019年5月6日

逆强化学习-学习人先验的动机

逆强化学习-学习人先验的动机

CreateAMind

16+阅读 · 2019年1月18日

清华大学NLP组整理的机器翻译论文阅读清单

清华大学NLP组整理的机器翻译论文阅读清单

AINLP

5+阅读 · 2018年12月29日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

Hierarchical Disentangled Representations

Hierarchical Disentangled Representations

CreateAMind

4+阅读 · 2018年4月15日

Capsule Networks解析

Capsule Networks解析

机器学习研究会

11+阅读 · 2017年11月12日

【学习】Hierarchical Softmax

【学习】Hierarchical Softmax

机器学习研究会

4+阅读 · 2017年8月6日

Auto-Encoding GAN

Auto-Encoding GAN

CreateAMind

7+阅读 · 2017年8月4日

相关论文

Exploring Generalization Ability of Pretrained Language Models on Arithmetic and Logical Reasoning

Arxiv

0+阅读 · 2021年10月19日

Schrödinger's Tree -- On Syntax and Neural Language Models

Arxiv

0+阅读 · 2021年10月17日

On the Robustness of Reading Comprehension Models to Entity Renaming

Arxiv

0+阅读 · 2021年10月16日

An Empirical Survey of the Effectiveness of Debiasing Techniques for Pre-Trained Language Models

Arxiv

0+阅读 · 2021年10月16日

RAP: Robustness-Aware Perturbations for Defending against Backdoor Attacks on NLP Models

Arxiv

0+阅读 · 2021年10月15日

Identifying and Mitigating Spurious Correlations for Improving Robustness in NLP Models

Arxiv

2+阅读 · 2021年10月14日

Learning Neural Models for Natural Language Processing in the Face of Distributional Shift

Arxiv

11+阅读 · 2021年9月3日

Attribute-Guided Adversarial Training for Robustness to Natural Perturbations

Arxiv

15+阅读 · 2020年12月3日

Adversarial NLI: A New Benchmark for Natural Language Understanding

Arxiv

4+阅读 · 2019年10月31日

Language Models as Knowledge Bases?

Arxiv

6+阅读 · 2019年9月4日

微信扫码咨询专知VIP会员