Previous work shows that deep NLP models are not always conceptually sound: they do not always learn the correct linguistic concepts. Specifically, they can be insensitive to word order. To systematically evaluate models' conceptual soundness with respect to word order, we introduce a new explanation method for sequential data: Order-sensitive Shapley Values (OSV). We conduct an extensive empirical evaluation to validate the method and to surface how well various deep NLP models learn word order. Using synthetic data, we first show that OSV is more faithful than gradient-based methods in explaining model behavior. Second, applying OSV to the HANS dataset, we discover that a BERT-based NLI model relies only on word occurrence, disregarding word order. Although simple data augmentation improves accuracy on HANS, OSV shows that augmentation does not fundamentally improve the model's learning of word order. Third, we discover that not all sentiment analysis models learn negation properly: some fail to capture the correct syntax of the negation construct. Finally, we show that pretrained language models such as BERT may rely on the absolute positions of subject words to learn long-range subject-verb agreement. For each NLP task, we also demonstrate how OSV can be leveraged to generate adversarial examples.
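As background for OSV, the classical (order-insensitive) Shapley value attributes a model's output to input tokens by averaging each token's marginal contribution over all orderings. The sketch below computes exact Shapley values for a toy coalition game; `toy_value` and the token names are illustrative assumptions, not the paper's models, and the paper's OSV extends this formulation to be sensitive to word order.

```python
from itertools import permutations

def shapley_values(players, value_fn):
    """Exact Shapley values: average each player's marginal
    contribution over every ordering of the players."""
    contrib = {p: 0.0 for p in players}
    orderings = list(permutations(players))
    for order in orderings:
        coalition = set()
        prev = value_fn(coalition)
        for p in order:
            coalition.add(p)
            cur = value_fn(coalition)
            contrib[p] += cur - prev
            prev = cur
    return {p: c / len(orderings) for p, c in contrib.items()}

# Toy "model" score over token subsets (hypothetical): "good" alone
# contributes 0.5, and "not" + "good" together add an interaction of 1.0.
def toy_value(coalition):
    v = 0.5 if "good" in coalition else 0.0
    if "not" in coalition and "good" in coalition:
        v += 1.0
    return v

attr = shapley_values(["not", "good", "the"], toy_value)
# "the" receives zero credit; "not" and "good" split the interaction.
```

Note that because classical Shapley values treat inputs as an unordered set, "not good" and "good not" would receive identical attributions; this is exactly the limitation that an order-sensitive variant must address.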