When a sentence does not introduce a discourse entity, Transformer-based models still sometimes refer to it

Understanding longer narratives or participating in conversations requires tracking of discourse entities that have been mentioned. Indefinite noun phrases (NPs), such as 'a dog', frequently introduce discourse entities but this behavior is modulated by sentential operators such as negation. For example, 'a dog' in 'Arthur doesn't own a dog' does not introduce a discourse entity due to the presence of negation. In this work, we adapt the psycholinguistic assessment of language models paradigm to higher-level linguistic phenomena and introduce an English evaluation suite that targets the knowledge of the interactions between sentential operators and indefinite NPs. We use this evaluation suite for a fine-grained investigation of the entity tracking abilities of the Transformer-based models GPT-2 and GPT-3. We find that while the models are to a certain extent sensitive to the interactions we investigate, they are all challenged by the presence of multiple NPs and their behavior is not systematic, which suggests that even models at the scale of GPT-3 do not fully acquire basic entity tracking abilities.
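The abstract does not spell out the stimuli or the metric, so the following is only a minimal sketch of how a minimal-pair comparison of this kind might be set up. It assumes a scoring function that returns log P(continuation | context) under some language model; the names `Scorer`, `prefers_felicitous`, `accuracy`, and `toy_scorer`, as well as the continuation sentences, are hypothetical and not taken from the paper. The toy scorer is a hand-written stand-in so the example runs without a model; in practice it would be replaced by a real model's log-probabilities.

```python
from typing import Callable, Sequence, Tuple

# Hypothetical scorer type: returns log P(continuation | context) under some model.
Scorer = Callable[[str, str], float]

# One item: (affirmative context, negated context, referential continuation).
Item = Tuple[str, str, str]


def prefers_felicitous(score: Scorer, affirmative: str, negated: str,
                       continuation: str) -> bool:
    """True if the continuation is scored higher after the affirmative context
    (where the indefinite NP introduces an entity) than after the negated one."""
    return score(affirmative, continuation) > score(negated, continuation)


def accuracy(score: Scorer, items: Sequence[Item]) -> float:
    """Fraction of items on which the scorer prefers the felicitous pairing."""
    hits = sum(prefers_felicitous(score, *item) for item in items)
    return hits / len(items)


def toy_scorer(context: str, continuation: str) -> float:
    """Stand-in for a language model (an assumption for illustration only):
    penalizes a pronoun-initial continuation after a negated context."""
    negated = "doesn't" in context or "does not" in context
    refers_back = continuation.split()[0].lower() in {"it", "he", "she", "they"}
    return -5.0 if (negated and refers_back) else -1.0


# Illustrative items; only 'Arthur doesn't own a dog' comes from the abstract.
ITEMS = [
    ("Arthur owns a dog.", "Arthur doesn't own a dog.", "It is very friendly."),
    ("Mary has a car.", "Mary doesn't have a car.", "It is red."),
]

if __name__ == "__main__":
    print(accuracy(toy_scorer, ITEMS))
```

Keeping the scorer as a plain callable means the same comparison could be run against different models without changing the evaluation logic.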

