企鹅不会飞：通过实例化和例外推理通用语 (Penguins Don't Fly: Reasoning about Generics through Instantiations and Exceptions)

Generics express generalizations about the world (e.g., birds can fly) that are not universally true (e.g., newborn birds and penguins cannot fly). Commonsense knowledge bases, used extensively in NLP, encode some generic knowledge but rarely enumerate such exceptions and knowing when a generic statement holds or does not hold true is crucial for developing a comprehensive understanding of generics. We present a novel framework informed by linguistic theory to generate exemplars -- specific cases when a generic holds true or false. We generate ~19k exemplars for ~650 generics and show that our framework outperforms a strong GPT-3 baseline by 12.8 precision points. Our analysis highlights the importance of linguistic theory-based controllability for generating exemplars, the insufficiency of knowledge bases as a source of exemplars, and the challenges exemplars pose for the task of natural language inference.

翻译：通用语表达对世界的一般概括（例如，鸟可以飞）并非普遍适用（例如，新生鸟和企鹅不能飞）。常识知识库在NLP中被广泛应用，用于编码一些通用知识，但很少列举这些例外，了解通用语何时成立和不成立对于发展对通用语的全面理解至关重要。我们提出了一个新颖的框架，基于语言学理论生成实例-通用语成立或不成立的具体案例。针对约650个通用语，我们生成了约19k个实例，并展示了我们的框架比强大的GPT-3基线高出12.8个精度点。我们的分析凸显了基于语言学理论的可控性对生成实例的重要性，知识库作为实例来源的不足以及实例对自然语言推理任务的挑战。

相关内容

知识库

关注 65

知识库(Knowledge Base)是知识工程中结构化，易操作，易利用，全面有组织的知识集群，是针对某一(或某些)领域问题求解的需要，采用某种(或若干)知识表示方式在计算机存储器中存储、组织、管理和使用的互相联系的知识片集合。这些知识片包括与领域相关的理论知识、事实数据，由专家经验得到的启发式知识，如某领域内有关的定义、定理和运算法则以及常识性知识等。

【干货书】深度学习合成数据，354页pdf，Synthetic Data for Deep Learning

专知会员服务

104+阅读 · 2022年2月10日

史上最全！358篇机器学习&自然语言处理综述论文！都这儿了

专知会员服务

129+阅读 · 2020年7月18日

【华盛顿大学】知识建模+生成式推理，60页ppt，Cracking Commonsense Intelligence with Knowledge Modeling + Generative Reasoning

专知会员服务

54+阅读 · 2019年12月27日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日