Commonsense knowledge is essential for many AI applications, including those in natural language processing, visual processing, and planning. Consequently, many sources that include commonsense knowledge have been designed and constructed over the past decades. Recently, the focus has been on large text-based sources, which facilitate easier integration with neural (language) models and application on textual tasks, typically at the expense of the semantics of the sources. Such practice prevents the harmonization of these sources, understanding their coverage and gaps, and may hinder the semantic alignment of their knowledge with downstream tasks. Efforts to consolidate commonsense knowledge have yielded partial success, but provide no clear path towards a comprehensive consolidation of existing commonsense knowledge. The ambition of this paper is to organize these sources around a common set of dimensions of commonsense knowledge. For this purpose, we survey a wide range of popular commonsense sources with a special focus on their relations. We consolidate these relations into 13 knowledge dimensions, each abstracting over more specific relations found in sources. This consolidation allows us to unify the separate sources and to compute indications of their coverage, overlap, and gaps with respect to the knowledge dimensions. Moreover, we analyze the impact of each dimension on downstream reasoning tasks that require commonsense knowledge, observing that the temporal and desire/goal dimensions are very beneficial for reasoning on current downstream tasks, while distinctness and lexical knowledge have little impact. These results reveal focus towards some dimensions in current evaluation, and potential neglect of others.
翻译:常识知识对于许多AI应用,包括自然语言处理、视觉处理和规划方面的应用至关重要,因此,过去几十年来设计并构建了许多包括常识知识在内的许多来源,因此,在过去几十年中,设计并构建了许多来源,最近,重点是大量基于文本的来源,这便于与神经(语言)模型结合,并适用于文本任务,通常以来源的语义为代价,这种做法妨碍这些来源的协调统一,理解其覆盖面和差距,并可能妨碍其知识与下游任务之间的语义一致;合并努力巩固常识的努力取得了部分成功,但没有为全面巩固现有常识提供明确的道路;本文件的雄心是要将这些来源组织成一套共同的常识知识层面;为此目的,我们调查范围广泛的各种流行的常识来源,特别注重其关系;我们将这些关系整合成13个知识层面,每个层面都抽象地反映来源的更为具体的关系;这种整合使我们得以统一不同的来源,并汇编其覆盖面、重叠和差距的迹象,从而实现现有常识性知识层面的全面整合;此外,我们对当前常识性知识层面的潜在影响进行组织安排;此外,我们还要分析当前常识度和下游方向的每个层面的影响。