Commonsense knowledge is essential for many AI applications, including those in natural language processing, visual processing, and planning. Consequently, many sources that include commonsense knowledge have been designed and constructed over the past decades. Recently, the focus has been on large text-based sources, which facilitate easier integration with neural (language) models and application to textual tasks, typically at the expense of the semantics of the sources and their harmonization. Efforts to consolidate commonsense knowledge have yielded partial success, with no clear path towards a comprehensive solution. We aim to organize these sources around a common set of dimensions of commonsense knowledge. We survey a wide range of popular commonsense sources with a special focus on their relations. We consolidate these relations into 13 knowledge dimensions. This consolidation allows us to unify the separate sources and to compute indications of their coverage, overlap, and gaps with respect to the knowledge dimensions. Moreover, we analyze the impact of each dimension on downstream reasoning tasks that require commonsense knowledge, observing that the temporal and desire/goal dimensions are very beneficial for reasoning on current downstream tasks, while distinctness and lexical knowledge have little impact. These results reveal preferences for some dimensions in current evaluation, and potential neglect of others.
翻译:常识知识对于许多AI应用,包括自然语言处理、视觉处理和规划方面的应用至关重要,因此,过去几十年来设计并构建了许多包括常识知识在内的许多来源,最近,重点是大量基于文本的来源,这些来源便于与神经(语言)模型结合,并适用于文字任务,通常牺牲源的语义和统一;巩固常识知识的努力取得了部分成功,没有通往全面解决的明确道路。我们的目标是围绕一套共同的常识知识来组织这些来源。我们调查了广泛的流行性常识来源,特别侧重于它们之间的关系。我们将这些关系合并为13个知识层面。这种合并使我们能够将不同的来源统一起来,并了解它们在知识层面方面的覆盖面、重叠和差距。此外,我们分析了每个层面对需要常识知识的下游推理任务的影响,指出时间和愿望/目标层面对于当前下游任务的推理非常有益,而独特性和法理学知识则影响不大。这些结果揭示了当前评估中某些层面的偏向,以及其它层面的偏好。