Commonsense knowledge acquisition is a key problem for artificial intelligence. Conventional methods of acquiring commonsense knowledge generally require laborious and costly human annotations, which are not feasible on a large scale. In this paper, we explore a practical way of mining commonsense knowledge from linguistic graphs, with the goal of transferring cheap knowledge obtained with linguistic patterns into expensive commonsense knowledge. The result is a conversion of ASER [Zhang et al., 2020], a large-scale selectional preference knowledge resource, into TransOMCS, of the same representation as ConceptNet [Liu and Singh, 2004] but two orders of magnitude larger. Experimental results demonstrate the transferability of linguistic knowledge to commonsense knowledge and the effectiveness of the proposed approach in terms of quantity, novelty, and quality. TransOMCS is publicly available at: https://github.com/HKUST-KnowComp/TransOMCS.
翻译:获取常识知识是人工智能的一个关键问题。获取常识知识的常规方法通常需要耗费大量人力且费用高昂的人类说明,而这种方法在大规模上是行不通的。在本文件中,我们探讨了一种从语言图中挖掘常识的实用方法,目的是将以语言模式获得的廉价知识转化为昂贵的常识知识。结果将ASER[Zhang等人,2020年]这一大规模选择偏好知识资源转换成TransOMCS,与概念网[Liu和Singh,2004年]具有同样的代表性,但规模更大。实验结果表明语言知识可转让到常识知识,在数量、新颖性和质量方面,拟议方法的有效性。TransOMCS在https://github.com/HKUST-KnowComp/TransOMCS上公开提供。