Community Question Answering (CQA) platforms contain plenty of CQA texts (i.e., questions and answers corresponding to the question) where named entities appear ubiquitously. In this paper, we define a new task of CQA entity linking (CQAEL) as linking the textual entity mentions detected from CQA texts with their corresponding entities in a knowledge base. This task can facilitate many downstream applications including expert finding and knowledge base enrichment. Traditional entity linking methods mainly focus on linking entities in news documents, and are suboptimal over this new task of CQAEL since they cannot effectively leverage various informative auxiliary data involved in the CQA platform to aid entity linking, such as parallel answers and two types of meta-data (i.e., topic tags and users). To remedy this crucial issue, we propose a novel transformer-based framework to effectively harness the knowledge delivered by different kinds of auxiliary data to promote the linking performance. We validate the superiority of our framework through extensive experiments over a newly released CQAEL data set against state-of-the-art entity linking methods.
翻译:社区问题回答平台(CQA)包含大量CQA文本(即与问题相对应的问答),名称实体无处不在,本文将CQA实体链接(CQAEL)的新任务定义为将文本实体提到的CQA文本与知识库中相应实体连接起来。这项任务可以促进许多下游应用,包括专家发现和知识库的丰富。连接传统实体的方法主要侧重于新闻文件实体的链接,并且比CQAEL的这一新任务更不理想,因为它们无法有效地利用CQA平台中涉及的各种信息辅助数据协助实体的链接,例如平行答案和两种类型的元数据(即专题标签和用户)。为了解决这一关键问题,我们提议了一个新的基于变异器的框架,以便有效地利用不同辅助数据提供的知识促进联系业绩。我们通过对新发布的CQEL数据组进行的广泛试验,验证我们框架的优越性。