False claims that have been previously fact-checked can still spread on social media. To mitigate their continual spread, detecting previously fact-checked claims is indispensable. Given a claim, existing works focus on providing evidence for detection by reranking candidate fact-checking articles (FC-articles) retrieved by BM25. However, these performances may be limited because they ignore the following characteristics of FC-articles: (1) claims are often quoted to describe the checked events, providing lexical information besides semantics; (2) sentence templates to introduce or debunk claims are common across articles, providing pattern information. Models that ignore the two aspects only leverage semantic relevance and may be misled by sentences that describe similar but irrelevant events. In this paper, we propose a novel reranker, MTM (Memory-enhanced Transformers for Matching) to rank FC-articles using key sentences selected with event (lexical and semantic) and pattern information. For event information, we propose a ROUGE-guided Transformer which is finetuned with regression of ROUGE. For pattern information, we generate pattern vectors for matching with sentences. By fusing event and pattern information, we select key sentences to represent an article and then predict if the article fact-checks the given claim using the claim, key sentences, and patterns. Experiments on two real-world datasets show that MTM outperforms existing methods. Human evaluation proves that MTM can capture key sentences for explanations. The code and the dataset are at https://github.com/ICTMCG/MTM.
翻译:先前经过事实检查的虚假主张仍然可以在社交媒体上传播。 为了减轻持续扩散, 检测先前经过事实检查的主张是不可或缺的。 根据一项主张, 现有的作品侧重于提供证据, 以便通过重新排序候选人的BM25检索到的检查文章( FC- 文章) 来检测。 但是, 这些表现可能有限, 因为它们忽略了FC 文章的以下特性:(1) 常常引用这些主张来描述所检查的事件, 提供词汇信息之外的其他词汇信息; (2) 在各个文章中, 引入或拆解索赔的句式模板很常见, 提供模式信息。 忽略这两个方面只能影响语义相关性的模型, 并且可能被描述类似但无关事件的句子误导。 在本文中, 我们提出一个新的重新排序器, MTM (MM- 匹配的增强变异器), 使用事件( 灵活和语义) 和模式信息, 我们提议一个 ROUG- 指南变换的变换器, 使用当前判词法 。