With the rapid development of automatic fake news detection technology, fact extraction and verification (FEVER) has been attracting more attention. The task aims to extract the most related fact evidences from millions of open-domain Wikipedia documents and then verify the credibility of corresponding claims. Although several strong models have been proposed for the task and they have made great progress, we argue that they fail to utilize multi-view contextual information and thus cannot obtain better performance. In this paper, we propose to integrate multi-view contextual information (IMCI) for fact extraction and verification. For each evidence sentence, we define two kinds of context, i.e. intra-document context and inter-document context}. Intra-document context consists of the document title and all the other sentences from the same document. Inter-document context consists of all other evidences which may come from different documents. Then we integrate the multi-view contextual information to encode the evidence sentences to handle the task. Our experimental results on FEVER 1.0 shared task show that our IMCI framework makes great progress on both fact extraction and verification, and achieves state-of-the-art performance with a winning FEVER score of 72.97% and label accuracy of 75.84% on the online blind test set. We also conduct ablation study to detect the impact of multi-view contextual information. Our codes will be released at https://github.com/phoenixsecularbird/IMCI.
翻译:随着自动假新闻探测技术的迅速发展,事实提取和核查(FEWER)吸引了更多的关注。任务旨在从数百万个开放域维基百科文件中提取最相关的事实证据,然后核查相应主张的可信度。虽然为这项任务提出了几个强有力的模型,而且取得了巨大进展,但我们认为,这些模型没有利用多视角背景信息,因此无法取得更好的业绩。在本文件中,我们提议将多视角背景信息(IMCI)纳入事实提取和核实。对于每一个证据句子,我们定义了两种背景,即文件内部背景和文件间背景。在文档中,文件背景包括文件标题和同一份文件中的所有其他句子。跨文件背景包括来自不同文件的所有其他证据。然后,我们整合了多视角背景信息,以编码证据判决处理这项任务。我们在FEWEl1.0共同任务中的实验结果显示,我们的IMIFI框架在事实提取和核查两方面都取得了巨大进展,并且实现了与FEVElelegalal对72.97 % 和MLALIFILIM 的图像测试结果,还将在网上测试75中完成。