Check-worthy Claim Detection across Topics for Automated Fact-checking

An important component of an automated fact-checking system is claim check-worthiness detection, which ranks sentences by how much they need to be checked. Despite a body of research on the task, previous work has overlooked how challenging it is to identify check-worthy claims across different topics. In this paper, we assess and quantify the challenge of detecting check-worthy claims for new, unseen topics. After highlighting the problem, we propose the AraCWA model to mitigate the performance deterioration when detecting check-worthy claims across topics. The AraCWA model boosts performance on new topics by incorporating two components for few-shot learning and data augmentation. Using a publicly available dataset of Arabic tweets covering 14 different topics, we demonstrate that our proposed data augmentation strategy achieves substantial improvements overall, although the extent of the improvement varies across topics. Further, we analyse the semantic similarities between topics, suggesting that the similarity metric could be used as a proxy for the difficulty of an unseen topic before undertaking the task of labelling its sentences.
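The idea of using topic similarity as a difficulty proxy can be illustrated with a minimal sketch. The abstract does not say which similarity metric or text representation is used, so everything below is an assumption made for illustration: topics are represented as plain term-frequency vectors, similarity is cosine similarity, and the function names (`topic_vector`, `cosine_similarity`, `rank_seen_topics`) are hypothetical. A real system would more likely compare sentence embeddings.

```python
import math
from collections import Counter


def topic_vector(sentences):
    """Build a term-frequency vector for a topic (assumed representation)."""
    counts = Counter()
    for sentence in sentences:
        counts.update(sentence.lower().split())
    return counts


def cosine_similarity(a, b):
    """Cosine similarity between two sparse term-frequency vectors."""
    dot = sum(a[term] * b[term] for term in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def rank_seen_topics(unseen_sentences, seen_topics):
    """Rank seen topics by similarity to an unseen topic, most similar first.

    seen_topics maps a topic name to a list of sentences (hypothetical layout).
    """
    unseen_vec = topic_vector(unseen_sentences)
    scores = {
        name: cosine_similarity(unseen_vec, topic_vector(sentences))
        for name, sentences in seen_topics.items()
    }
    return sorted(scores.items(), key=lambda item: item[1], reverse=True)
```

Under this sketch, an unseen topic whose best similarity score against all seen topics is low would be flagged as likely harder, which is the kind of signal the abstract suggests could be checked before any labelling effort is spent.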

