Our society produces and shares overwhelming amounts of information through the Online Social Networks (OSNs). Within this environment, misinformation and disinformation have proliferated, becoming a public safety concern on every country. Allowing the public and professionals to efficiently find reliable evidence about the factual veracity of a claim is crucial to mitigate this harmful spread. To this end, we propose FacTeR-Check, a multilingual architecture for semi-automated fact-checking that can be used for either the general public but also useful for fact-checking organisations. FacTeR-Check enables retrieving fact-checked information, unchecked claims verification and tracking dangerous information over social media. This architectures involves several modules developed to evaluate semantic similarity, to calculate natural language inference and to retrieve information from Online Social Networks. The union of all these modules builds a semi-automated fact-checking tool able of verifying new claims, to extract related evidence, and to track the evolution of a hoax on a OSN. While individual modules are validated on related benchmarks (mainly MSTS and SICK), the complete architecture is validated using a new dataset called NLI19-SP that is publicly released with COVID-19 related hoaxes and tweets from Spanish social media. Our results show state-of-the-art performance on the individual benchmarks, as well as producing useful analysis of the evolution over time of 61 different hoaxes.
翻译:我们的社会通过在线社会网络(OSNs)制作和分享大量信息。在这个环境中,错误和虚假信息已经扩散,成为每个国家的公共安全问题。允许公众和专业人员高效地找到可靠证据,证明索赔的真实性,对于减轻这种有害传播至关重要。为此,我们提议FacTeR-Check为半自动的实况调查建立一个多语种架构,既可以用于公众,也可以用于事实审查组织。 FacTeR- Check能够检索经核实的事实信息、不受限制的索赔核查和跟踪社会媒体上的危险信息。这个架构涉及几个模块,以评价语义相似性、计算自然语言推论和检索在线社会网络信息。所有这些模块的结合建立了一个半自动化的事实审查工具,能够核实新的索赔,提取相关证据,跟踪对OSNSN的有用组织的演进。在相关基准(主要是MSTS和SICK)上验证了个人模块,而完整的架构正在使用名为NLIS19-SP的新的数据设置来评估语义相似性语言推导和检索系统-SHLISIS-SP的运行结果,以公开显示与SHIIS-IS-IS-IS-IS-IS-IS-IS-IS-IS-Sal相关的成绩分析。