Recent work has demonstrated the viability of using crowdsourcing as a tool for evaluating the truthfulness of public statements. Under certain conditions such as: (1) having a balanced set of workers with different backgrounds and cognitive abilities; (2) using an adequate set of mechanisms to control the quality of the collected data; and (3) using a coarse grained assessment scale, the crowd can provide reliable identification of fake news. However, fake news are a subtle matter: statements can be just biased ("cherrypicked"), imprecise, wrong, etc. and the unidimensional truth scale used in existing work cannot account for such differences. In this paper we propose a multidimensional notion of truthfulness and we ask the crowd workers to assess seven different dimensions of truthfulness selected based on existing literature: Correctness, Neutrality, Comprehensibility, Precision, Completeness, Speaker's Trustworthiness, and Informativeness. We deploy a set of quality control mechanisms to ensure that the thousands of assessments collected on 180 publicly available fact-checked statements distributed over two datasets are of adequate quality, including a custom search engine used by the crowd workers to find web pages supporting their truthfulness assessments. A comprehensive analysis of crowdsourced judgments shows that: (1) the crowdsourced assessments are reliable when compared to an expert-provided gold standard; (2) the proposed dimensions of truthfulness capture independent pieces of information; (3) the crowdsourcing task can be easily learned by the workers; and (4) the resulting assessments provide a useful basis for a more complete estimation of statement truthfulness.
翻译:最近的工作表明,利用众包作为评估公开声明真实性的工具是可行的,在以下某些条件下,如:(1) 拥有一组具有不同背景和认知能力的均衡工人;(2) 使用一套适当的机制来控制所收集数据的质量;(3) 使用粗糙的粮食评估尺度,人群可以可靠地识别假新闻;然而,假新闻是一个微妙的问题:现有工作中使用的公开证据作为评估公开声明真实性的工具,其内容不准确、不准确、错误等等,以及简单真实的估算尺度不能说明这种差异。在本文件中,我们提出了一个多层面的真实性概念,要求人群工人评估根据现有文献选择的七个不同层面的真实性:正确性、中立性、兼容性、准确性、准确性、完整性、完整性、演讲者的信任性、以及知情性。我们设置一套质量控制机制,以确保在公开提供的180份经过事实核对的完整报表中收集的数千份评估能够达到充分的质量,包括由人群工人使用的定制搜索引擎,以找到支持其真实性评估的网页。 () 进行关于众组工人的拟议数据评估,通过提供数据分析,从而得出可靠的数据,从而进行独立的分析。