Given the widespread dissemination of inaccurate medical advice related to the 2019 coronavirus pandemic (COVID-19), such as fake remedies, treatments and prevention suggestions, misinformation detection has emerged as an open problem of high importance and interest for the research community. Several works study health misinformation detection, yet little attention has been given to the perceived severity of misinformation posts. In this work, we frame health misinformation as a risk assessment task. More specifically, we study the severity of each misinformation story and how readers perceive this severity, i.e., how harmful a message believed by the audience can be and what type of signals can be used to recognize potentially malicious fake news and detect refuted claims. To address our research questions, we introduce a new benchmark dataset, accompanied by detailed data analysis. We evaluate several traditional and state-of-the-art models and show there is a significant gap in performance when applying traditional misinformation classification models to this task. We conclude with open challenges and future directions.
翻译:由于广泛散发了与2019年冠状病毒(COVID-19)流行有关的不准确的医疗建议,例如假的补救办法、治疗和预防建议,错误信息检测已成为研究界高度重视和感兴趣的一个公开问题。一些工作研究健康错误信息检测,但很少注意错误信息站站的可感严重性。在这项工作中,我们把健康错误信息定为风险评估任务。更具体地说,我们研究每个错误信息故事的严重性,以及读者如何看待这种严重性,即听众所相信的信息可能是多么有害,以及使用何种类型的信号来识别潜在的恶意假消息和被驳斥的主张。为了解决我们的研究问题,我们采用了一套新的基准数据集,并辅以详细的数据分析。我们评估了一些传统和最新模型,并表明在应用传统的错误分类模型来完成这项任务时存在重大差距。我们最后提出了公开的挑战和今后的方向。