The spreading COVID-19 misinformation over social media already draws the attention of many researchers. According to Google Scholar, about 26000 COVID-19 related misinformation studies have been published to date. Most of these studies focusing on 1) detect and/or 2) analysing the characteristics of COVID-19 related misinformation. However, the study of the social behaviours related to misinformation is often neglected. In this paper, we introduce a fine-grained annotated misinformation tweets dataset including social behaviours annotation (e.g. comment or question to the misinformation). The dataset not only allows social behaviours analysis but also suitable for both evidence-based or non-evidence-based misinformation classification task. In addition, we introduce leave claim out validation in our experiments and demonstrate the misinformation classification performance could be significantly different when applying to real-world unseen misinformation.
翻译:在社交媒体上传播的COVID-19错误信息已经引起许多研究人员的注意。据谷歌学者称,迄今为止,已经发表了大约26000份与COVID-19有关的错误信息研究。这些研究大多侧重于(1) 检测和/或(2) 分析COVID-19相关错误信息的特点。然而,对与错误信息有关的社会行为的研究往往被忽视。在本文中,我们引入了一条细微的附加说明的错误信息推文数据集,包括社会行为注释(例如,对错误信息的评论或提问)。数据集不仅允许社会行为分析,而且适合基于证据或非证据的错误信息分类任务。此外,我们还在实验中引入了休假申请证明,并展示错误信息分类在应用现实世界的无形错误信息时表现可能大不相同。