The explosion of disinformation accompanying the COVID-19 pandemic has overloaded fact-checkers and media worldwide, and brought a new major challenge to government responses worldwide. Not only is disinformation creating confusion about medical science amongst citizens, but it is also amplifying distrust in policy makers and governments. To help tackle this, we developed computational methods to categorise COVID-19 disinformation. The COVID-19 disinformation categories could be used for a) focusing fact-checking efforts on the most damaging kinds of COVID-19 disinformation; b) guiding policy makers who are trying to deliver effective public health messages and counter effectively COVID-19 disinformation. This paper presents: 1) a corpus containing what is currently the largest available set of manually annotated COVID-19 disinformation categories; 2) a classification-aware neural topic model (CANTM) designed for COVID-19 disinformation category classification and topic discovery; 3) an extensive analysis of COVID-19 disinformation categories with respect to time, volume, false type, media type and origin source.
翻译:伴随COVID-19大流行的虚假信息爆炸给全世界范围的事实检查员和媒体带来了过多的负担,给全球各国政府的反应带来了新的重大挑战。不仅假信息给公民造成了医学方面的混乱,而且还扩大了决策者和政府中的不信任。为了解决这个问题,我们制定了计算方法,将COVID-19大流行的虚假信息分类。COVID-19大流行的虚假信息类别可用于:(a) 将事实检查工作重点放在COVID-19大流行最具破坏性的信息类别上;(b) 指导决策者努力提供有效的公共卫生信息并有效对抗COVID-19大坏信息。本文介绍:(1) 载有目前最大的手动的一组COVID-19大不真实信息类别;(2) 为COVID-19大坏信息分类和专题发现设计的分类-认知神经主题模型(CANTM);(3) 广泛分析COVID-19坏信息类别,涉及时间、数量、假型、媒体类型和来源。