COVID-19 misinformation on social media platforms such as twitter is a threat to effective pandemic management. Prior works on tweet COVID-19 misinformation negates the role of semantic features common to twitter such as charged emotions. Thus, we present a novel COVID-19 misinformation model, which uses both a tweet emotion encoder and COVID-19 misinformation encoder to predict whether a tweet contains COVID-19 misinformation. Our emotion encoder was fine-tuned on a novel annotated dataset and our COVID-19 misinformation encoder was fine-tuned on a subset of the COVID-HeRA dataset. Experimental results show superior results using the combination of emotion and misinformation encoders as opposed to a misinformation classifier alone. Furthermore, extensive result analysis was conducted, highlighting low quality labels and mismatched label distributions as key limitations to our study.
翻译:COVID-19的错误信息在Twitter等社交媒体平台上是有效的疫情管理的威胁。以前在推特上的COVID-19错误信息的工作否认了推特的语义特征作用,比如充满感情。因此,我们提出了一种新颖的COVID-19错误信息模型,它使用Tweet情感编码器和COVID-19错误信息编码器来预测一条Tweet是否包含COVID-19错误信息。我们的情感编码器是在新的已注释数据集上进行微调的,COVID-19错误信息编码器是在COVID-HeRA数据集的子集上进行微调的。实验结果表明,使用情感和错误信息编码器的组合优于仅使用错误信息分类器。此外,进行了广泛的结果分析,突出了标签质量低和标签分布不匹配作为我们研究的主要限制。