Vaccine hesitancy, which has recently been driven by online narratives, significantly degrades the efficacy of vaccination strategies, such as those for COVID-19. Despite broad agreement in the medical community about the safety and efficacy of available vaccines, a large number of social media users continue to be inundated with false information about vaccines and are indecisive or unwilling to be vaccinated. The goal of this study is to better understand anti-vaccine sentiment by developing a system capable of automatically identifying the users responsible for spreading anti-vaccine narratives. We introduce a publicly available Python package capable of analyzing Twitter profiles to assess how likely that profile is to share anti-vaccine sentiment in the future. The software package is built using text embedding methods, neural networks, and automated dataset generation and is trained on several million tweets. We find this model can accurately detect anti-vaccine users up to a year before they tweet anti-vaccine hashtags or keywords. We also show examples of how text analysis helps us understand anti-vaccine discussions by detecting moral and emotional differences between anti-vaccine spreaders on Twitter and regular users. Our results will help researchers and policy-makers understand how users become anti-vaccine and what they discuss on Twitter. Policy-makers can utilize this information for better targeted campaigns that debunk harmful anti-vaccination myths.
翻译:尽管医疗界对可用疫苗的安全和功效达成广泛一致,但大量社交媒体用户仍然充斥着有关疫苗的虚假信息,而且缺乏准确性或不愿接种疫苗。本研究的目标是通过开发一个能够自动识别传播抗疫苗说明的用户的系统,更好地了解抗疫苗情绪。我们引入了一个公开提供的Python软件包,能够分析Twitter的概况,以评估该特征在未来交流抗疫苗情绪的可能性。软件包是使用文本嵌入方法、神经网络和自动数据元件生成,并培训数百万次推特。我们发现这一模型可以准确地检测抗疫苗用户到一年后,然后他们就会用反疫苗标签或关键词。我们还展示了文本分析如何帮助我们了解反疫苗讨论的范例,通过检测反疫苗传播者之间的道德和情感差异来评估未来交流反疫苗感感感感情绪情绪情绪情绪情绪情绪情绪。我们发现这个模型是使用文本嵌入方法、神经神经性数据库生成者如何更好地利用我的有害信息。