Previous studies have highlighted the importance of vaccination as an effective strategy to control the transmission of the COVID-19 virus. It is crucial for policymakers to have a comprehensive understanding of the public's stance towards vaccination on a large scale. However, attitudes towards COVID-19 vaccination, such as pro-vaccine or vaccine hesitancy, have evolved over time on social media. Thus, it is necessary to account for possible temporal shifts when analysing these stances. This study aims to examine the impact of temporal concept drift on stance detection towards COVID-19 vaccination on Twitter. To this end, we evaluate a range of transformer-based models using chronological and random splits of social media data. Our findings demonstrate significant discrepancies in model performance when comparing random and chronological splits across all monolingual and multilingual datasets. Chronological splits significantly reduce the accuracy of stance classification. Therefore, real-world stance detection approaches need to be further refined to incorporate temporal factors as a key consideration.
翻译:先前的研究已经强调了疫苗接种作为控制 COVID-19 病毒传播的有效策略的重要性。了解大众对疫苗接种的立场是政策制定者必须具备的全面认知。然而,社交媒体上关于 COVID-19 疫苗接种的态度,例如赞成疫苗或疫苗犹豫,随着时间的推移而发生变化。因此,在分析这些立场时,需要考虑时间性的可能性。本研究旨在检查时间概念漂移对 Twitter 上针对 COVID-19 疫苗接种立场检测的影响。为此,我们使用社交媒体数据的时间序列和随机分割评估了一系列基于 transformer 的模型。我们的研究结果显示,在全部单语和多语数据集中,随机分割和时间分割的模型性能存在显著差异。时间分割明显降低了立场分类的准确性。因此,现实中立场检测方法需要进一步改善,将时间因素作为关键因素之一纳入考虑范畴。