The rapid development of social media changes the lifestyle of people and simultaneously provides an ideal place for publishing and disseminating rumors, which severely exacerbates social panic and triggers a crisis of social trust. Early content-based methods focused on finding clues from the text and user profiles for rumor detection. Recent studies combine the stances of users' comments with news content to capture the difference between true and false rumors. Although the user's stance is effective for rumor detection, the manual labeling process is time-consuming and labor-intensive, which limits the application of utilizing it to facilitate rumor detection. In this paper, we first finetune a pre-trained BERT model on a small labeled dataset and leverage this model to annotate weak stance labels for users' comment data to overcome the problem mentioned above. Then, we propose a novel Stance-aware Reinforcement Learning Framework (SRLF) to select high-quality labeled stance data for model training and rumor detection. Both the stance selection and rumor detection tasks are optimized simultaneously to promote both tasks mutually. We conduct experiments on two commonly used real-world datasets. The experimental results demonstrate that our framework outperforms the state-of-the-art models significantly, which confirms the effectiveness of the proposed framework.
翻译:社交媒体的迅速发展改变了人们的生活方式,同时也为出版和传播流言提供了一个理想的场所,这严重加剧了社会恐慌,引发了社会信任危机。早期内容基础方法侧重于从文本和用户简介中找到线索,以便发现谣言。最近的研究将用户的评论立场与新闻内容结合起来,以捕捉真实和虚假谣言之间的差异。虽然用户的姿态对于发现谣言是有效的,但人工标签过程耗费时间和劳力,限制了利用它来帮助发现谣言的运用。在本文中,我们首先对一个经过预先训练的BERT模型做了微小标签,并利用这一模型来说明用户评论数据的薄弱定位标签,以克服上述问题。然后,我们提出一个新的创新的Stance-aware加强学习框架(SRLF),以选择高质量、有标签的姿态数据进行示范培训和谣言检测。同时优化了立场选择和谣言检测任务,以推进这两项任务。我们在两个常用的现实数据集上进行了实验。实验结果显示,我们的框架大大超越了拟议框架的有效性。