The spread of rumors along with breaking events seriously hinders the truth in the era of social media. Previous studies reveal that due to the lack of annotated resources, rumors presented in minority languages are hard to be detected. Furthermore, the unforeseen breaking events not involved in yesterday's news exacerbate the scarcity of data resources. In this work, we propose a novel zero-shot framework based on prompt learning to detect rumors falling in different domains or presented in different languages. More specifically, we firstly represent rumor circulated on social media as diverse propagation threads, then design a hierarchical prompt encoding mechanism to learn language-agnostic contextual representations for both prompts and rumor data. To further enhance domain adaptation, we model the domain-invariant structural features from the propagation threads, to incorporate structural position representations of influential community response. In addition, a new virtual response augmentation method is used to improve model training. Extensive experiments conducted on three real-world datasets demonstrate that our proposed model achieves much better performance than state-of-the-art methods and exhibits a superior capacity for detecting rumors at early stages.
翻译:在社交媒体时代,谣言传播阻碍了真相的传播。以前的研究表明,由于缺乏标注资源,少数语言中出现的谣言难以检测。此外,未知的突发事件加剧了数据资源的稀缺性。 在这项工作中,我们提出了一个基于Prompt学习的新型零样本框架,用于检测不同领域或用不同语言呈现的谣言。具体而言,我们首先将社交媒体上传播的谣言表示为多样化的传播线程,然后设计了一种层次化的Prompt编码机制,用于学习对Prompt和谣言数据的语言无关的情境表示。为进一步增强域自适应性,我们从传播线程中建模领域不变的结构特征,以整合有影响力的社区回应的结构位置表示。此外,我们还使用了新的虚拟回应增强方法来改进模型训练。对三个真实世界的数据集进行的广泛实验表明,我们提出的模型比现有方法表现更好,并具有更强大的检测谣言的能力。