The spread of rumors along with breaking events seriously hinders the truth in the era of social media. Previous studies reveal that due to the lack of annotated resources, rumors presented in minority languages are hard to be detected. Furthermore, the unforeseen breaking events not involved in yesterday's news exacerbate the scarcity of data resources. In this work, we propose a novel zero-shot framework based on prompt learning to detect rumors falling in different domains or presented in different languages. More specifically, we firstly represent rumor circulated on social media as diverse propagation threads, then design a hierarchical prompt encoding mechanism to learn language-agnostic contextual representations for both prompts and rumor data. To further enhance domain adaptation, we model the domain-invariant structural features from the propagation threads, to incorporate structural position representations of influential community response. In addition, a new virtual response augmentation method is used to improve model training. Extensive experiments conducted on three real-world datasets demonstrate that our proposed model achieves much better performance than state-of-the-art methods and exhibits a superior capacity for detecting rumors at early stages.
翻译:在社交媒体时代,流言与破碎事件的传播严重阻碍了真相的真相。先前的研究显示,由于缺乏附加说明的资源,少数民族语言的流言很难被察觉。此外,昨天新闻中未涉及的意外突发事件加剧了数据资源的稀缺性。在这项工作中,我们提出了一个新的零点框架,其基础是迅速学习如何发现不同领域流言或以不同语言出现的流言。更具体地说,我们首先将流言作为多种传播线索在社交媒体上传播,然后设计一个等级化快速编码机制,以学习语言对流言和流言数据的不可知背景表现。为了进一步加强域的适应性,我们从传播线索中模拟域异变结构特征,以纳入有影响力的社区回应的结构性代表。此外,还采用了新的虚拟响应增强方法来改进模式培训。在三个真实世界数据集上进行的广泛实验表明,我们提议的模型的性能比最新方法要好得多,并展示了早期发现传言的先进能力。