Massive false rumors emerging along with breaking news or trending topics severely hinder the truth. Existing rumor detection approaches achieve promising performance on the yesterday`s news, since there is enough corpus collected from the same domain for model training. However, they are poor at detecting rumors about unforeseen events especially those propagated in different languages due to the lack of training data and prior knowledge (i.e., low-resource regimes). In this paper, we propose an adversarial contrastive learning framework to detect rumors by adapting the features learned from well-resourced rumor data to that of the low-resourced. Our model explicitly overcomes the restriction of domain and/or language usage via language alignment and a novel supervised contrastive training paradigm. Moreover, we develop an adversarial augmentation mechanism to further enhance the robustness of low-resource rumor representation. Extensive experiments conducted on two low-resource datasets collected from real-world microblog platforms demonstrate that our framework achieves much better performance than state-of-the-art methods and exhibits a superior capacity for detecting rumors at early stages.
翻译:与突发新闻或趋势论题同时出现的大规模虚假流言严重妨碍真理。现有的传闻探测方法在昨天的新闻上取得了有希望的表现,因为从同一领域收集到的样本足以用于示范培训。然而,由于缺乏培训数据和先前的知识(即低资源制度),它们无法发现关于意外事件的谣言,特别是因缺乏培训数据和先前知识(以不同语言传播的谣言)而以不同语言传播的谣言。在本文中,我们提出一个对抗式对比性学习框架,通过将资源丰富的传闻数据所学到的流言与资源低的流言相适应,来探测流言。我们的模式明确克服了通过语言校正和新颖的受监督的对比性培训模式对域和(或)语言使用的限制。此外,我们开发了一种对抗性强化机制,以进一步加强低资源流言代表性的稳健性。对从真实世界微博客平台收集的两种低资源数据集进行的广泛实验表明,我们的框架在早期发现谣言方面比国家方法做得好得多。