通过反反竞争学习探测微博客微博客上关于低资源域域的流言 (Detect Rumors in Microblog Posts for Low-Resource Domains via Adversarial Contrastive Learning)

Massive false rumors emerging along with breaking news or trending topics severely hinder the truth. Existing rumor detection approaches achieve promising performance on the yesterday's news, since there is enough corpus collected from the same domain for model training. However, they are poor at detecting rumors about unforeseen events especially those propagated in different languages due to the lack of training data and prior knowledge (i.e., low-resource regimes). In this paper, we propose an adversarial contrastive learning framework to detect rumors by adapting the features learned from well-resourced rumor data to that of the low-resourced. Our model explicitly overcomes the restriction of domain and/or language usage via language alignment and a novel supervised contrastive training paradigm. Moreover, we develop an adversarial augmentation mechanism to further enhance the robustness of low-resource rumor representation. Extensive experiments conducted on two low-resource datasets collected from real-world microblog platforms demonstrate that our framework achieves much better performance than state-of-the-art methods and exhibits a superior capacity for detecting rumors at early stages.

翻译：与突发新闻或趋势论题同时出现的大规模虚假流言严重妨碍真理。现有传闻探测方法在昨天的新闻上取得了有希望的表现,因为从同一领域收集到的样本足以用于示范培训。然而,由于缺乏培训数据和先前的知识(即低资源制度),它们无法发现关于意外事件的流言,特别是因缺乏培训数据和先前知识(以不同语言传播的流言)而以不同语言传播的流言。在本文中,我们建议建立一个对抗式对比学习框架,通过将资源丰富的传闻数据所学到的流言与资源低的流言相适应来检测流言。我们的模式明确克服了通过语言校正和新颖的监管对比培训模式对域和(或)语言使用的限制。此外,我们开发了一种对抗性强化机制,以进一步提高低资源流言代表性的稳健性。对从真实世界微博客平台收集的两套低资源数据集进行的广泛实验表明,我们的框架在早期发现谣言的能力方面比国家先进得多。