With the development of social media, social communication has changed. While this facilitates people's communication and access to information, it also provides an ideal platform for spreading rumors. In normal or critical situations, rumors will affect people's judgment and even endanger social security. However, natural language is high-dimensional and sparse, and the same rumor may be expressed in hundreds of ways on social media. As such, the robustness and generalization of the current rumor detection model are put into question. We proposed a novel \textbf{h}ierarchical \textbf{a}dversarial \textbf{t}raining method for \textbf{r}umor \textbf{d}etection (HAT4RD) on social media. Specifically, HAT4RD is based on gradient ascent by adding adversarial perturbations to the embedding layers of post-level and event-level modules to deceive the detector. At the same time, the detector uses stochastic gradient descent to minimize the adversarial risk to learn a more robust model. In this way, the post-level and event-level sample spaces are enhanced, and we have verified the robustness of our model under a variety of adversarial attacks. Moreover, visual experiments indicate that the proposed model drifts into an area with a flat loss landscape, leading to better generalization. We evaluate our proposed method on three public rumors datasets from two commonly used social platforms (Twitter and Weibo). Experiment results demonstrate that our model achieves better results than state-of-the-art methods.
翻译:随着社交媒体的发展,社会沟通已经发生了变化。 虽然这有利于人们的沟通和获取信息, 但它也为传播流言提供了一个理想的平台。 在正常或危急情况下, 流言将影响人们的判断, 甚至危及社会保障。 然而, 自然语言是高维和稀疏的, 而同样的传言也可能在社交媒体上以数百种方式表达。 因此, 当前的传闻检测模式的稳健性和概括性会受到质疑。 与此同时, 我们建议了一个新的 \ textbf{h}rextbf{a}dversarial\ textb{trub{t}rversarial\ textbf{transw}}}rdesraining model. 在正常或危急的情况下, 流言会影响人们的判断, 流言会影响人们的判断。 具体地说, HAT4RD是建立在梯度之上的, 通过在后级和事件级模块的嵌入层中增加对抗性干扰。 同时, 检测者会使用州级的梯流言来尽量减少对抗风险, 以最小化 模型来学习更稳健健的模型 。