Social networks (SNs) are increasingly important sources of news for many people. The online connections made by users allows information to spread more easily than traditional news media (e.g., newspaper, television). However, they also make the spread of fake news easier than in traditional media, especially through the users' social network connections. In this paper, we focus on investigating if the SNs' users connection structure can aid fake news detection on Twitter. In particular, we propose to embed users based on their follower or friendship networks on the Twitter platform, so as to identify the groups that users form. Indeed, by applying unsupervised graph embedding methods on the graphs from the Twitter users' social network connections, we observe that users engaged with fake news are more tightly clustered together than users only engaged in factual news. Thus, we hypothesise that the embedded user's network can help detect fake news effectively. Through extensive experiments using a publicly available Twitter dataset, our results show that applying graph embedding methods on SNs, using the user connections as network information, can indeed classify fake news more effectively than most language-based approaches. Specifically, we observe a significant improvement over using only the textual information (i.e., TF.IDF or a BERT language model), as well as over models that deploy both advanced textual features (i.e., stance detection) and complex network features (e.g., users network, publishers cross citations). We conclude that the Twitter users' friendship and followers network information can significantly outperform language-based approaches, as well as the existing state-of-the-art fake news detection models that use a more sophisticated network structure, in classifying fake news on Twitter.
翻译:社会网络(SNS)是许多人越来越重要的新闻来源。 用户的在线连接使得信息比传统新闻媒体(例如报纸、电视)更容易传播。 但是,它们也使得假新闻比传统媒体更容易传播,特别是通过用户的社会网络连接。 在本文中,我们侧重于调查SNS用户的用户连接结构是否能帮助在推特上进行假新闻探测。 特别是,我们提议在Twitter平台上以其追随者或友情网络为基础将用户嵌入用户,从而识别用户组成的群体。 事实上,通过在Twitter用户的社会网络连接的图表中应用不受监督的图形嵌入方法,我们发现与假新闻打交道的用户比传统媒体更加紧密的集群,尤其是通过用户的社会网络连接。因此,我们假设嵌入的用户网络能够帮助有效地检测假新闻。 通过使用一个公开的Twitter数据集,我们的结果显示,使用用户连接网络的图表方法,事实上可以比大多数基于语言的Twitter方式更有效地将假新闻分类。 具体地说,我们观察一个显著改进的版本的网络,作为文本的版本, 也只能通过BTF 。