基于图图的神经网络探测在线社交媒体上可疑用户的方法 (A Graph Neural Network based approach for detecting Suspicious Users on Online Social Media)

Online Social Media platforms (such as Twitter and Facebook) are extensively used for spreading the news to a wider public effortlessly at a rapid pace. However, now a days these platforms are also used with an aim of spreading rumors and fake news to a large audience in a short time span that can cause panic, fear, and financial loss to society. Thus, it is important to detect and control these rumors before it spreads to the masses. One way to control the spread of these rumors is by identifying possible suspicious users who are often involved in spreading the rumors. Our basic assumption is that the users who are often involved in spreading rumors are more likely to be suspicious in contrast to the users whose involvement in spreading rumors are less. This is due to the fact that sometimes, users may posts the rumor tweets by accident. In this paper, we use PHEME rumor tweet dataset which contains rumor and non-rumor tweets information on five incidents, that is, i) Charlie hebdo, ii)German wings crash, iii)Ottawa shooting, iv)Sydney siege, and v)Ferguson. We transform this rumor tweets dataset into suspicious users dataset before leveraging Graph Neural Network (GNN) based approach for identifying suspicious users. Specifically, we explore Graph Convolutional Network (GCN),which is a type of GNN, for identifying suspicious users and then we compare GCN results with the other three approaches which act as baseline approaches: SVM, RF and LSTM based deep learning architecture. Extensive experiments performed on real-world dataset, where we achieve up to 0.864 value for F1-Score and 0.720 value for AUC ROC, shows the effectiveness of GNN based approach for identifying suspicious users.

翻译：社交媒体在线平台(如Twitter和Facebook)被广泛用于将消息传播给更广大的公众,且速度不快。然而,如今,这些平台也被用来在很短的时间内将流言和假消息传播给广大观众,这可能会给社会造成恐慌、恐惧和金融损失。因此,在流言传播到大众之前,必须检测和控制这些流言。控制这些流言传播的一个办法是查明经常参与散布谣言的可能可疑用户。我们的基本假设是,经常参与散布谣言的用户更有可能与参与散布谣言的用户相比产生怀疑。这是因为,有时用户可能会在短短的时间内将谣言和假消息传播给广大观众造成恐慌、恐惧和金融损失。因此,我们使用PHEM的传言推文数据集,其中包含关于五起事件的流言和非争议性推文信息。即,Charlie hebdo,ii)德国的翅膀崩溃,iii)Ottawa拍摄,iv)Syney围困,以及varson的用户在传播谣言中更有可能。我们把这个流言推文推介数据系统在G型的网络上,我们用GOI-G数据库的直判数据,我们用直判的用户在GOLOLLA上,我们用来定位数据库的模型上,我们用直判数据搜索的计算数据。