已签字的传播活动侦查的残余因素 (Signed Latent Factors for Spamming Activity Detection)

Due to the increasing trend of performing spamming activities (e.g., Web spam, deceptive reviews, fake followers, etc.) on various online platforms to gain undeserved benefits, spam detection has emerged as a hot research issue. Previous attempts to combat spam mainly employ features related to metadata, user behaviors, or relational ties. These works have made considerable progress in understanding and filtering spamming campaigns. However, this problem remains far from fully solved. Almost all the proposed features focus on a limited number of observed attributes or explainable phenomena, making it difficult for existing methods to achieve further improvement. To broaden the vision about solving the spam problem and address long-standing challenges (class imbalance and graph incompleteness) in the spam detection area, we propose a new attempt of utilizing signed latent factors to filter fraudulent activities. The spam-contaminated relational datasets of multiple online applications in this scenario are interpreted by the unified signed network. Two competitive and highly dissimilar algorithms of latent factors mining (LFM) models are designed based on multi-relational likelihoods estimation (LFM-MRLE) and signed pairwise ranking (LFM-SPR), respectively. We then explore how to apply the mined latent factors to spam detection tasks. Experiments on real-world datasets of different kinds of Web applications (social media and Web forum) indicate that LFM models outperform state-of-the-art baselines in detecting spamming activities. By specifically manipulating experimental data, the effectiveness of our methods in dealing with incomplete and imbalanced challenges is valida

翻译：由于在各种在线平台上开展垃圾活动(如网络垃圾邮件、欺骗性评论、假追随者等)的趋势日益增长,以获得当之无愧的收益,垃圾检测已成为一个热研究问题。以前打击垃圾的尝试主要采用与元数据、用户行为或关联关系有关的特征。这些工程在理解和过滤垃圾信息运动方面取得了相当大的进展。然而,这一问题仍然远远没有完全解决。几乎所有拟议的特征都侧重于有限的观测到的属性或可解释的现象,使现有方法难以进一步改进。为了扩大解决垃圾检测领域垃圾问题和应对长期挑战(阶级不平衡和图不完全性)的愿景,我们建议重新尝试利用已签字的潜在因素来过滤欺诈活动。这一情景中多种在线应用的受垃圾污染的关系数据集由统一签署的网络加以解释。两种具有竞争力和高度不完全不完全不相同的潜在因素采矿模型是根据多种关系可能性估算(LFM-MRL)和签定的遥感实验室(MRM-M-M-S-S-Servial-Servical-Servical-Servical-Servical-deal-romocal-mogrational-mocal-mocal-mocal-mocal-mocal-mocal-mocal-mocal-mocal-mod-mod-modal-mod-mod-mod-mod-mod-modal-mod-modal-modal-mod-mod-modal-mod-la-modal-mod-mod-mod-mod-mod-mod-mod-mod-mod-mod-mod-mod-modal-mod-mod-mod-mod-mod-mod-mod-mod-mod-mod-mod-mod-mod-mod-modal-mod-mod-mod-mod-mod-mod-mod-mod-mod-mod-mod-mod-mod-mod-mod-mod-mod-mod-mod-mod-mod-mod-mod-mod-mod-