Nowadays, with the rise of Internet access and mobile devices around the globe, more people are using social networks for collaboration and receiving real-time information. Twitter, the microblogging that is becoming a critical source of communication and news propagation, has grabbed the attention of spammers to distract users. So far, researchers have introduced various defense techniques to detect spams and combat spammer activities on Twitter. To overcome this problem, in recent years, many novel techniques have been offered by researchers, which have greatly enhanced the spam detection performance. Therefore, it raises a motivation to conduct a systematic review about different approaches of spam detection on Twitter. This review focuses on comparing the existing research techniques on Twitter spam detection systematically. Literature review analysis reveals that most of the existing methods rely on Machine Learning-based algorithms. Among these Machine Learning algorithms, the major differences are related to various feature selection methods. Hence, we propose a taxonomy based on different feature selection methods and analyses, namely content analysis, user analysis, tweet analysis, network analysis, and hybrid analysis. Then, we present numerical analyses and comparative studies on current approaches, coming up with open challenges that help researchers develop solutions in this topic.
翻译:目前,随着全球互联网接入和移动装置的兴起,越来越多的人正在利用社交网络进行协作并获得实时信息。Twitter,微博客正在成为通信和新闻传播的重要来源,吸引了垃圾邮件用户的注意。到目前为止,研究人员采用了各种防御技术来探测垃圾邮件和打击Twitter上的垃圾邮件活动。为了克服这一问题,近年来研究人员提供了许多新颖技术,大大提高了垃圾邮件检测的性能。因此,它激发了对Twitter上垃圾邮件检测的不同方法进行系统审查的动力。这次审查侧重于系统地比较Twitter垃圾邮件检测的现有研究技术。文献审查分析显示,大多数现有方法都依赖于基于机器学习的算法。在这些机器学习算法中,主要差异与各种特征选择方法有关。因此,我们建议基于不同地物选择方法和分析,即内容分析、用户分析、推特分析、网络分析、混合分析以及混合分析,进行分类。然后,我们提出对当前方法进行数字分析和比较研究,提出挑战,帮助研究人员制定这一专题的解决办法。