As social media grows faster, harassment becomes more prevalent which leads to considered fake detection a fascinating field among researchers. The graph nature of data with the large number of nodes caused different obstacles including a considerable amount of unrelated features in matrices as high dispersion and imbalance classes in the dataset. To deal with these issues Auto-encoders and a combination of semi-supervised learning and the GAN algorithm which is called SGAN were used. This paper is deploying a smaller number of labels and applying SGAN as a classifier. The result of this test showed that the accuracy had reached 91\% in detecting fake accounts using only 100 labeled samples.
翻译:随着社交媒体的增长速度加快,骚扰现象变得更加普遍,导致研究人员认为假发现一个令人着迷的领域。数据图的性质加上大量节点,造成了不同的障碍,包括数据集中高度分散和不平衡类的矩阵中大量不相干特征。为了处理这些问题,使用了自动编码器以及半监督学习和称为SGAN的GAN算法。本文正在部署少量标签,并应用SGAN作为分类器。这一测试的结果显示,在仅使用100个标签样本来探测假账户时,准确性达到了91 ⁇ 。