A robust and reliable system of detecting spam reviews is a crying need in todays world in order to purchase products without being cheated from online sites. In many online sites, there are options for posting reviews, and thus creating scopes for fake paid reviews or untruthful reviews. These concocted reviews can mislead the general public and put them in a perplexity whether to believe the review or not. Prominent machine learning techniques have been introduced to solve the problem of spam review detection. The majority of current research has concentrated on supervised learning methods, which require labeled data - an inadequacy when it comes to online review. Our focus in this article is to detect any deceptive text reviews. In order to achieve that we have worked with both labeled and unlabeled data and proposed deep learning methods for spam review detection which includes Multi-Layer Perceptron (MLP), Convolutional Neural Network (CNN) and a variant of Recurrent Neural Network (RNN) that is Long Short-Term Memory (LSTM). We have also applied some traditional machine learning classifiers such as Nave Bayes (NB), K Nearest Neighbor (KNN) and Support Vector Machine (SVM) to detect spam reviews and finally, we have shown the performance comparison for both traditional and deep learning classifiers.
翻译:发现垃圾邮件审查的强大而可靠的系统是当今世界迫切需要的,以便购买产品而不必从网上网站欺骗。在许多在线网站,存在张贴审查的选项,从而为假的付费审查或不真实的审查创造空间。这些混杂的审查可以误导公众,使其陷入迷惑之中,使他们难以相信审查。引入了显眼的机器学习技术来解决垃圾邮件审查检测问题。目前大部分研究集中于有监督的学习方法,这需要贴标签的数据,当它出现在网上审查时是不够的。我们在文章中的重点是发现任何欺骗性文本审查。为了实现我们同标签和无标签的数据合作,并提出了深入的垃圾邮件审查检测学习方法,其中包括多Layer Percepron(MLM)、进化神经网络(CNN)和常规神经网络(RNNN)的变体,即长期记忆(LSTM),我们还应用了一些传统的机器学习分类,例如Nave Bayes(NB)、KNeestest Neest 和Sergistral Syal) 和我们所显示的Syal-Systrual-Syal(Neboral)和Supstrual-Syal) 和Sy-Syal-Supciew-Supstrucal-Supciew和Sycal-Supal) 和Supal-Supal-Supal) 和Sy-Neborview-Nebor) 和Supal-S)。