Nowadays, People prefer to follow the latest news on social media, as it is cheap, easily accessible, and quickly disseminated. However, it can spread fake or unreliable, low-quality news that intentionally contains false information. The spread of fake news can have a negative effect on people and society. Given the seriousness of such a problem, researchers did their best to identify patterns and characteristics that fake news may exhibit to design a system that can detect fake news before publishing. In this paper, we have described the Fake News Challenge stage #1 (FNC-1) dataset and given an overview of the competitive attempts to build a fake news detection system using the FNC-1 dataset. The proposed model was evaluated with the FNC-1 dataset. A competitive dataset is considered an open problem and a challenge worldwide. This system's procedure implies processing the text in the headline and body text columns with different natural language processing techniques. After that, the extracted features are reduced using the elbow truncated method, finding the similarity between each pair using the soft cosine similarity method. The new feature is entered into CNN and DNN deep learning approaches. The proposed system detects all the categories with high accuracy except the disagree category. As a result, the system achieves up to 84.6 % accuracy, classifying it as the second ranking based on other competitive studies regarding this dataset.
翻译:目前,人们更愿意在社交媒体上关注最新消息,因为它是廉价的、容易获得的和迅速传播的。然而,它可以传播虚假或不可靠、低质量的、故意含有虚假信息的新闻。假新闻的传播可能会对人民和社会产生负面影响。鉴于这一问题的严重性,研究人员尽力查明假新闻可能展示的模式和特点,以设计一个能够在出版前检测假新闻的系统。在本文中,我们描述了假新闻挑战第1阶段(FNC-1)的数据集,并概述了利用FNC-1数据集建立假新闻探测系统的竞争性尝试。提议的模型是用FNC-1数据集来评估的。竞争性数据集被视为一个开放的问题和全世界的挑战。这个系统的程序意味着用不同的自然语言处理技术处理头条和正文的文本列中的文字。此后,所提取的特征使用手肘脱节的方法减少了,用软调调相方法查找每对对夫妇之间的相似性。新的特征被输入CNNC和DNNNNE深级学习方法。拟议的模型是用FNC-1数据集来评估的。竞争性数据集被视为一个开放性和挑战性的问题。这个系统的所有类别,但根据高精度研究,只有高分级,只有84级的分类,才能对它进行检索。