In this paper, we combine two independent detection methods for identifying fake news: the algorithm VAGO uses semantic rules combined with NLP techniques to measure vagueness and subjectivity in texts, while the classifier FAKE-CLF relies on Convolutional Neural Network classification and supervised deep learning to classify texts as biased or legitimate. We compare the results of the two methods on four corpora. We find a positive correlation between the vagueness and subjectivity measures obtained by VAGO, and the classification of text as biased by FAKE-CLF. The comparison yields mutual benefits: VAGO helps explain the results of FAKE-CLF. Conversely FAKE-CLF helps us corroborate and expand VAGO's database. The use of two complementary techniques (rule-based vs data-driven) proves a fruitful approach for the challenging problem of identifying fake news.
翻译:在本文中,我们结合了两种独立的检测方法来识别假消息:VAGO算法使用语义规则,加上NLP技术来测量文本中的模糊性和主观性,而分类法莱克-CLF依靠进化神经网络的分类和监督深层次的学习来将文本分类为有偏见或合法。我们比较了四个公司两种方法的结果。我们发现VAGO的模糊性和主观性措施与FAGE-CLF对文本的分类之间有正相关关系。比较产生互利:VAGO帮助解释FAKE-CLF的结果。相反,FAKE-CLF帮助我们校验和扩大VAGO的数据库。使用两种互补技术(基于规则的对数据驱动的)证明在识别假新闻的挑战性问题上是一种富有成果的方法。