In this modern era, communication has become faster and easier. This means fallacious information can spread as fast as reality. Considering the damage that fake news kindles on the psychology of people and the fact that such news proliferates faster than truth, we need to study the phenomenon that helps spread fake news. An unbiased data set that depends on reality for rating news is necessary to construct predictive models for its classification. This paper describes the methodology to create such a data set. We collect our data from snopes.com which is a fact-checking organization. Furthermore, we intend to create this data set not only for classification of the news but also to find patterns that reason the intent behind misinformation. We also formally define an Internet Claim, its credibility, and the sentiment behind such a claim. We try to realize the relationship between the sentiment of a claim with its credibility. This relationship pours light on the bigger picture behind the propagation of misinformation. We pave the way for further research based on the methodology described in this paper to create the data set and usage of predictive modeling along with research-based on psychology/mentality of people to understand why fake news spreads much faster than reality.
翻译:在这个现代时代,通讯变得更快、更容易。这意味着错误的信息可以像现实一样迅速传播。考虑到假新闻对人们心理的伤害,以及这种新闻传播速度快于真理的事实,我们需要研究有助于传播假新闻的现象。一个依赖现实的公正的数据组对于建立评级新闻的分类模型是必要的。本文描述了创建这样一个数据集的方法。我们从Snope.com收集了数据,这是一个事实审查组织。此外,我们打算不仅为新闻分类,而且为寻找错误意图背后的动机所根据的模式而创建这一数据集。我们还正式界定了互联网的主张、其可信性以及这种主张背后的情绪。我们试图认识到主张及其可信度之间的关系。这种关系将揭示出错误信息传播背后的大图景。我们为根据本文描述的方法进行进一步的研究铺平了道路,以创建数据组和使用预测模型,同时根据人们的心理学/思想进行研究,了解假新闻为何传播的速度远远快于现实。