Voat.co was a news aggregator website that shut down on December 25, 2020. The site had a troubled history and was known for hosting various banned subreddits. This paper presents a dataset with over 2.3M submissions and 16.2M comments posted from 113K users in 7.1K subverses (the equivalent of subreddit for Voat). Our dataset covers the whole lifetime of Voat, from its developing period starting on November 8, 2013, the day it was founded, April 2014, up until the day it shut down (December 25, 2020). This work presents the largest and most complete publicly available Voat dataset, to the best of our knowledge. Along with the release of this dataset, we present a preliminary analysis covering posting activity and daily user and subverse registration on the platform so that researchers interested in our dataset can know what to expect. Our data may prove helpful to false news dissemination studies as we analyze the links users share on the platform, finding that many communities rely on alternative news press, like Breitbart and GatewayPundit, for their daily discussions. In addition, we perform network analysis on user interactions finding that many users prefer not to interact with subverses outside their narrative interests, which could be helpful to researchers focusing on polarization and echo chambers. Also, since Voat was one of the platforms banned Reddit communities migrated to, we are confident our dataset will motivate and assist researchers studying deplatforming. Finally, many hateful and conspiratorial communities were very popular on Voat, which makes our work valuable for researchers focusing on toxicity, conspiracy theories, cross-platform studies of social networks, and natural language processing.
翻译:Voat.co是一个新闻聚合网站,于2020年12月25日关闭。这个网站历史混乱,以主办各种被禁止的子编辑网站而著称。这个网站展示了一个数据集,包含超过2.3M提交材料和来自113K用户的16.2M评论,张贴在7.1K子versions(相当于Voat的子编辑)中。我们的数据集覆盖了Voat的整个一生,从2013年11月8日、2014年4月、直至其成立之日(2020年12月25日),直到网站关闭之日(2020年12月25日)为止。这个网站展示了最大和最完整的可公开获取的Voat数据集,以了解我们的知识。除了发布这个数据集外,我们还展示了一个包含发布活动以及每日用户的16.2M评论的数据集和16.2M评论。我们的数据集覆盖了113K用户对我们的数据集感兴趣的研究人员可以知道什么。我们的数据集从2013年11月8日、4月到许多用户在平台上共享的链接,发现许多社区都依赖替代的新闻报道,如Breitbart 和GnetPalPlational-pudd,用于我们日常讨论。此外的网络,此外,我们进行网络分析的网络分析,许多用户的动机分析会让用户更喜欢的理论化的理论循环更像化的理论, 。