Community-level bans are a common tool against groups that enable online harassment and harmful speech. Unfortunately, the efficacy of community bans has only been partially studied and with mixed results. Here, we provide a flexible unsupervised methodology to identify in-group language and track user activity on Reddit both before and after the ban of a community (subreddit). We use a simple word frequency divergence to identify uncommon words overrepresented in a given community, not as a proxy for harmful speech but as a linguistic signature of the community. We apply our method to 15 banned subreddits, and find that community response is heterogeneous between subreddits and between users of a subreddit. Top users were more likely to become less active overall, while random users often reduced use of in-group language without decreasing activity. Finally, we find some evidence that the effectiveness of bans aligns with the content of a community. Users of dark humor communities were largely unaffected by bans while users of communities organized around white supremacy and fascism were the most affected. Altogether, our results show that bans do not affect all groups or users equally, and pave the way to understanding the effect of bans across communities.
翻译:社区禁止是针对促成在线骚扰和有害言论的团体的一种常见工具。 不幸的是,社区禁止的效力只经过部分研究,结果也好坏参半。 在这里,我们提供了一种灵活、不受监督的方法,在社区禁令之前和之后(subreddidi)识别群体内语言和跟踪Reddit用户的活动。我们使用简单的单词频率差异来识别特定社区内代表过多的不寻常的单词,不是作为有害言论的代言,而是社区的语言标志。我们对15个被禁止的子改编应用了我们的方法,发现社区反应在子改编和子改编用户之间不尽相同。顶级用户更有可能总体上变得不那么活跃,而随机用户往往在不减少活动的情况下减少使用群体内语言。最后,我们发现一些证据表明,禁令的效力与社区的内容一致。黑暗幽默社区的用户在很大程度上不受禁令的影响,而围绕白人至上和法西斯主义组织的社区的用户则受影响最大。我们的结果表明,禁令并不平等地影响所有群体或用户。