Understanding collective decision making at a large-scale, and elucidating how community organization and community dynamics shape collective behavior are at the heart of social science research. In this work we study the behavior of thousands of communities with millions of active members. We define a novel task: predicting which community will undertake an unexpected, large-scale, distributed campaign. To this end, we develop a hybrid model, combining textual cues, community meta-data, and structural properties. We show how this multi-faceted model can accurately predict large-scale collective decision-making in a distributed environment. We demonstrate the applicability of our model through Reddit's r/place a large-scale online experiment in which millions of users, self-organized in thousands of communities, clashed and collaborated in an effort to realize their agenda. Our hybrid model achieves a high F1 prediction score of 0.826. We find that coarse meta-features are as important for prediction accuracy as fine-grained textual cues, while explicit structural features play a smaller role. Interpreting our model, we provide and support various social insights about the unique characteristics of the communities that participated in the r/place experiment. Our results and analysis shed light on the complex social dynamics that drive collective behavior, and on the factors that propel user coordination. The scale and the unique conditions of the r/place experiment suggest that our findings may apply in broader contexts, such as online activism, (countering) the spread of hate speech and reducing political polarization. The broader applicability of the model is demonstrated through an extensive analysis of the WallStreetBets community, their role in r/place and the GameStop short squeeze campaign of 2021.
翻译:大规模地理解集体决策,并阐明社区组织和社区动态如何影响集体行为是社会科学研究的核心所在。我们在此工作中研究成千上万社区的行为,有数百万活跃成员参与。我们定义了一项新颖的任务:预测哪个社区将开展意想不到的大规模分散运动。我们为此开发了一个混合模型,将文字提示、社区元数据和结构属性结合起来。我们展示了这一多面模型如何准确预测分布环境中的大规模集体决策。我们通过Reddit的大规模在线实验展示了我们的模型的适用性。我们展示了我们模型的可应用性,在这个实验中,数以百万计的用户自行组织起来,为了实现其议程而进行了冲突与合作。我们混合模型的F1预测得分高达0.826。我们发现,模糊的元运动特征对于预测准确性与精细的文本提示一样重要,而明确的结构性特征则发挥较小的作用。在解释我们的模型时,我们提供并支持各种关于社区独特的社会特征的社会洞察力,这些特性是参与到共同的轨道/轨道上。我们通过复杂的组织分析,我们所展示的系统分析,我们所展示的复杂地分析结果和精确的系统分析。