At the end of October 2022, Elon Musk concluded his acquisition of Twitter. In the weeks and months before that, several questions were publicly discussed that were not only of interest to the platform's future buyers, but also of high relevance to the Computational Social Science research community. For example, how many active users does the platform have? What percentage of accounts on the site are bots? And, what are the dominating topics and sub-topical spheres on the platform? In a globally coordinated effort of 80 scholars to shed light on these questions, and to offer a dataset that will equip other researchers to do the same, we have collected all 375 million tweets published within a 24-hour time period starting on September 21, 2022. To the best of our knowledge, this is the first complete 24-hour Twitter dataset that is available for the research community. With it, the present work aims to accomplish two goals. First, we seek to answer the aforementioned questions and provide descriptive metrics about Twitter that can serve as references for other researchers. Second, we create a baseline dataset for future research that can be used to study the potential impact of the platform's ownership change.
翻译:2022年10月底, Elon Musk 完成了Twitter的获取。 在此前的几周和几个月里,我们公开讨论了几个问题,这些问题不仅对平台的未来买主感兴趣,而且与计算社会科学研究界关系重大。例如,平台有多少活跃用户? 网站的账户中有多少比例是机器人? 以及平台上哪些是主导主题和次主题领域? 在全球80名学者协调下,努力阐明这些问题,并提供一套数据集,让其他研究人员也能够这样做。 我们收集了所有在2022年9月21日开始的24小时时间内公布的3.75亿个推特。 据我们所知,这是第一个完整的24小时Twitter数据集,可供研究界使用。当前工作的目的是实现两个目标。首先,我们要回答上述问题,并提供可以作为其他研究人员参考的关于Twitter的描述性指标。 其次,我们为未来研究建立基线数据集,可用于研究平台所有权变化的潜在影响。