TikTok is a relatively novel and widely popular media platform. In response to its expanding user base and cultural impact, researchers are turning to study the platform; however, TikTok, like many social media platforms, restricts external access to data. Prior works have acquired data from scraping the platform, user self-reports, and from accounts created by researchers for the study's purpose. Existing techniques, while yielding important insights, contain limitations for gathering large-scale quantitative insights on how real TikTok users behave on the platform. We bridge this research gap by implementing a data donation system to collect TikTok data. Our system leverages users' right to access their data enabled by the EU's GDPR regulation. We recruit 347 TikTok users, ask them to request their data from TikTok, and then use our system to customize, anonymize, and donate their data. We collect 4.9M videos viewed 9.2M times by our participants -- and associated engagement metrics -- to analyze how people consume content on TikTok, how prevalent liking behavior is on TikTok, and whether there are substantial differences across our participants' demographics. We conclude our work by discussing the lessons learned and future avenues for implementing data donation systems, which we believe offer a promising avenue for collecting user behavioral traces to understand social phenomena through the lens of the Web.
翻译:TikTok是一个相对新颖和广受欢迎的媒体平台。 研究人员为了应对其用户基础的不断扩大和文化影响,正在研究平台; 然而, TikTok 和许多社交媒体平台一样,正在研究平台; 限制数据外部访问。 先前的工程从剪切平台、用户自我报告以及研究人员为研究目的创建的账户中获取数据。 现有技术在产生重要见解的同时,对收集大规模量化知识以了解TikTok 用户在平台上的真实行为有局限性。 我们通过实施数据捐赠系统来弥补这一研究差距。 我们的系统利用了用户获得欧盟GDPR监管所允许的数据的权利。 我们招募了347 TikTok 用户, 要求他们从TikTok 用户那里获取数据, 然后再利用我们的系统定制、 匿名和 捐赠数据。 我们收集了4.9M 视频, 参与者在平台上观看了9. 2 OM 时间, 以及相关的参与指标 -- 来分析人们如何消费 TikTok 的内容, 我们如何普遍喜欢 TikTok 的行为, 我们在TikTok 上总结了他们的数据, 以及我们用户未来如何理解如何理解如何在通过社交上学习我们的数据路径, 我们通过用户如何理解如何理解未来如何学习。