One of the most significant differences of M5 over previous forecasting competitions is that it was held on Kaggle, an online platform of data scientists and machine learning practitioners. Kaggle provides a gathering place, or virtual community, for web users who are interested in the M5 competition. Users can share code, models, features, loss functions, etc. through online notebooks and discussion forums. This paper aims to study the social influence of virtual community on user behaviors in the M5 competition. We first research the content of the M5 virtual community by topic modeling and trend analysis. Further, we perform social media analysis to identify the potential relationship network of the virtual community. We study the roles and characteristics of some key participants that promote the diffusion of information within the M5 virtual community. Overall, this study provides in-depth insights into the mechanism of the virtual community's influence on the participants and has potential implications for future online competitions.
翻译:M5与以往的预测竞赛相比,M5与以往的预测竞赛最显著的区别之一是,它是在由数据科学家和机器学习实践者组成的在线平台Kaggle上举行的。Kaggle为对M5竞赛感兴趣的网络用户提供了一个聚集点或虚拟社区。用户可以通过在线笔记本和讨论论坛分享代码、模型、特征、损失功能等。本文旨在研究虚拟社区对M5竞赛中用户行为的社会影响。我们首先通过主题建模和趋势分析来研究M5虚拟社区的内容。此外,我们还进行了社会媒体分析,以确定虚拟社区的潜在关系网络。我们研究了一些关键参与者的作用和特点,以促进M5虚拟社区内部信息传播。总体而言,这项研究深入了解虚拟社区对参与者的影响机制,并对未来的在线竞争具有潜在影响。