The dynamics of popularity in online media are driven by a combination of endogenous spreading mechanisms and response to exogenous shocks including news and events. However, little is known about the dependence of temporal patterns of popularity on event-related information, e.g. which types of events trigger long-lasting activity. Here we propose a simple model that describes the dynamics around peaks of popularity by incorporating key features, i.e., the anticipatory growth and the decay of collective attention together with circadian rhythms. The proposed model allows us to develop a new method for predicting the future page view activity and for clustering time series. To validate our methodology, we collect a corpus of page view data from Wikipedia associated to a range of planned events, that are events which we know in advance will have a fixed date in the future, such as elections and sport events. Our methodology is superior to existing models in both prediction and clustering tasks. Furthermore, restricting to Wikipedia pages associated to association football, we observe that the specific realization of the event, in our case which team wins a match or the type of the match, has a significant effect on the response dynamics after the event. Our work demonstrates the importance of appropriately modeling all phases of collective attention, as well as the connection between temporal patterns of attention and characteristic underlying information of the events they represent.
翻译:在线媒体受欢迎程度的动态是由内生传播机制和对外部冲击(包括新闻和事件)的反应相结合的内在传播机制和对外部冲击(包括新闻和事件)的反应所驱动的。然而,人们对受欢迎时间模式对事件相关信息的依赖程度知之甚少,例如哪些类型的事件触发长期活动。我们在这里提出了一个简单的模型,通过纳入关键特征,即预测性增长和集体注意力的消减,加上环球节奏,来描述受欢迎高峰周围的动态。拟议的模型使我们能够开发一种预测未来页面浏览活动和集群时间序列的新方法。为了验证我们的方法,我们收集了从维基百科到一系列计划活动(我们事先知道的事件将在未来有固定的日期,例如选举和体育活动)的页面浏览数据。我们的方法优于现有的预测和集群任务模式。此外,我们注意到,维基百科网页上与联足球有关的网页被限制的具体实现,即团队赢得匹配或匹配的类型,这对事件后的反应动态产生了重要影响。我们的工作展示了它们作为整个事件基本关注时间和关注程度之间的典型联系的重要性。