As relational event models are an increasingly popular model for studying relational structures, the reliability of large-scale event data collection becomes more and more important. Automated or human-coded events often suffer from non-negligible false-discovery rates in event identification. And most sensor data is primarily based on actors' spatial proximity for predefined time windows; hence, the observed events could relate either to a social relationship or random co-location. Both examples imply spurious events that may bias estimates and inference. We propose the Relational Event Model for Spurious Events (REMSE), an extension to existing approaches for interaction data. The model provides a flexible solution for modeling data while controlling for spurious events. Estimation of our model is carried out in an empirical Bayesian approach via data augmentation. Based on a simulation study, we investigate the properties of the estimation procedure. To demonstrate its usefulness in two distinct applications, we employ this model to combat events from the Syrian civil war and student co-location data. Results from the simulation and the applications identify the REMSE as a suitable approach to modeling relational event data in the presence of spurious events.
翻译:由于关系事件模型是研究关系结构的日益流行的模式,大规模事件数据收集的可靠性变得越来越重要,自动或人为编码事件往往在事件识别中出现不可忽略的虚假发现率。大多数传感器数据主要基于行为者对预定时间窗口的空间距离;因此,所观察到的事件可能与社会关系或随机合用同一地点有关。两个例子都暗示了可能偏向估计和推断的虚假事件。我们提议了“净化事件关系事件模型”(REMSE),这是现有互动数据方法的延伸。该模型为模拟数据提供了灵活的解决办法,同时控制了虚假事件。我们模型的模拟是通过数据扩增的经验性巴耶斯方法进行的。根据模拟研究,我们调查了估计程序的性质。为了在两种不同的应用中证明其效用,我们使用这一模型来打击叙利亚内战事件和学生合用同一地点的数据。模拟和应用的结果确定,REMSE是模拟事件数据的恰当方法,用于模拟发生时模拟事件的数据。