The self-exciting Hawkes process is widely used to model events which occur in bursts. However, many real world data sets contain missing events and/or noisily observed event times, which we refer to as data distortion. The presence of such distortion can severely bias the learning of the Hawkes process parameters. To circumvent this, we propose modeling the distortion function explicitly. This leads to a model with an intractable likelihood function which makes it difficult to deploy standard parameter estimation techniques. As such, we develop the ABC-Hawkes algorithm which is a novel approach to estimation based on Approximate Bayesian Computation (ABC) and Markov Chain Monte Carlo. This allows the parameters of the Hawkes process to be learned in settings where conventional methods induce substantial bias or are inapplicable. The proposed approach is shown to perform well on both real and simulated data.
翻译:自我激发的霍克斯进程被广泛用于模拟连续发生的事件。然而,许多真实的世界数据集包含缺失的事件和(或)新观察到的事件时间,我们称之为数据扭曲。这种扭曲的存在会严重偏向霍克斯进程参数的学习。为了绕开这一点,我们提议明确模拟扭曲功能。这导致一个具有难以捉摸的可能性功能的模型,使得难以运用标准参数估计技术。因此,我们开发了ABC-Hawkes算法,这是基于阿普约巴伊西亚计算(ABC)和马克夫连锁蒙特卡洛(Markov Chail Calle Monte Carlo)的新的估算方法。这样可以让霍克斯进程参数在常规方法引起严重偏差或不适用的地方学习。提议的方法在真实和模拟数据上都表现良好。