In recent years, mining the knowledge from asynchronous sequences by Hawkes process is a subject worthy of continued attention, and Hawkes processes based on the neural network have gradually become the most hotly researched fields, especially based on the recurrence neural network (RNN). However, these models still contain some inherent shortcomings of RNN, such as vanishing and exploding gradient and long-term dependency problems. Meanwhile, Transformer based on self-attention has achieved great success in sequential modeling like text processing and speech recognition. Although the Transformer Hawkes process (THP) has gained huge performance improvement, THPs do not effectively utilize the temporal information in the asynchronous events, for these asynchronous sequences, the event occurrence instants are as important as the types of events, while conventional THPs simply convert temporal information into position encoding and add them as the input of transformer. With this in mind, we come up with a new kind of Transformer-based Hawkes process model, Temporal Attention Augmented Transformer Hawkes Process (TAA-THP), we modify the traditional dot-product attention structure, and introduce the temporal encoding into attention structure. We conduct numerous experiments on a wide range of synthetic and real-life datasets to validate the performance of our proposed TAA-THP model, significantly improvement compared with existing baseline models on the different measurements is achieved, including log-likelihood on the test dataset, and prediction accuracies of event types and occurrence times. In addition, through the ablation studies, we vividly demonstrate the merit of introducing additional temporal attention by comparing the performance of the model with and without temporal attention.
翻译:近些年来,从霍克斯工艺的非同步序列中挖掘知识是一个值得持续关注的主题,而基于神经网络的霍克斯工艺逐渐成为最热的研究领域,特别是基于神经网络(RNN)的复发。然而,这些模型仍然包含RNN的一些内在缺陷,例如脱落和爆炸梯度和长期依赖性问题。与此同时,基于自我意识的变异器在像文本处理和语音识别这样的顺序建模方面取得了巨大的成功。尽管变异器霍克斯工艺(THP)取得了巨大的性能改进,但THP并没有有效地利用在不同步事件中的时间信息,因为这些不同步的序列,事件的发生时间与事件的类型一样重要,而传统的THPSP将时间信息转换成位置编码,并把它们添加为变异质器的投入。怀着这个想法,我们产生了一种新型的变异式变异式变动器工艺模型,即Temooralat Regard Hawes 进程(TA-THP)取得了巨大的性能改进,我们用许多次的合成实验和变现模型来对数据进行模拟的实验和变现时程的轨迹观察。