In this work, we identify open research opportunities in applying Neural Temporal Point Process (NTPP) models to industry scale customer behavior data by carefully reproducing NTPP models published up to date on known literature benchmarks as well as applying NTPP models to a novel, real world consumer behavior dataset that is twice as large as the largest publicly available NTPP benchmark. We identify the following challenges. First, NTPP models, albeit their generative nature, remain vulnerable to dataset imbalances and cannot forecast rare events. Second, NTPP models based on stochastic differential equations, despite their theoretical appeal and leading performance on literature benchmarks, do not scale easily to large industry-scale data. The former is in light of previously made observations on deep generative models. Additionally, to combat a cold-start problem, we explore a novel addition to NTPP models - a parametrization based on static user features.
翻译:在这项工作中,我们通过仔细复制根据已知文献基准发布的最新最新文献基准的NTP模型,以及将NTP模型应用于新颖的、真实的世界消费者行为数据集,确定在将神经时点进程模型(NTPP)应用于行业规模的客户行为数据时的公开研究机会,该模型是公众可获得的最大NTPP基准的两倍。我们确定了以下挑战。首先,NTPP模型,尽管具有遗传性质,仍然易受数据集失衡的影响,无法预测罕见事件。第二,基于随机差异方程式的NTPP模型,尽管其理论吸引力和在文献基准方面的领先性表现不易与大型行业数据相比。前者是参照以前对深层基因化模型的观察而来的。此外,为了应对冷发问题,我们探索了NTPPP模型的新补充,即基于静态用户特征的假称。