This work proposes a novel method to robustly and accurately model time series with heavy-tailed noise, in non-stationary scenarios. In many practical application time series have heavy-tailed noise that significantly impacts the performance of classical forecasting models; in particular, accurately modeling a distribution over extreme events is crucial to performing accurate time series anomaly detection. We propose a Spliced Binned-Pareto distribution which is both robust to extreme observations and allows accurate modeling of the full distribution. Our method allows the capture of time dependencies in the higher order moments of the distribution such as the tail heaviness. We compare the robustness and the accuracy of the tail estimation of our method to other state of the art methods on Twitter mentions count time series.
翻译:这项工作提出了一种新颖的方法,在非静止的情景中,用重尾噪音来强有力和准确地模拟时间序列。在许多实际应用的时间序列中,有重尾噪音,严重影响古典预测模型的性能;特别是,精确模拟极端事件的分布对于准确的时间序列异常检测至关重要。我们建议采用一个对极端观测和准确模拟全发的分流式。我们的方法允许在分布的较高时段(如尾巴重)中捕捉时间依赖性。我们将我们方法的可靠性和尾部估计的准确性与Twitter艺术方法的其他状态进行比较,并提到时间序列。