Time series anomaly detection strives to uncover potential abnormal behaviors and patterns from temporal data, and has fundamental significance in diverse application scenarios. Constructing an effective detection model usually requires adequate training data stored in a centralized manner, however, this requirement sometimes could not be satisfied in realistic scenarios. As a prevailing approach to address the above problem, federated learning has demonstrated its power to cooperate with the distributed data available while protecting the privacy of data providers. However, it is still unclear that how existing time series anomaly detection algorithms perform with decentralized data storage and privacy protection through federated learning. To study this, we conduct a federated time series anomaly detection benchmark, named FedTADBench, which involves five representative time series anomaly detection algorithms and four popular federated learning methods. We would like to answer the following questions: (1)How is the performance of time series anomaly detection algorithms when meeting federated learning? (2) Which federated learning method is the most appropriate one for time series anomaly detection? (3) How do federated time series anomaly detection approaches perform on different partitions of data in clients? Numbers of results as well as corresponding analysis are provided from extensive experiments with various settings. The source code of our benchmark is publicly available at https://github.com/fanxingliu2020/FedTADBench.
翻译:建立有效检测模型通常需要集中储存适当的培训数据,然而,这一要求有时无法在现实情况下得到满足。作为解决上述问题的一种普遍办法,联邦学习表明,它有能力与分布的数据合作,同时保护数据提供者的隐私。然而,目前尚不清楚的是,现有的时间序列异常检测算法如何通过分散的数据存储和通过联合学习保护隐私而发挥分散的数据存储和隐私的作用。为了研究这一点,我们开展了一个联合时间序列异常检测基准,名为FedT ADBench,其中涉及五个具有代表性的时间序列异常检测算法和四种受欢迎的联合学习方法。我们想回答以下问题:(1) 如何利用时间序列异常检测算法与分布的数据合作,同时保护数据提供者的隐私。(2) 联邦学习方法对于时间序列异常检测最为合适?(3) 联邦时间序列异常检测法如何在不同客户的数据分割上发挥作用?结果的数量以及相应的分析来自广泛的时间序列测算法和各种基准。