Time-series anomaly detection is an important task and has been widely applied in the industry. Since manual data annotation is expensive and inefficient, most applications adopt unsupervised anomaly detection methods, but the results are usually sub-optimal and unsatisfactory to end customers. Weak supervision is a promising paradigm for obtaining considerable labels in a low-cost way, which enables the customers to label data by writing heuristic rules rather than annotating each instance individually. However, in the time-series domain, it is hard for people to write reasonable labeling functions as the time-series data is numerically continuous and difficult to be understood. In this paper, we propose a Label-Efficient Interactive Time-Series Anomaly Detection (LEIAD) system, which enables a user to improve the results of unsupervised anomaly detection by performing only a small amount of interactions with the system. To achieve this goal, the system integrates weak supervision and active learning collaboratively while generating labeling functions automatically using only a few labeled data. All of these techniques are complementary and can promote each other in a reinforced manner. We conduct experiments on three time-series anomaly detection datasets, demonstrating that the proposed system is superior to existing solutions in both weak supervision and active learning areas. Also, the system has been tested in a real scenario in industry to show its practicality.
翻译:时间序列异常现象检测是一项重要任务,该行业广泛应用了时间序列异常现象检测。由于人工数据批注费用昂贵,效率低,大多数应用都采用未经监督的异常现象检测方法,但结果通常不尽理想,对终端客户不满意。 薄弱的监督是获得大量标签的有利范例,它使客户能够以低成本方式编写超常规则,而不是单独说明每个案例,从而给数据贴上标签。然而,在时间序列领域,人们很难写合理的标签功能,因为时间序列数据在数量上是连续的,难以理解。在本文件中,我们提议采用拉贝尔-Efficient 交互时间-关系异常现象检测(LEIAD)系统,使用户能够通过与系统进行少量互动来改进不受监督的异常现象检测结果。为了实现这一目标,该系统整合了薄弱的监管和积极学习,同时仅使用少数标签数据自动生成标签功能。所有这些技术都是互补的,并且能够相互加强。我们在三个时间序列中进行实验,以实际解决方案显示系统的积极性,我们在三个系统测试领域进行高级的测试,同时展示了系统测试。