Outlier detection is an important topic in machine learning and has been used in a wide range of applications. Outliers are objects that are few in number and deviate from the majority of objects. As a result of these two properties, we show that outliers are susceptible to a mechanism called fluctuation. This article proposes a method called fluctuation-based outlier detection (FBOD) that achieves a low linear time complexity and detects outliers purely based on the concept of fluctuation without employing any distance, density or isolation measure. Fundamentally different from all existing methods. FBOD first converts the Euclidean structure datasets into graphs by using random links, then propagates the feature value according to the connection of the graph. Finally, by comparing the difference between the fluctuation of an object and its neighbors, FBOD determines the object with a larger difference as an outlier. The results of experiments comparing FBOD with seven state-of-the-art algorithms on eight real-world tabular datasets and three video datasets show that FBOD outperforms its competitors in the majority of cases and that FBOD has only 5% of the execution time of the fastest algorithm. The experiment codes are available at: https://github.com/FluctuationOD/Fluctuation-based-Outlier-Detection.
翻译:外部探测是机器学习中的一个重要话题, 并被用于广泛的应用中。 外部探测是数量少且与大多数对象不同的对象。 由于这两个属性, 我们显示外部探测很容易被称作波动机制。 本条提出一种叫作基于波动的外部探测( FBOD) 的方法, 它实现低线线性复杂度, 纯粹基于波动概念, 不使用任何距离、 密度或隔离度, 与所有现有方法基本不同 。 FBOD 首先使用随机链接将 Euclidean 结构数据集转换成图表, 然后根据图形的链接传播特征值 。 最后, 通过比较一个对象及其邻居的波动差异, FBOD 确定对象, 其差异更大。 FBOD 与基于8个真实世界的表格数据集的7个州- 高级算法和 3个视频数据集的实验结果显示 FBOD 在大多数案例中超越其竞争者, 并且 FBOFD 只能使用最快的时间结构 。 FOFIAD 。