In this paper we propose a novel method for semi-supervised anomaly detection (SSAD). Our classifier is named QMS22 as its inception was dated 2022 upon the framework of quadratic multiform separation (QMS), a recently introduced classification model. QMS22 tackles SSAD by solving a multi-class classification problem involving both the training set and the test set of the original problem. The classification problem intentionally includes classes with overlapping samples. One of the classes contains mixture of normal samples and outliers, and all other classes contain only normal samples. An outlier score is then calculated for every sample in the test set using the outcome of the classification problem. We also include performance evaluation of QMS22 against top performing classifiers using ninety-five benchmark imbalanced datasets from the KEEL repository. These classifiers are BRM (Bagging-Random Miner), OCKRA (One-Class K-means with Randomly-projected features Algorithm), ISOF (Isolation Forest), and ocSVM (One-Class Support Vector Machine). It is shown by using the area under the curve of the receiver operating characteristic curve as the performance measure, QMS22 significantly outperforms ISOF and ocSVM. Moreover, the Wilcoxon signed-rank tests reveal that there is no statistically significant difference when testing QMS22 against BRM nor QMS22 against OCKRA.
翻译:在本文中,我们提出了一个半监督异常现象检测(SSAD)的新颖方法。我们的分类器名为 QMS22 。 我们的分类器的名称是 QMS22, 因为它的开始日期是2022年, 是在最近推出的分类模型(QMS QMS ) 的四面形多形分解(QMS ) 框架内。 QMS22 解决了SSAD 的多级分类问题, 涉及到培训组和最初问题的测试组。 分类问题有意包括有重叠样品的类别。 其中一种类别包含正常样品和外部异常特征的混合物, 而所有其他类别只包含正常样品。 然后, 使用分类问题的结果来计算每套样本中的每个样本的分数。 我们还包括用95个基准的不平衡数据集(QMS QMS ) 对高级性能分类师的绩效评估。 这些分类器是BRM(Blagg-random Miner), Ok(O- Class ), ISO- IMS 的状态测试区域, 显示S 的成绩压质变, 测试为ISO- 。