Cost-Sensitive Online Classification has drawn extensive attention in recent years, where the main approach is to directly online optimize two well-known cost-sensitive metrics: (i) weighted sum of sensitivity and specificity; (ii) weighted misclassification cost. However, previous existing methods only considered first-order information of data stream. It is insufficient in practice, since many recent studies have proved that incorporating second-order information enhances the prediction performance of classification models. Thus, we propose a family of cost-sensitive online classification algorithms with adaptive regularization in this paper. We theoretically analyze the proposed algorithms and empirically validate their effectiveness and properties in extensive experiments. Then, for better trade off between the performance and efficiency, we further introduce the sketching technique into our algorithms, which significantly accelerates the computational speed with quite slight performance loss. Finally, we apply our algorithms to tackle several online anomaly detection tasks from real world. Promising results prove that the proposed algorithms are effective and efficient in solving cost-sensitive online classification problems in various real-world domains.
翻译:近年来,成本敏感在线分类引起了广泛的注意,主要办法是直接在网上优化两种众所周知的成本敏感度指标:(一) 敏感度和具体度的加权和(二) 加权分类成本;然而,以往的方法只考虑数据流的第一阶信息,但在实践中是不够的,因为许多最近的研究证明,纳入第二阶信息可以提高分类模型的预测性能。因此,我们提议在本文中采用一套具有适应性规范化的成本敏感度在线分类算法。我们从理论上分析拟议的算法,并在广泛的实验中从经验上验证其有效性和特性。然后,为了更好地交换性能和效率,我们进一步将草图技术引入我们的算法中,这种算法大大加快计算速度,造成相当轻微的性能损失。最后,我们运用我们的算法来处理现实世界的若干在线异常检测任务。预言的结果证明,拟议的算法在解决各种现实世界领域的成本敏感的在线分类问题方面是有效和高效的。