Anomalous pattern detection aims to identify instances where deviation from normalcy is evident, and is widely applicable across domains. Multiple anomalous detection techniques have been proposed in the state of the art. However, there is a common lack of a principled and scalable feature selection method for efficient discovery. Existing feature selection techniques are often conducted by optimizing the performance of prediction outcomes rather than its systemic deviations from the expected. In this paper, we proposed a sparsity-based automated feature selection (SAFS) framework, which encodes systemic outcome deviations via the sparsity of feature-driven odds ratios. SAFS is a model-agnostic approach with usability across different discovery techniques. SAFS achieves more than $3\times$ reduction in computation time while maintaining detection performance when validated on publicly available critical care dataset. SAFS also results in a superior performance when compared against multiple baselines for feature selection.
翻译:异常模式探测的目的是查明明显偏离常态的情况,并广泛适用于各个领域。在最新技术中提出了多种异常检测技术。然而,普遍缺乏一种原则性和可扩展的高效发现特征选择方法。现有的特征选择技术往往是通过优化预测结果的绩效而不是系统偏离预期进行。在本文件中,我们提议了一个基于宽度的自动特征选择框架,通过特征驱动的概率比的宽度,将系统性结果偏差编码起来。SAFS是一种模式性能偏差,在不同发现技术中具有可使用的性。在对公众提供的关键护理数据集进行验证时,SAFS在计算时,在保持检测性能的同时实现了3美元以上的减少。SFS在与选择特征的多个基线相比,还取得了优异的性能。