A new Lossy Causal Temporal Convolutional Neural Network Autoencoder for anomaly detection is proposed in this work. Our framework uses a rate-distortion loss and an entropy bottleneck to learn a compressed latent representation for the task. The main idea of using a rate-distortion loss is to introduce representation flexibility that ignores or becomes robust to unlikely events with distinctive patterns, such as anomalies. These anomalies manifest as unique distortion features that can be accurately detected in testing conditions. This new architecture allows us to train a fully unsupervised model that has high accuracy in detecting anomalies from a distortion score despite being trained with some portion of unlabelled anomalous data. This setting is in stark contrast to many of the state-of-the-art unsupervised methodologies that require the model to be only trained on "normal data". We argue that this partially violates the concept of unsupervised training for anomaly detection as the model uses an informed decision that selects what is normal from abnormal for training. Additionally, there is evidence to suggest it also effects the models ability at generalisation. We demonstrate that models that succeed in the paradigm where they are only trained on normal data fail to be robust when anomalous data is injected into the training. In contrast, our compression-based approach converges to a robust representation that tolerates some anomalous distortion. The robust representation achieved by a model using a rate-distortion loss can be used in a more realistic unsupervised anomaly detection scheme.
翻译:在这项工作中,提出了用于异常现象检测的新的Lossy Causal Temal Convolution Convolution Neal网络自动编码器。 我们的框架使用一个速率扭曲损失和催化瓶颈来学习压缩潜伏代表任务。 使用速率扭曲损失的主要想法是引入代表灵活性, 忽视或对具有特殊模式( 如异常) 的不合理事件变得强大。 这些异常现象表现为在测试条件下可以准确检测到的独特扭曲特征。 这个新架构允许我们训练一个完全不受监督的模型, 该模型在检测扭曲得分异常方面具有高度的准确性, 尽管我们受过部分未贴标签的反常数据的培训。 这个设置与许多要求模型仅接受“ 正常数据” 培训的州级非超常化方法形成鲜明对比。 我们争辩说,这部分违反了对异常现象检测进行非超常性培训的概念,因为模型使用一种知情的模型选择了正常的培训。 此外,有证据表明,它也会影响模型的不常态化能力。 我们证明, 一种在稳健性分析方法中, 当它们被训练到正常数据时, 将数据变成正常的正态时, 我们的模型会使用一种正常的变压模式。