A common assumption of novelty detection is that the distribution of both "normal" and "novel" data are static. This, however, is often not the case - for example scenarios where data evolves over time or scenarios in which the definition of normal and novel depends on contextual information, both leading to changes in these distributions. This can lead to significant difficulties when attempting to train a model on datasets where the distribution of normal data in one scenario is similar to that of novel data in another scenario. In this paper we propose a context-aware approach to novelty detection for deep autoencoders to address these difficulties. We create a semi-supervised network architecture that utilises auxiliary labels to reveal contextual information and allow the model to adapt to a variety of contexts in which the definitions of normal and novel change. We evaluate our approach on both image data and real world audio data displaying these characteristics and show that the performance of individually trained models can be achieved in a single model.
翻译:新发现的一个常见假设是,“正常”和“新奇”数据的分布都是静止的。然而,这种情况往往并不存在,例如,数据随着时间变化而变化的情况,或正常和新颖的定义取决于背景信息的情景,都会导致这些分布的变化。这可能导致在试图培训数据集模型时出现重大困难,因为在一个情景中正常数据的分布与另一个情景中新数据相似。在本文中,我们提出了一种背景认知方法,用于为深层自动编码器进行新颖的检测,以解决这些困难。我们创建了一个半监督的网络结构,利用辅助标签披露背景信息,使模型适应正常和新变化定义的各种背景。我们评估了显示这些特征的图像数据和真实世界音频数据的方法,并表明个人培训模型的性能可以在单一模型中实现。