Detecting abnormal events in video is commonly framed as a one-class classification task, where training videos contain only normal events, while test videos encompass both normal and abnormal events. In this scenario, anomaly detection is an open-set problem. However, some studies assimilate anomaly detection to action recognition. This is a closed-set scenario that fails to test the capability of systems at detecting new anomaly types. To this end, we propose UBnormal, a new supervised open-set benchmark composed of multiple virtual scenes for video anomaly detection. Unlike existing data sets, we introduce abnormal events annotated at the pixel level at training time, for the first time enabling the use of fully-supervised learning methods for abnormal event detection. To preserve the typical open-set formulation, we make sure to include disjoint sets of anomaly types in our training and test collections of videos. To our knowledge, UBnormal is the first video anomaly detection benchmark to allow a fair head-to-head comparison between one-class open-set models and supervised closed-set models, as shown in our experiments. Moreover, we provide empirical evidence showing that UBnormal can enhance the performance of a state-of-the-art anomaly detection framework on two prominent data sets, Avenue and ShanghaiTech. Our benchmark is freely available at https://github.com/lilygeorgescu/UBnormal.
翻译:在视频中检测异常事件通常被构建为一类分类任务,在这种情况下,训练视频仅包含正常事件,而测试视频涵盖正常和异常事件。在此情况下,异常检测是一个开放式问题。但是,一些研究将异常检测归纳为动作识别。这是一个闭合的场景,无法测试系统检测新的异常类型的能力。为此,我们提出了UBnormal,一个新的监督式开放式基准,由多个虚拟场景组成,用于视频异常检测。与现有数据集不同,我们在训练时引入了像素级别注释的异常事件,首次使得异常事件检测可以使用完全监督式学习方法。为了保持典型的开放式配方,我们确保在我们的训练和测试视频集合中包含不同类型的异常集合。据我们所知,UBnormal是第一个视频异常检测基准,允许一种公平的单类开放集模型和监督式闭合集模型之间的直接比较,如我们的实验所示。此外,我们提供了实证证据表明,UBnormal可以提高最先进的异常检测框架在两个著名数据集,即 Avenue 和 ShanghaiTech 上的性能。我们的基准在 https://github.com/lilygeorgescu/UBnormal 免费提供。