Abnormal event detection in video is a complex computer vision problem that has attracted significant attention in recent years. The complexity of the task arises from the commonly-adopted definition of an abnormal event, that is, a rarely occurring event that typically depends on the surrounding context. Following the standard formulation of abnormal event detection as outlier detection, we propose a background-agnostic framework that learns from training videos containing only normal events. Our framework is composed of an object detector, a set of appearance and motion auto-encoders, and a set of classifiers. Since our framework only looks at object detections, it can be applied to different scenes, provided that normal events are defined identically across scenes and that the single main factor of variation is the background. To overcome the lack of abnormal data during training, we propose an adversarial learning strategy for the auto-encoders. We create a scene-agnostic set of out-of-domain pseudo-abnormal examples, which are correctly reconstructed by the auto-encoders before applying gradient ascent on the pseudo-abnormal examples. We further utilize the pseudo-abnormal examples to serve as abnormal examples when training appearance-based and motion-based binary classifiers to discriminate between normal and abnormal latent features and reconstructions. We compare our framework with the state-of-the-art methods on four benchmark data sets, using various evaluation metrics. Compared to existing methods, the empirical results indicate that our approach achieves favorable performance on all data sets. In addition, we provide region-based and track-based annotations for two large-scale abnormal event detection data sets from the literature, namely ShanghaiTech and Subway.
翻译:视频中异常事件检测是一个复杂的计算机视觉问题,近年来引起了人们的极大关注。任务的复杂性来自对异常事件通常采用的定义,即通常取决于周围环境的罕见事件。在标准设定异常事件检测为异常检测为异常检测为异常检测后,我们提议了一个背景认知框架,从仅包含正常事件的培训视频中学习。我们的框架由物体检测器、一组外观和动作自动摄像仪以及一组分类器组成。由于我们的所有框架只看对象检测,因此它可以应用到不同的场景,条件是正常事件的定义在场景之间相同,而且变化的单一主要因素是背景。为了克服培训中缺少异常事件检测为异常检测检测的异常事件,我们为自动解读者提出了一个对抗性学习战略。我们创建了一个场景认知框架,由一个外观假异常检测器、一组外观和动作自动解读器组成,然后将梯度作为变异常态,我们进一步利用假变异性示例示例示例示例,在常规测试和四级数据库中,我们用正常的变相比对当前数据进行对比。