For the weakly supervised anomaly detection task, most existing work is limited to the problem of inadequate video representation due to the inability to model long-time contextual information. We propose a weakly supervised adaptive graph convolutional network (WAGCN) to model the contextual relationships among video segments. And we fully consider the influence of other video segments on the current segment when generating the anomaly probability score for each segment. Firstly, we combine the temporal consistency as well as feature similarity of video segments for composition, which makes full use of the association information among spatial-temporal features of anomalous events in videos. Secondly, we propose a graph learning layer in order to break the limitation of setting topology manually, which adaptively extracts sparse graph adjacency matrix based on data. Extensive experiments on two public datasets (i.e., UCF-Crime dataset and ShanghaiTech dataset) demonstrate the effectiveness of our approach.
翻译:对于监督不力的异常点探测任务,大多数现有工作仅限于由于无法模拟长期背景信息而导致的视频代表性不足的问题。我们提议建立一个监督不力的适应性图变网络(WAGCN),以模拟视频各部分之间的背景关系。我们充分考虑其他视频部分在产生每个部分的异常概率分数时对当前部分的影响。首先,我们结合了视频各部分的时间一致性和特征的相似性,充分利用了视频中异常事件空间时空特征之间的关联信息。第二,我们提出一个图表学习层,以打破手工设置地形学的限制,这种结构根据数据,根据适应性提取稀少的图相近矩阵。关于两个公共数据集(即UCF-犯罪数据集和上海科技数据集)的广泛实验显示了我们的方法的有效性。