Video anomaly detection is an essential but challenging task. The prevalent methods mainly investigate the reconstruction difference between normal and abnormal patterns but ignore the semantics consistency between appearance and motion information of behavior patterns, making the results highly dependent on the local context of frame sequences and lacking the understanding of behavior semantics. To address this issue, we propose a framework of Appearance-Motion Semantics Representation Consistency that uses the gap of appearance and motion semantic representation consistency between normal and abnormal data. The two-stream structure is designed to encode the appearance and motion information representation of normal samples, and a novel consistency loss is proposed to enhance the consistency of feature semantics so that anomalies with low consistency can be identified. Moreover, the lower consistency features of anomalies can be used to deteriorate the quality of the predicted frame, which makes anomalies easier to spot. Experimental results demonstrate the effectiveness of the proposed method.
翻译:视频异常现象的探测是一项重要但具有挑战性的任务。流行的方法主要是调查正常和异常模式之间的重建差异,但忽视行为模式的外观和动作信息之间的语义一致性,使结果高度依赖框架序列的当地背景,缺乏行为语义学的理解。为了解决这一问题,我们提议了一个外观和动作语义代表一致性框架,利用正常和异常数据之间的外观和动作语义代表一致性差距。双流结构旨在将正常样本的外观和动作信息表达形式编码,并提议出现新的一致性损失,以加强特征语义学的一致性,从而能够以低一致性的方式识别异常现象。此外,异常现象的较低一致性特征可以用来降低预测框架的质量,从而更容易发现异常现象。实验结果显示了拟议方法的有效性。</s>