Video anomaly detection is a challenging task because of diverse abnormal events. To this task, methods based on reconstruction and prediction are wildly used in recent works, which are built on the assumption that learning on normal data, anomalies cannot be reconstructed or predicated as good as normal patterns, namely the anomaly result with more errors. In this paper, we propose to discriminate anomalies from normal ones by the duality of normality-granted optical flow, which is conducive to predict normal frames but adverse to abnormal frames. The normality-granted optical flow is predicted from a single frame, to keep the motion knowledge focused on normal patterns. Meanwhile, We extend the appearance-motion correspondence scheme from frame reconstruction to prediction, which not only helps to learn the knowledge about object appearances and correlated motion, but also meets the fact that motion is the transformation between appearances. We also introduce a margin loss to enhance the learning of frame prediction. Experiments on standard benchmark datasets demonstrate the impressive performance of our approach.
翻译:由于各种异常事件,探测视频异常是一项艰巨的任务。对于这项任务,基于重建和预测的方法在近期工作中被疯狂地使用,其依据的假设是,根据正常数据学习,无法对异常进行重建,也不能以与正常模式相同的方式预测异常情况,即异常结果与更多错误的异常结果。在本文中,我们提议将异常情况与正常情况区分开来,因为正常光学流的双重性有助于预测正常框架,但对异常框架不利。正常情况光学流是从一个框架预测出来的,使运动知识以正常模式为重点。同时,我们将外观-动作通信计划从框架重建扩大到预测,这不仅有助于了解物体外观和相关运动的知识,而且满足了外观之间的转变。我们还引入了差差差,以加强对框架预测的学习。对标准基准数据集的实验显示了我们方法的惊人表现。