Anomaly detection with weakly supervised video-level labels is typically formulated as a multiple instance learning (MIL) problem, in which we aim to identify snippets containing abnormal events, with each video represented as a bag of video snippets. Although current methods show effective detection performance, their recognition of the positive instances, i.e., rare abnormal snippets in the abnormal videos, is largely biased by the dominant negative instances, especially when the abnormal events are subtle anomalies that exhibit only small differences compared with normal events. This issue is exacerbated in many methods that ignore important video temporal dependencies. To address this issue, we introduce a novel and theoretically sound method, named Robust Temporal Feature Magnitude learning (RTFM), which trains a feature magnitude learning function to effectively recognise the positive instances, substantially improving the robustness of the MIL approach to the negative instances from abnormal videos. RTFM also adapts dilated convolutions and self-attention mechanisms to capture long- and short-range temporal dependencies to learn the feature magnitude more faithfully. Extensive experiments show that the RTFM-enabled MIL model (i) outperforms several state-of-the-art methods by a large margin on four benchmark data sets (ShanghaiTech, UCF-Crime, XD-Violence and UCSD-Peds) and (ii) achieves significantly improved subtle anomaly discriminability and sample efficiency. Code is available at https://github.com/tianyu0207/RTFM.
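To make the idea of feature magnitude learning over a MIL bag concrete, the following is a minimal sketch, not the released RTFM implementation. It assumes each snippet is a plain feature vector, that "magnitude" means the L2 norm, that a video is scored by the mean of its k largest snippet magnitudes, and that training pushes abnormal videos above normal ones with a hinge margin. The function names and the parameters `k` and `margin` are hypothetical; the abstract does not specify them.

```python
import math


def feature_magnitudes(bag):
    """Return the L2 norm of every snippet feature vector in a bag (one video)."""
    return [math.sqrt(sum(x * x for x in snippet)) for snippet in bag]


def topk_mean_magnitude(bag, k):
    """Score a bag by the mean of its k largest snippet magnitudes.

    Averaging only the top-k snippets (an assumption of this sketch) keeps the
    score from being dominated by the many normal snippets in an abnormal video.
    """
    if not bag:
        raise ValueError("bag must contain at least one snippet")
    mags = sorted(feature_magnitudes(bag), reverse=True)
    k = max(1, min(k, len(mags)))
    return sum(mags[:k]) / k


def magnitude_margin_loss(abnormal_bag, normal_bag, k=3, margin=1.0):
    """Hinge loss that is zero once the abnormal bag's top-k mean magnitude
    exceeds the normal bag's by at least `margin` (hypothetical parameter)."""
    gap = topk_mean_magnitude(abnormal_bag, k) - topk_mean_magnitude(normal_bag, k)
    return max(0.0, margin - gap)


# Toy bags: the abnormal video has one high-magnitude snippet among normal ones.
normal_bag = [[0.1, 0.0], [0.0, 0.2], [0.1, 0.1]]
abnormal_bag = [[0.1, 0.0], [3.0, 4.0], [0.0, 0.2]]

loss_small_margin = magnitude_margin_loss(abnormal_bag, normal_bag, k=1, margin=1.0)
loss_large_margin = magnitude_margin_loss(abnormal_bag, normal_bag, k=1, margin=10.0)
```

With `k=1`, the single high-magnitude snippet decides the abnormal video's score, so the loss vanishes for the small margin and stays positive for the large one. In a trainable model the fixed vectors would be replaced by features from a network, and the loss would be minimised over pairs of abnormal and normal videos.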