Pose-based anomaly detection is a video-analysis technique for detecting anomalous events or behaviors by examining human pose extracted from the video frames. Utilizing pose data alleviates privacy and ethical issues. Also, computation-wise, the complexity of pose-based models is lower than pixel-based approaches. However, it introduces more challenges, such as noisy skeleton data, losing important pixel information, and not having enriched enough features. These problems are exacerbated by a lack of anomaly detection datasets that are good enough representatives of real-world scenarios. In this work, we analyze and quantify the characteristics of two well-known video anomaly datasets to better understand the difficulties of pose-based anomaly detection. We take a step forward, exploring the discriminating power of pose and trajectory for video anomaly detection and their effectiveness based on context. We believe these experiments are beneficial for a better comprehension of pose-based anomaly detection and the datasets currently available. This will aid researchers in tackling the task of anomaly detection with a more lucid perspective, accelerating the development of robust models with better performance.
翻译:利用数据减轻隐私和伦理问题。此外,在计算方面,基于表面的模型的复杂性低于基于像素的方法。然而,它提出了更多的挑战,例如,噪音骨骼数据,失去重要的像素信息,没有足够丰富的特征。这些问题由于缺少异常现象探测数据集而加剧,而这些数据集足以代表真实世界的情景。在这项工作中,我们分析和量化两个众所周知的视频异常数据集的特征,以更好地了解基于表面的异常探测的困难。我们向前迈出了一步,探索基于图像的模型和轨迹的区别力量,以便根据背景发现视频异常及其有效性。我们认为这些实验有助于更好地了解基于表面的异常探测和现有数据集。这将有助于研究人员从更清晰的角度处理异常探测任务,加快开发稳健模型,提高性能。</s>