Detecting fights from still images shared on social media is an important task required to limit the distribution of violent scenes in order to prevent their negative effects. For this reason, in this study, we address the problem of fight detection from still images collected from the web and social media. We explore how well one can detect fights from just a single still image. We also propose a new dataset, named Social Media Fight Images (SMFI), comprising real-world images of fight actions. Results of the extensive experiments on the proposed dataset show that fight actions can be recognized successfully from still images. That is, even without exploiting the temporal information, it is possible to detect fights with high accuracy by utilizing appearance only. We also perform cross-dataset experiments to evaluate the representation capacity of the collected dataset. These experiments indicate that, as in the other computer vision problems, there exists a dataset bias for the fight recognition problem. Although the methods achieve close to 100% accuracy when trained and tested on the same fight dataset, the cross-dataset accuracies are significantly lower, i.e., around 70% when more representative datasets are used for training. SMFI dataset is found to be one of the two most representative datasets among the utilized five fight datasets.
翻译:从社交媒体共享的静态图像中检测战斗,是限制暴力场景分布以防止其负面影响的一个重要任务。为此原因,我们在本研究中处理从从网络和社交媒体收集的静态图像中检测战斗的问题。我们探索从仅从一个静态图像中检测战斗有多好。我们还提议建立一个新的数据集,名为社会媒体对抗图像(SMIF),由真实世界的战斗动作图像组成。拟议数据集的广泛实验结果显示,战斗动作可以从静态图像中成功识别。即使不利用时间信息,也有可能通过只使用外观来探测高度精确的战斗。我们还进行交叉数据集实验,以评估所收集的数据集的显示能力。这些实验表明,与其他计算机视觉问题一样,战斗识别问题存在数据集偏差。虽然在对相同的格斗数据集进行训练和测试时,方法接近100%的准确度,交叉数据集的准确度则大大降低。也就是说,在使用更具代表性的数据集进行最多用于战斗的数据时,发现大约70%。SMAFIS数据集中的一项数据是用来进行最有代表性的数据。