In the design of action recognition models, the quality of videos in the dataset is an important issue, however the trade-off between the quality and performance is often ignored. In general, action recognition models are trained and tested on high-quality videos, but in actual situations where action recognition models are deployed, sometimes it might not be assumed that the input videos are of high quality. In this study, we report qualitative evaluations of action recognition models for the quality degradation associated with transcoding by JPEG and H.264/AVC. Experimental results are shown for evaluating the performance of pre-trained models on the transcoded validation videos of Kinetics400. The models are also trained on the transcoded training videos. From these results, we quantitatively show the degree of degradation of the model performance with respect to the degradation of the video quality.
翻译:在行动识别模型的设计中,数据集中的视频质量是一个重要问题,然而,质量和性能之间的权衡往往被忽视。一般而言,行动识别模型是用高质量的视频进行培训和测试的,但在使用行动识别模型的实际情况下,有时可能不认为输入视频是高质量的。在本研究报告中,我们报告了对JPEG和H.264/AVC转码相关质量退化的行动识别模型的质量评估。实验结果显示于评价经过培训的关于动因学转码验证视频的模型的性能。这些模型还接受了转码培训视频培训。从这些结果中,我们从数量上显示了该模型在视频质量退化方面表现的退化程度。