In this work, an automatic and simple framework for hockey ice-rink localization from broadcast videos is introduced. First, video is broken into video-shots by a hierarchical partitioning of the video frames, and thresholding based on their histograms. To localize the frames on the ice-rink model, a ResNet18-based regressor is implemented and trained, which regresses to four control points on the model in a frame-by-frame fashion. This leads to the projection jittering problem in the video. To overcome this, in the inference phase, the trajectory of the control points on the ice-rink model are smoothed, for all the consecutive frames of a given video-shot, by convolving a Hann window with the achieved coordinates. Finally, the smoothed homography matrix is computed by using the direct linear transform on the four pairs of corresponding points. A hockey dataset for training and testing the regressor is gathered. The results show success of this simple and comprehensive procedure for localizing the hockey ice-rink and addressing the problem of jittering without affecting the accuracy of homography estimation.
翻译:在这项工作中,从广播录像中引入了一个自动和简单的冰冰点定位框架。首先,视频通过视频框的分层分层和基于直方图的阈值被打破成视频片段。为了将冰点模型的框进行本地化,实施并培训了ResNet18的后退器,它以一个框架的方式返回到模型的四个控制点。这导致视频中的投影问题。在推论阶段,冰点模型控制点的轨迹是平滑的,对于给定的视频截图的所有连续框架而言,它涉及一个汉式窗口和达到的坐标。最后,光滑的同性恋矩阵是用对四对对应点的直接线变法来计算,一个用于培训和测试回归点的曲棍球数据集正在收集中。结果显示,在不影响同系估计准确性的情况下,冰点的本地化和解决折叠问题的简单和全面程序是成功的。