Puck localization is an important problem in ice hockey video analytics useful for analyzing the game, determining play location, and assessing puck possession. The problem is challenging due to the small size of the puck, excessive motion blur due to high puck velocity and occlusions due to players and boards. In this paper, we introduce and implement a network for puck localization in broadcast hockey video. The network leverages expert NHL play-by-play annotations and uses temporal context to locate the puck. Player locations are incorporated into the network through an attention mechanism by encoding player positions with a Gaussian-based spatial heatmap drawn at player positions. Since event occurrence on the rink and puck location are related, we also perform event recognition by augmenting the puck localization network with an event recognition head and training the network through multi-task learning. Experimental results demonstrate that the network is able to localize the puck with an AUC of $73.1 \%$ on the test set. The puck location can be inferred in 720p broadcast videos at $5$ frames per second. It is also demonstrated that multi-task learning with puck location improves event recognition accuracy.
翻译:Puck 本地化是冰冰球视频分析中的一个重要问题,有助于分析游戏、确定游戏位置和评估Puck拥有情况。由于球体体小,由于球体和棋盘的高球速度和球体隔热,运动过度模糊,因此问题具有挑战性。在本文中,我们在播放冰球视频中引入并安装了一个滑球本地化网络。网络利用NHL专家的逐个游戏说明,并使用时间环境来定位球体。玩家位置通过在球体位置上绘制以高山为基础的空间热映的编码播放器位置的注意机制纳入网络。由于在球场和球场位置上的事件发生是相关的,我们还通过增加一个事件识别点来确认事件,通过活动识别头,并通过多塔斯克学习来培训网络。实验结果显示,网络能够用AUC的73.1 + + 美元来定位球体。可以通过在测试集上以每秒5美元框的720p广播视频来推断Puck位置。还证明了多塔斯克位置的精确度,通过学习定位来提高位置的精确度。