In-game win probability models, which provide a sports team's likelihood of winning at each point in a game based on historical observations, are becoming increasingly popular. In baseball, basketball and American football, they have become important tools to enhance fan experience, to evaluate in-game decision-making, and to inform coaching decisions. While equally relevant in soccer, the adoption of these models is held back by technical challenges arising from the low-scoring nature of the sport. In this paper, we introduce an in-game win probability model for soccer that addresses the shortcomings of existing models. First, we demonstrate that in-game win probability models for other sports struggle to provide accurate estimates for soccer, especially towards the end of a game. Second, we introduce a novel Bayesian statistical framework that estimates running win, tie and loss probabilities by leveraging a set of contextual game state features. An empirical evaluation on eight seasons of data for the top-five soccer leagues demonstrates that our framework provides well-calibrated probabilities. Furthermore, two use cases show its ability to enhance fan experience and to evaluate performance in crucial game situations.
翻译:游戏赢概率模型为体育队提供了在基于历史观察的比赛的每一点赢得比赛的可能性,这种游戏赢赢概率模型越来越受欢迎。在棒球、篮球和美式足球中,这些模型已成为重要的工具,可以增强球迷经验、评估球中决策并指导教练决策。虽然这些模型在足球中同样相关,但这些模型的采用却因体育低分性造成的技术挑战而受阻。在本文件中,我们为足球引入了一个游戏赢赢概率模型,解决现有模型的缺点。首先,我们证明其他体育竞赛的赢概率模型能够提供足球准确的估计数,特别是在一场比赛结束时。第二,我们引入了一种新型的贝叶斯统计框架,通过利用一系列背景游戏特点来估计赢、打平和损失概率。对前五个足球联盟的8个数据季节进行的经验评估表明,我们的框架提供了准确的概率。此外,两个使用案例表明它有能力提高球迷的经验,并评估关键比赛状况的性能。