We present tractable methods for detecting changes in player performance metrics and apply these methods to Major League Baseball (MLB) batting and pitching data from the 2023 and 2024 seasons. First, we derive principled benchmarks for when performance metrics can be considered statistically reliable, assuming no underlying change, using distributional assumptions and standard concentration inequalities. We then propose a changepoint detection algorithm that combines a likelihood-based approach with split-sample inference to control false positives, using either nonparametric tests or tests appropriate to the underlying data distribution. These tests incorporate a shift parameter, allowing users to specify the minimum magnitude of change to detect. We demonstrate the utility of this approach across several baseball applications: detecting changes in batter plate discipline metrics (e.g., chase and whiff rate), identifying velocity changes in pitcher fastballs, and validating velocity changepoints against a curated ground-truth dataset of pitchers who transitioned from relief to starting roles. Our method flags meaningful changes in 91% of these `ground-truth' cases and reveals that, for some metrics, more than 60% of detected changes occur in-season. While developed for baseball, the proposed framework is broadly applicable to any setting involving monitoring of individual performance over time.
翻译:我们提出了检测玩家表现指标变化的可处理方法,并将这些方法应用于2023和2024赛季美国职业棒球大联盟(MLB)的击球与投球数据。首先,基于分布假设和标准集中不等式,在假设无潜在变化的前提下,我们推导了判定表现指标具有统计可靠性的理论基准。随后,我们提出一种结合似然方法与分割样本推断的变点检测算法,通过非参数检验或适配数据分布的检验来控制误报率。这些检验引入了偏移参数,允许用户指定待检测变化的最小幅度。我们在多个棒球应用场景中验证了该方法的实用性:检测击球员击球纪律指标(如追打率和挥空率)的变化、识别投手快速球的速度变化,以及对照从救援投手转为先发投手的真实标注数据集验证速度变点。该方法在91%的‘真实标注’案例中成功标记出有意义的变化,并揭示对于某些指标,超过60%的检测变化发生在赛季期间。虽然本方法针对棒球运动开发,但所提框架广泛适用于任何涉及个体表现长期监测的场景。