In most popular sports leagues, like the MLB, NBA, and NFL, none of the commonly used statistics take into account the strengths of the opponents a player faces. One of the main reasons for this is the conventional belief that a player's luck tends to even out over the course of a season. The other main reason is the difficulties of finding a sensible algorithm to both quantify the strengths of the opponents and incorporate such quantifications into a renormalization of a player's statistics. In this paper, we first argue that certain statistics, such as Earned Run Average (ERA) or Fielding Independent Pitching (FIP) can be significantly skewed by opponents' strengths in the MLB. We then present an algorithm to renormalize such statistics, using FIP as the main example. This is achieved by observing that certain opponent statistics for all 30 teams in the MLB (e.g. the collection of each game's opponent FIP value over the course of a season) follow a universal distribution, up to scaling and shift. This enables us to establish a data set for a hypothetical average team and to develop a pitching statistic based on FIP which accounts for the strength of a pitcher's schedule through methods based on equipercentile equating. It is called aFIP, which measures what a pitcher's FIP would have been if he had faced a league-average offensive team every time he pitched. We find that there is a significant difference between aFIP and FIP for some pitchers during the 2019 season and other seasons as well, adding a new tool for player evaluation. This could make millions of dollars of difference in player contracts and in profits for teams as they enhance the accuracy with which they make player acquisitions. The universal distribution we observed also has many possible future applications throughout the sports world.
翻译:在大多数流行的体育联盟中,比如MLB、NBA和NFL, 通常使用的统计数据都没有考虑到玩家所面临的对手的强项。 其中一个主要原因是, 通常认为玩家的运气在一季中趋于平坦。 另一个主要原因是, 很难找到一个合理的算法, 既量化对手的强项, 又将这种量化纳入玩家统计数据的重新统一。 在本文中, 我们首先认为, 某些统计数据, 例如 收益运行平均值( ERA) 或 Field Int Pitching ( FIP), 可以被 MLB 中的对手的强项大大扭曲。 我们然后用一个算法来重新整理这种统计数据, 使用FIP 的主要例子。 这是通过观察, 找到一个合理的算法, 将游戏的对手FIP 值纳入到一个新的分布, 用来提升和转换。 这使我们能够为一个假设的平均时间组设置一个数据组, 并开发一个精确的比额, 使FIP 公司在每一季中进行一个计算。