In sports, an aging curve depicts the relationship between average performance and age in athletes' careers. This paper investigates the aging curves for offensive players in the Major League Baseball. We study this problem in a missing data context and account for different types of dropouts of baseball players during their careers. In particular, the performance metrics associated with the missing seasons are imputed using a multiple imputation model for multilevel data, and the aging curves are constructed based on the imputed datasets. We first perform a simulation study to evaluate the effects of different dropout mechanisms on the estimation of aging curves. Our method is then illustrated with analyses of MLB player data from past seasons. Results suggest an overestimation of the aging curves constructed without imputing the unobserved seasons, whereas a better estimate is achieved with our approach.
翻译:在体育领域,一个老化曲线描述了运动员职业生涯中平均表现和年龄之间的关系。本文调查了大联盟棒球中进攻性球员的老化曲线。我们在缺少数据的背景下研究这一问题,并记录了棒球运动员在职业生涯中不同类型辍学者的情况。特别是,与失踪赛季有关的性能衡量标准是使用多级数据的多重估算模型估算的,而老化曲线是根据估算的数据集构建的。我们首先进行模拟研究,以评估不同辍学机制对老化曲线估计的影响。然后用分析过去赛季的MLB玩家数据来说明我们的方法。结果显示,在不估计未观察到的赛季的情况下,高估了所构建的老化曲线,而我们的方法则是作出更好的估计。