Sports betting's recent federal legalisation in the USA coincides with the golden age of machine learning. If bettors can leverage data to accurately predict the probability of an outcome, they can recognise when the bookmaker's odds are in their favour. As sports betting is a multi-billion dollar industry in the USA alone, identifying such opportunities could be extremely lucrative. Many researchers have applied machine learning to the sports outcome prediction problem, generally using accuracy to evaluate the performance of forecasting models. We hypothesise that for the sports betting problem, model calibration is more important than accuracy. To test this hypothesis, we train models on NBA data over several seasons and run betting experiments on a single season, using published odds. Evaluating various betting systems, we show that optimising the forecasting model for calibration leads to greater returns than optimising for accuracy, on average (return on investment of $110.42\%$ versus $2.98\%$) and in the best case ($902.01\%$ versus $222.84\%$). These findings suggest that for sports betting (or any forecasting problem where decisions are made based on the predicted probability of each outcome), calibration is a more important metric than accuracy. Sports bettors who wish to increase profits should therefore optimise their forecasting model for calibration.
翻译:最近美国联邦体育赌博的联邦法律化最近在美国的体育赌博,恰好与机器学习的黄金时代相吻合。如果赌徒能够利用数据来准确预测结果的概率,那么他们就能发现,当赌博者的胜算有利的时候,他们就能发现。由于体育赌博仅在美国就是一个数十亿美元的行业,因此确定这种机会可能非常有利。许多研究人员对体育结果预测问题应用了机器学习,通常使用精确度来评价预测模型的性能。我们假设,对于体育赌博问题,模型校准比准确性要重要得多。为了测试这一假设,我们在几个季节里对NBA数据进行模型培训,并在一个季节里用公布的胜算进行赌博试验。在评估各种赌博时,我们显示,对各种赌博的预测模式的优化比对准确性的优化要好得多,平均(投资回报为110.42 美元对2.98 美元),在最佳案例中(902.01美元对222.84美元对222.84 美元) 。这些结论表明,对于体育赌博(或任何预测问题,根据每个结果的预期概率来作决定的预测,利用公布的概率来进行赌博,因此,校准比标准更重要得多。</s>