Sports data is more readily available and consequently, there has been an increase in the amount of sports analysis, predictions and rankings in the literature. Sports are unique in their respective stochastic nature, making analysis, and accurate predictions valuable to those involved in the sport. In response, we focus on Siamese Neural Networks (SNN) in unison with LightGBM and XGBoost models, to predict the importance of matches and to rank teams in Rugby and Basketball. Six models were developed and compared, a LightGBM, a XGBoost, a LightGBM (Contrastive Loss), LightGBM (Triplet Loss), a XGBoost (Contrastive Loss), XGBoost (Triplet Loss). The models that utilise a Triplet loss function perform better than those using Contrastive loss. It is clear LightGBM (Triplet loss) is the most effective model in ranking the NBA, producing a state of the art (SOTA) mAP (0.867) and NDCG (0.98) respectively. The SNN (Triplet loss) most effectively predicted the Super 15 Rugby, yielding the SOTA mAP (0.921), NDCG (0.983), and $r_s$ (0.793). Triplet loss produces the best overall results displaying the value of learning representations/embeddings for prediction and ranking of sports. Overall there is not a single consistent best performing model across the two sports indicating that other Ranking models should be considered in the future.
翻译:体育数据更容易获得,因此,在文学中,体育分析、预测和排名的数量有所增加。体育具有独特的独特性,具有各自的随机性,进行分析和准确的预测对参与体育的人来说是有价值的。作为回应,我们侧重于Siamse神经网络(SNN),与LightGBM和XGBost模型相配合,以预测比赛的重要性和在Rugby和Basskeball的队级。六种模型已经开发并进行了比较,一个LightGBM、一个XGBost、一个LightGBM(Contrastem Loss)、一个LightGBM(Triplet Loss)、一个XGBoost(Criplet Loss)、一个XGBost(Triplet loss)。使用Triplet损失功能的模型比使用相对匹配损失模型的要好。 显然,LightGBM(Triftle)是NBA(SOTA)和NDCG(O-R)最高等级排名的模型,这是用来预测SO-RBSA(NR)的S-RDR),这是用来预测中最佳的SLA(NBR)和最高排名。