For personalized ranking models, the well-calibrated probability of an item being preferred by a user has great practical value. While existing work shows promising results in image classification, probability calibration has not been much explored for personalized ranking. In this paper, we aim to estimate the calibrated probability of how likely a user will prefer an item. We investigate various parametric distributions and propose two parametric calibration methods, namely Gaussian calibration and Gamma calibration. Each proposed method can be seen as a post-processing function that maps the ranking scores of pre-trained models to well-calibrated preference probabilities, without affecting the recommendation performance. We also design the unbiased empirical risk minimization framework that guides the calibration methods to learning of true preference probability from the biased user-item interaction dataset. Extensive evaluations with various personalized ranking models on real-world datasets show that both the proposed calibration methods and the unbiased empirical risk minimization significantly improve the calibration performance.
翻译:对于个性化排名模型,用户偏好某一项的准确校准概率具有巨大的实际价值。虽然现有工作在图像分类方面显示了有希望的结果,但对于个性化排名,概率校准没有进行很多探索。在本文中,我们的目标是估计用户喜欢某项的准概率。我们调查了各种参数分布,并提出了两种参数校准方法,即高萨校准和伽玛校准。每一种拟议方法都可以被视为后处理功能,将预培训模型的分数排到良好校准的优惠概率,而不影响建议性能。我们还设计了不带偏见的经验风险最小化框架,指导校准方法,以便从偏差的用户-项目互动数据集中了解真正的优先概率。对现实世界数据集中各种个性化排序模型的广泛评价显示,拟议的校准方法和不偏倚的经验风险最小化都大大改进了校准性。