Unbiased learning to rank (ULTR) aims to train an unbiased ranking model from biased user click logs. Most of the current ULTR methods are based on the examination hypothesis (EH), which assumes that the click probability can be factorized into two scalar functions, one related to ranking features and the other related to bias factors. Unfortunately, the interactions among features, bias factors and clicks are complicated in practice, and usually cannot be factorized in this independent way. Fitting click data with EH could lead to model misspecification and bring the approximation error. In this paper, we propose a vector-based EH and formulate the click probability as a dot product of two vector functions. This solution is complete due to its universality in fitting arbitrary click functions. Based on it, we propose a novel model named Vectorization to adaptively learn the relevance embeddings and sort documents by projecting embeddings onto a base vector. Extensive experiments show that our method significantly outperforms the state-of-the-art ULTR methods on complex real clicks as well as simple simulated clicks.
翻译:无偏见的排名学习(LUCTR)旨在从有偏向的用户点击日志中培训一个公正的排名模型(LUCTR) 。 当前的LUCTR方法大多基于测试假设( EH) 。 该假设假设假设假定点击概率可以被分解成两个星标函数, 一个与排名特征有关, 另一个与偏差因素有关。 不幸的是, 特性、 偏差因素和点击之间的相互作用在实践中很复杂, 通常无法以这种独立的方式进行分解。 将点击数据与 EH 匹配可能导致模型错误的区分并带来近似错误。 在本文中, 我们提议以矢量为基础的 EH 方法, 并将点击概率作为两个矢量函数的点产值。 这个解决方案是完整的, 因为它在安装任意点击功能时具有普遍性。 基于这个假设, 我们提议了一个名为矢量的新模型, 通过投射嵌入基矢量, 来适应性地学习嵌入和整理文件的关联性。 广泛的实验显示, 我们的方法在复杂的实际点击上, 以及简单的模拟点击上, 大大超越了最先进的土控方法。