Fast item ranking is an important task in recommender systems. In previous works, graph-based Approximate Nearest Neighbor (ANN) approaches have demonstrated good performance on item ranking tasks with generic searching/matching measures (including complex measures such as neural network measures). However, since these ANN approaches must go through the neural measures several times during ranking, the computation is not practical if the neural measure is a large network. On the other hand, fast item ranking using existing hashing-based approaches, such as Locality Sensitive Hashing (LSH), only works with a limited set of measures. Previous learning-to-hash approaches are also not suitable to solve the fast item ranking problem since they can take a significant amount of time and computation to train the hash functions. Hashing approaches, however, are attractive because they provide a principle and efficient way to retrieve candidate items. In this paper, we propose a simple and effective learning-to-hash approach for the fast item ranking problem that can be used for any type of measure, including neural network measures. Specifically, we solve this problem with an asymmetric hashing framework based on discrete inner product fitting. We learn a pair of related hash functions that map heterogeneous objects (e.g., users and items) into a common discrete space where the inner product of their binary codes reveals their true similarity defined via the original searching measure. The fast ranking problem is reduced to an ANN search via this asymmetric hashing scheme. Then, we propose a sampling strategy to efficiently select relevant and contrastive samples to train the hashing model. We empirically validate the proposed method against the existing state-of-the-art fast item ranking methods in several combinations of non-linear searching functions and prominent datasets.
翻译:快速项目排序是推荐器系统中的一项重要任务。 在以往的工程中, 基于图形的 Apbear Aprear Nearest Mighbbor (ANN) 方法在项目排序任务上表现良好, 采用了通用搜索/ 匹配措施( 包括神经网络测量等复杂措施 ) 。 但是, 由于这些 ANN 方法在排序中必须通过神经测量数倍, 如果神经测量是一个大型网络, 计算是不切实际的。 另一方面, 使用基于散列的现有方法, 如本地敏感散列( LSH) 快速项目排序方法, 只能用有限的一套措施来操作。 先前的搜索到列表方法也不适合解决快速项目排序问题, 因为它们可以花费大量的时间和计算来训练仓列功能。 但是, 挂动方法具有吸引力, 因为它们提供了一种原则, 并高效地检索候选项目。 在本文中, 我们提出一个简单、 学习到快速项目排序的组合方法, 包括神经网络测量。 具体地, 我们用一个不精确的模型来解决这个不精确的项目排序问题, 。