We focus on Maximum Inner Product Search (MIPS), which is an essential problem in many machine learning communities. Given a query, MIPS finds the most similar items with the maximum inner products. Methods for Nearest Neighbor Search (NNS) which is usually defined on metric space don't exhibit the satisfactory performance for MIPS problem since inner product is a non-metric function. However, inner products exhibit many good properties compared with metric functions, such as avoiding vanishing and exploding gradients. As a result, inner product is widely used in many recommendation systems, which makes efficient Maximum Inner Product Search a key for speeding up many recommendation systems. Graph based methods for NNS problem show the superiorities compared with other class methods. Each data point of the database is mapped to a node of the proximity graph. Nearest neighbor search in the database can be converted to route on the proximity graph to find the nearest neighbor for the query. This technique can be used to solve MIPS problem. Instead of searching the nearest neighbor for the query, we search the item with maximum inner product with query on the proximity graph. In this paper, we propose a reinforcement model to train an agent to search on the proximity graph automatically for MIPS problem if we lack the ground truths of training queries. If we know the ground truths of some training queries, our model can also utilize these ground truths by imitation learning to improve the agent's search ability. By experiments, we can see that our proposed mode which combines reinforcement learning with imitation learning shows the superiorities over the state-of-the-art methods
翻译:我们的焦点是最大产品搜索(MIPS),这是许多机器学习社区的一个基本问题。根据一个查询,MIPS发现最相似的项目与最大内部产品最为相似。 以公制空间定义的近邻搜索方法通常不会显示MIPS问题的满意性能, 因为内部产品是一个非计量功能。 但是, 与公制功能相比, 内产产品具有许多良好的特性, 例如避免消失和爆炸梯度。 因此, 许多建议系统广泛使用内产产品, 这使得高效的最大内产产品搜索成为加速许多建议系统的密钥。 NPS 问题基于图表的方法显示与其他类方法相比的优越性。 数据库的每个数据点被定位为近邻搜索方法的节点。 数据库中最近邻的搜索可以转换为近端图像的路径, 以近端搜索的方式。 与最近的近端查询相比, 我们用最高级的内产产品搜索方法搜索 。 在本文中, 我们建议一个强化的实验模型, 将精细度模型用于培训机构搜索。