Improving the quality of search results can significantly enhance users experience and engagement with search engines. In spite of several recent advancements in the fields of machine learning and data mining, correctly classifying items for a particular user search query has been a long-standing challenge, which still has a large room for improvement. This paper introduces the "Shopping Queries Dataset", a large dataset of difficult Amazon search queries and results, publicly released with the aim of fostering research in improving the quality of search results. The dataset contains around 130 thousand unique queries and 2.6 million manually labeled (query,product) relevance judgements. The dataset is multilingual with queries in English, Japanese, and Spanish. The Shopping Queries Dataset is being used in one of the KDDCup'22 challenges. In this paper, we describe the dataset and present three evaluation tasks along with baseline results: (i) ranking the results list, (ii) classifying product results into relevance categories, and (iii) identifying substitute products for a given query. We anticipate that this data will become the gold standard for future research in the topic of product search.
翻译:提高搜索结果的质量可以大大提高用户的经验和与搜索引擎的接触。尽管最近在机器学习和数据挖掘领域取得了一些进展,但正确分类特定用户搜索查询的项目是一项长期挑战,仍然有很大的改进空间。本文介绍“购物查询数据集”,这是一套庞大的关于亚马逊地区困难搜索查询和结果的数据集,公开发布,目的是促进提高搜索结果质量的研究。数据集包含大约13万个独特的查询和260万个人工标签(query, product)相关判断。数据集有英文、日文和西班牙文的查询。“购物查询数据集”在KDDCup'22的一项挑战中正在使用。在本文件中,我们描述了数据集,并提出了三项评价任务以及基线结果:(一) 排列结果清单,(二) 将产品结果分类为相关分类,(三) 确定特定查询的替代产品。我们预计,这一数据将成为今后产品搜索主题研究的黄金标准。