Query suggestion refers to the task of suggesting relevant and related queries to a search engine user to help in query formulation process and to expedite information retrieval with minimum amount of effort. It is highly useful in situations where the search requirements are not well understood and hence it has been widely adopted by search engines to guide users' search activity. For news websites, user queries have a time sensitive nature inherent in them. When some new event happens, there is a sudden burst in queries related to that event and such queries are sustained over a period of time before fading away with that event. In addition to this temporal aspect of search queries fired at news websites, they have an addition distinct quality, i.e., they are intended to get event related information majority of the times. Existing work on generating query suggestions involves analyzing query logs to suggest queries which are relevant and related to the search intent of the user. But in case of news websites, when there is a sudden burst in information related to a particular event, there are not enough search queries fired by other users which leads to lack of click data, and hence giving query suggestions related to some old event or even some irrelevant suggestions altogether. Another problem with query logs in the context of online news is that, they mostly contain queries related to popular events and hence fail to capture less popular events or events which got overshadowed by some other more sensational event. We propose a novel approach to generate event-centric query suggestions using metadata of news articles published by news media. We compared our proposed framework with existing state of the art query suggestion mechanisms provided by Google News, Bing News, Google Search and Bing Search on various parameters.
翻译:查询建议指的是向搜索引擎用户提出相关和相关询问的任务,以帮助查询制定过程,并以最低努力量加快信息检索;在搜索要求不完全被理解,因此被搜索引擎广泛采用以指导用户搜索活动的情况下非常有用;对于新闻网站来说,用户询问具有固有的时间敏感性。当发生一些新事件时,与该事件有关的询问突然爆发,这类询问在逐渐消失之前的一段时间里持续。除了在新闻网站上进行的时间性搜索查询的参数外,它们还具有不同的质量,即它们旨在获取与事件相关的大多数时间的信息;在生成查询建议的现有工作包括分析查询日志,以提出与用户搜索意图相关和相关的询问。但是,在新闻网站出现与某一事件有关的信息突然爆发时,其他用户提出的搜索查询方式不够,导致缺少点击数据,从而对一些旧事件提出查询建议,或者甚至一些不相关的评论,因此,在网上新闻查询时,我们提出的另一个问题是,通过对B事件进行更低的图像,因此,对事件进行更低的在线查询。