Search engines play an essential role in our daily lives. Nonetheless, they are also very crucial in enterprise domain to access documents from various information sources. Since traditional search systems index the documents mainly by looking at the frequency of the occurring words in these documents, they are barely able to support natural language search, but rather keyword search. It seems that keyword based search will not be sufficient for enterprise data which is growing extremely fast. Thus, enterprise search becomes increasingly critical in corporate domain. In this report, we present an overview of the state-of-the-art technologies in literature for three main purposes: i) to increase the retrieval performance of a search engine, ii) to deploy a search platform to a cloud environment, and iii) to select the best terms in expanding queries for achieving even a higher retrieval performance as well as to provide good query suggestions to its users for a better user experience.
翻译:搜索引擎在我们的日常生活中发挥着必不可少的作用。然而,在企业领域,搜索引擎对于从各种信息来源获取文件也非常重要。由于传统的搜索系统主要通过查看这些文件中出现的单词的频率来对文件进行索引,因此它们几乎无法支持自然语言搜索,而是支持关键词搜索。关键词搜索似乎不足以满足正在迅速增长的企业数据。因此,企业搜索在公司领域变得日益重要。我们在本报告中概述了文献中最先进的技术,主要有三个目的:(一) 提高搜索引擎的检索性能;(二) 将搜索平台部署到云层环境,以及(三) 选择扩大查询的最佳术语,以达到更高的检索性能,并向用户提供良好的查询建议,以便获得更好的用户经验。