We present in this paper a novel approach for as-you-type top-$k$ keyword search over social media. We adopt a natural "network-aware" interpretation for information relevance, by which information produced by users who are closer to the seeker is considered more relevant. In practice, this query model poses new challenges for effectiveness and efficiency in online search, even when a complete query is given as input in one keystroke. This is mainly because it requires a joint exploration of the social space and classic IR indexes such as inverted lists. We describe a memory-efficient and incremental prefix-based retrieval algorithm, which also exhibits an anytime behavior, allowing to output the most likely answer within any chosen running-time limit. We evaluate it through extensive experiments for several applications and search scenarios, including searching for posts in micro-blogging (Twitter and Tumblr), as well as searching for businesses based on reviews in Yelp. They show that our solution is effective in answering real-time as-you-type searches over social media.
翻译:在本文中,我们提出了一个针对社交媒体的“you type” 顶值-k$k$关键字搜索的新颖方法。 我们对信息相关性采用了自然的“网络认知”解释,认为与搜索者关系较近的用户所提供的信息更为相关。在实践中,这种查询模式对在线搜索的实效和效率提出了新的挑战,即使一个键盘输入了一个完整的查询作为输入。这主要是因为它需要共同探索社会空间和典型的IR指数,如倒转列表。我们描述了一个记忆高效的和递增的前缀检索算法,它也显示了一种随时的动作,允许在任何选定的运行时限内输出最有可能的答案。我们通过对一些应用和搜索情景的广泛实验,包括搜索微博中(Twitter和Tumblr)的职位,以及根据Yelp的评述寻找企业。它们表明,我们的解决方案在实时对社交媒体进行“y-y-ty”搜索方面是有效的。