利用用户行为历史来个人化电子邮件搜索 (Leveraging User Behavior History for Personalized Email Search)

An effective email search engine can facilitate users' search tasks and improve their communication efficiency. Users could have varied preferences on various ranking signals of an email, such as relevance and recency based on their tasks at hand and even their jobs. Thus a uniform matching pattern is not optimal for all users. Instead, an effective email ranker should conduct personalized ranking by taking users' characteristics into account. Existing studies have explored user characteristics from various angles to make email search results personalized. However, little attention has been given to users' search history for characterizing users. Although users' historical behaviors have been shown to be beneficial as context in Web search, their effect in email search has not been studied and remains unknown. Given these observations, we propose to leverage user search history as query context to characterize users and build a context-aware ranking model for email search. In contrast to previous context-dependent ranking techniques that are based on raw texts, we use ranking features in the search history. This frees us from potential privacy leakage while giving a better generalization power to unseen users. Accordingly, we propose a context-dependent neural ranking model (CNRM) that encodes the ranking features in users' search history as query context and show that it can significantly outperform the baseline neural model without using the context. We also investigate the benefit of the query context vectors obtained from CNRM on the state-of-the-art learning-to-rank model LambdaMart by clustering the vectors and incorporating the cluster information. Experimental results show that significantly better results can be achieved on LambdaMart as well, indicating that the query clusters can characterize different users and effectively turn the ranking model personalized.

翻译：有效的电子邮件搜索引擎可以方便用户的搜索任务并提高其通信效率。用户可以对电子邮件的各种排名信号有不同的偏好, 例如根据他们手头的任务甚至工作, 其相关性和耐久性等, 用户可能会对电子邮件的不同级别信号有不同的偏好。因此, 对所有用户来说, 统一匹配模式并不是最佳的。相反, 有效的电子邮件排名器应该通过考虑用户的特性进行个性化排序。现有的研究已经从不同的角度探索用户的特性, 使电子邮件搜索结果个人化。但是, 用户的搜索历史历史记录很少受到关注, 用户的搜索历史记录在网络搜索中被证明是有用的, 但他们在网络搜索中并没有研究, 电子邮件搜索的效果仍然未知。与以前基于原始文本的背景排序技术相比, 我们使用搜索历史的特征。这使我们从潜在的隐私模式渗漏中解析出, 同时给隐蔽用户提供更好的一般化能力。因此, 我们提议根据环境对内脏模型排序模型( CNRMRM) 模型, 将内部的排序结果有效地转换到用户的排序, 将排序结果显示为历史检索状态。我们通过通过数据库的上下行的排序, 能够对内流进行深入的排序进行深入的学习, 显示, 。