For the past few years, we used Apache Lucene as recommendation frame-work in our scholarly-literature recommender system of the reference-management software Docear. In this paper, we share three lessons learned from our work with Lucene. First, recommendations with relevance scores below 0.025 tend to have significantly lower click-through rates than recommendations with relevance scores above 0.025. Second, by picking ten recommendations randomly from Lucene's top50 search results, click-through rate decreased by 15%, compared to recommending the top10 results. Third, the number of returned search results tend to predict how high click-through rates will be: when Lucene returns less than 1,000 search results, click-through rates tend to be around half as high as if 1,000+ results are returned.
翻译:在过去几年里,我们用阿帕奇·卢塞内作为参考管理软件Docear的学术-文学推荐系统的建议框架。在本文中,我们分享了与卢塞内合作的三项经验教训。首先,相关评分低于0.025的建议的点击率大大低于相关评分高于0.025的建议。第二,通过随机从卢塞恩的顶部50次搜索结果中挑选十项建议,点击通率比建议顶部10次结果减少了15%。第三,返回的搜索结果数往往预测点击率会有多高:当卢塞内返回不到1,000次搜索结果时,点击通率往往比返回1,000+的结果高出一半左右。