We opensource under CC BY 4.0 license Lib-SibGMU - a university library circulation dataset - for a wide research community, and benchmark major algorithms for recommender systems on this dataset. For a recommender architecture that consists of a vectorizer that turns the history of the books borrowed into a vector, and a neighborhood-based recommender, trained separately, we show that using the fastText model as a vectorizer delivers competitive results.
翻译:我们根据CC BY 4.0 许可为广泛的研究界开发Lib-SibGMU(一所大学图书馆循环数据集),并为该数据集的推荐者系统设定主要算法基准。 对于由将借书历史转换为向量的矢量化器组成的推荐者架构,以及一个单独培训的以邻里为基础的推荐者,我们显示使用快速图文模型作为向量化器可以带来竞争性的结果。