Recommendation algorithms are susceptible to popularity bias: a tendency to recommend popular items even when they fail to meet user needs. A related issue is that the recommendation quality can vary by demographic groups. Marginalized groups or groups that are under-represented in the training data may receive less relevant recommendations from these algorithms compared to others. In a recent study, Ekstrand et al. investigate how recommender performance varies according to popularity and demographics, and find statistically significant differences in recommendation utility between binary genders on two datasets, and significant effects based on age on one dataset. Here we reproduce those results and extend them with additional analyses. We find statistically significant differences in recommender performance by both age and gender. We observe that recommendation utility steadily degrades for older users, and is lower for women than men. We also find that the utility is higher for users from countries with more representation in the dataset. In addition, we find that total usage and the popularity of consumed content are strong predictors of recommender performance and also vary significantly across demographic groups.
翻译:推荐算法容易受到流行偏差的影响:倾向于推荐受欢迎的项目,即使它们未能满足用户的需要。一个相关的问题是建议的质量可能因人口群体而不同。在培训数据中代表性不足的边缘化群体或群体从这些算法中可能得到的相关建议与其他算法相比不那么相关。在最近的一项研究中,Ekstrand等人调查了推荐人业绩如何因受欢迎程度和人口统计而异,发现两个数据集的二进制性别之间在推荐效用上存在统计上的重大差异,以及基于一个数据集的年龄产生的重大影响。我们在此转载了这些结果,并进行了更多的分析。我们在统计学上发现,推荐人业绩在年龄和性别两方面都存在显著差异。我们发现,建议效用对老年用户来说持续下降,而女性比男子低。我们还发现,对于在数据集中代表性较多的国家的用户来说,其效用更高。此外,我们发现,消费内容的总使用率和受欢迎程度是推荐人业绩的有力预测,而且各人口群体之间也有很大差异。