Recommender systems for software engineering (RSSEs) assist software engineers in dealing with a growing information overload when discerning alternative development solutions. While RSSEs are becoming more and more effective in suggesting handy recommendations, they tend to suffer from popularity bias, i.e., favoring items that are relevant mainly because several developers are using them. While this rewards artifacts that are likely more reliable and well-documented, it would also mean that missing artifacts are rarely used because they are very specific or more recent. This paper studies popularity bias in Third-Party Library (TPL) RSSEs. First, we investigate whether state-of-the-art research in RSSEs has already tackled the issue of popularity bias. Then, we quantitatively assess four existing TPL RSSEs, exploring their capability to deal with the recommendation of popular items. Finally, we propose a mechanism to defuse popularity bias in the recommendation list. The empirical study reveals that the issue of dealing with popularity in TPL RSSEs has not received adequate attention from the software engineering community. Among the surveyed work, only one starts investigating the issue, albeit getting a low prediction performance.
翻译:软件工程领域中的推荐系统 (RSSEs) 帮助软件工程师在辨别不同的开发解决方案时处理日益增长的信息过载。虽然 RSSEs 正在变得越来越有效地提供方便的建议,但它们往往受到流行偏差的影响,即偏向于那些主要因为许多开发人员在使用而相关的项目。虽然这样可以奖励更可靠、文档更完整的工件,但也意味着很少有人使用那些非常特定或最近的工件。本文研究了第三方库 (TPL) RSSEs 中的流行偏差问题。首先,我们调查现有的 RSSEs 研究是否已经解决了流行偏差的问题。然后,我们定量评估了四个现有的 TPL RSSEs,探索它们处理推荐热门项目的能力。最后,我们提出了一种机制来减轻推荐列表中的流行偏差问题。经验研究表明,TPL RSSE 中如何处理流行度的问题还未引起软件工程社区的足够重视。在调查的工作中,只有一个开始研究这个问题,但得到了较低的预测性能。