Deep learning-based recommender systems (DRSs) are increasingly and widely deployed in the industry, which brings significant convenience to people's daily life in different ways. However, recommender systems are also shown to suffer from multiple issues,e.g., the echo chamber and the Matthew effect, of which the notation of "fairness" plays a core role.While many fairness notations and corresponding fairness testing approaches have been developed for traditional deep classification models, they are essentially hardly applicable to DRSs. One major difficulty is that there still lacks a systematic understanding and mapping between the existing fairness notations and the diverse testing requirements for deep recommender systems, not to mention further testing or debugging activities. To address the gap, we propose FairRec, a unified framework that supports fairness testing of DRSs from multiple customized perspectives, e.g., model utility, item diversity, item popularity, etc. We also propose a novel, efficient search-based testing approach to tackle the new challenge, i.e., double-ended discrete particle swarm optimization (DPSO) algorithm, to effectively search for hidden fairness issues in the form of certain disadvantaged groups from a vast number of candidate groups. Given the testing report, by adopting a simple re-ranking mitigation strategy on these identified disadvantaged groups, we show that the fairness of DRSs can be significantly improved. We conducted extensive experiments on multiple industry-level DRSs adopted by leading companies. The results confirm that FairRec is effective and efficient in identifying the deeply hidden fairness issues, e.g., achieving 95% testing accuracy with half to 1/8 time.
翻译:深度学习推荐系统(DRS)越来越广泛地应用于工业领域,从不同方面为人们的日常生活带来了极大的便利。然而,推荐系统也被证明存在多种问题,例如信息壁垒和马太效应,其中“公平性”概念起着核心作用。虽然针对传统的深度分类模型已经开发了许多公平性概念和相应的公平性测试方法,但它们基本上很难应用于 DRS。其中一个主要困难是目前还缺乏对现有公平性概念与深度推荐系统的各种测试要求之间的系统性理解和映射,更不用说进一步的测试或调试活动了。为了解决这个差距,我们提出了 FairRec,这是一个统一的框架,支持从多个定制化的角度,例如模型实用性、物品多样性、物品流行度等,测试 DRS 的公平性。我们还提出了一种新的、高效的搜索测试方法来应对新的挑战,即双端离散粒子群优化(DPSO)算法,以从大量的候选组中有效地搜索出某些弱势群体的隐藏的公平性问题。在给出测试报告后,通过对这些发现的弱势群体采取简单的重新排名缓解策略,我们展示了 DRS 的公平性可以显著改善。我们在多个领先公司采用的多个工业级 DRS 进行了广泛的实验。结果证实,FairRec 有效且高效地识别出深度隐藏的公平性问题,例如实现95%的测试准确性,时间缩短了一半到1/8。