Automatic recommendation systems based on deep neural networks have become extremely popular during the last decade. Some of these systems can however be used for applications which are ranked as High Risk by the European Commission in the A.I. act, as for instance for online job candidate recommendation. When used in the European Union, commercial AI systems for this purpose will then be required to have to proper statistical properties with regard to potential discrimination they could engender. This motivated our contribution, where we present a novel optimal transport strategy to mitigate undesirable algorithmic biases in multi-class neural-network classification. Our stratey is model agnostic and can be used on any multi-class classification neural-network model. To anticipate the certification of recommendation systems using textual data, we then used it on the Bios dataset, for which the learning task consists in predicting the occupation of female and male individuals, based on their LinkedIn biography. Results show that it can reduce undesired algorithmic biases in this context to lower levels than a standard strategy.
翻译:在过去十年中,基于深神经网络的自动推荐系统变得极为流行。但是,其中一些系统可以用于欧盟委员会在A.I.中列为高风险的应用程序,例如在线候选职位建议。在欧洲联盟使用时,为此目的的商业AI系统必须具备适当的统计属性,以了解它们可能产生的潜在歧视。这促使我们作出贡献,我们提出了一种新的最佳运输战略,以在多级神经网络分类中减少不良的逻辑偏差。我们的策略是模范的不可知性,可以用于任何多级分类神经网络模型。为预测使用文本数据的建议系统的认证,我们随后在生物数据集中使用了它,为此的学习任务包括预测男女个体的占用情况,基于他们的联结自传。结果显示,它能够将这一背景下的不理想的算法偏差降低到低于标准战略的水平。</s>